Framework VersionModelUsagePrecisionThroughputPerf/WattLatency(ms)Batch size
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 1024/128Natural Language Processingavx_fp3251.33 tokens/s  1
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 2016/32Natural Language Processingavx_fp3249.95 tokens/s  1
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 1024/128Natural Language Processingavx_fp32405.71 tokens/s  30
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 2016/32Natural Language Processingavx_fp32351.60 tokens/s  30
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 1024/128Natural Language Processingamx_int8163.14 token/s  1
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 2016/32Natural Language Processingamx_int8150.00 tokens/s  1
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 1024/128Natural Language Processingamx_int8981.52 token/s  30
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 2016/32Natural Language Processingamx_int8686.74 tokens/s  30
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 1024/128Natural Language Processingamx_bf1699.17 tokens/s  1
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 2016/32Natural Language Processingamx_bf1693.67 tokens/s  1
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 1024/128Natural Language Processingamx_bf16787.75 tokens/s  30
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 2016/32Natural Language Processingamx_bf16587.60 tokens/s  30
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 1024/128Natural Language Processingamx_fp16101.69 tokens/s  1
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 2016/32Natural Language Processingamx_fp1697.47 tokens/s  1
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 1024/128Natural Language Processingamx_fp16964.57 tokens/s  30
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 2016/32Natural Language Processingamx_fp16765.42 tokens/s  30
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 1024/128Natural Language Processingamx_bf3251.38 tokens/s  1
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 2016/32Natural Language Processingamx_bf3250.02 tokens/s  1
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 1024/128Natural Language Processingamx_bf32576.72 tokens/s  30
Intel PyTorch 2.6.0+IPEX Inf LLMsChatGLM3-6B Token Size 2016/32Natural Language Processingamx_bf32466.59 tokens/s  30
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 2016/32Natural Language Processingfp3252.37 tokens/s  1
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 1024/128Natural Language Processingfp3251.04 tokens/s  1
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 2016/32Natural Language Processingfp32351.10 tokens/s  16
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 1024/128Natural Language Processingfp32272.81 tokens/s  32
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 2016/32Natural Language Processingamx_int8162.54 tokens/s  1
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 1024/128Natural Language Processingamx_int8150.34 tokens/s  1
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 2016/32Natural Language Processingamx_int8962.09 tokens/s  16
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 1024/128Natural Language Processingamx_int8542.06 tokens/s  32
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 2016/32Natural Language Processingamx_bf1693.69 tokens/s  1
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 1024/128Natural Language Processingamx_bf1688.85 tokens/s  1
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 2016/32Natural Language Processingamx_bf16830.94 tokens/s  16
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 1024/128Natural Language Processingamx_bf16480.82 tokens/s  32
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 2016/32Natural Language Processingamx_fp16100.33 tokens/s  1
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 1024/128Natural Language Processingamx_fp1695.44 tokens/s  1
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 2016/32Natural Language Processingamx_fp16771.99 tokens/s  32
OpenVINO 2024.4.0 Inf LLMChatGLM3-6B Token Size 1024/128Natural Language Processingamx_fp16551.93 tokens/s  64
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 1024/128Natural Language Processingavx_fp3252.07 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 2016/32Natural Language Processingavx_fp3250.30 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 1024/128Natural Language Processingavx_fp32282.47 tokens/s  17
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 2016/32Natural Language Processingavx_fp32237.34 tokens/s  17
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 1024/128Natural Language Processingamx_int8158.13 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 2016/32Natural Language Processingamx_int8146.94 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 1024/128Natural Language Processingamx_int8765.28 tokens/s  15
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 2016/32Natural Language Processingamx_int8590.43 tokens/s  25
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 1024/128Natural Language Processingamx_bf1699.54 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 2016/32Natural Language Processingamx_bf1693.75 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 1024/128Natural Language Processingamx_bf16673.26 tokens/s  29
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 2016/32Natural Language Processingamx_bf16512.50 tokens/s  31
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 1024/128Natural Language Processingamx_fp16101.03 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 2016/32Natural Language Processingamx_fp1697.47 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 1024/128Natural Language Processingamx_fp16687.17 tokens/s  24
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 2016/32Natural Language Processingamx_fp16559.85 tokens/s  22
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 1024/128Natural Language Processingamx_bf3252.20 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 2016/32Natural Language Processingamx_bf3250.40 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 1024/128Natural Language Processingamx_bf32360.43 tokens/s  25
Intel PyTorch 2.6.0+ IPEX Inf LLMsGPT-J-6B Token Size 2016/32Natural Language Processingamx_bf32260.74 tokens/s  17
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 2016/32Natural Language Processingavx_fp3253.78 tokens/s  1
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 1024/128Natural Language Processingavx_fp3251.81 tokens/s  1
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 2016/32Natural Language Processingavx_fp32281.41 tokens/s  4
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 1024/128Natural Language Processingavx_fp32136.37 tokens/s  32
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 2016/32Natural Language Processingamx_int8168.86 tokens/s  1
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 1024/128Natural Language Processingamx_int8152.81 tokens/s  1
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 2016/32Natural Language Processingamx_int8746.81 tokens/s  16
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 1024/128Natural Language Processingamx_int8480.53 tokens/s  32
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 2016/32Natural Language Processingamx_bf1696.83 tokens/s  1
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 1024/128Natural Language Processingamx_bf1692.52 tokens/s  1
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 2016/32Natural Language Processingamx_bf16657.46 tokens/s  16
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 1024/128Natural Language Processingamx_bf16441.71 tokens/s  32
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 2016/32Natural Language Processingamx_fp1699.86 tokens/s  1
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 1024/128Natural Language Processingamx_fp1694.62 tokens/s  1
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 2016/32Natural Language Processingamx_fp16558.09 tokens/s  8
OpenVINO 2024.4.0 Inf LLMGPT-J-6B Token Size 1024/128Natural Language Processingamx_fp16347.37 tokens/s  16
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 1024/128Natural Language Processingavx_fp3224.18 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 2016/32Natural Language Processingavx_fp3223.37 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 1024/128Natural Language Processingavx_fp32122.01 tokens/s  10
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 2016/32Natural Language Processingavx_fp3286.45 tokens/s  6
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 1024/128Natural Language Processingamx_int882.13 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 2016/32Natural Language Processingamx_int875.69 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 1024/128Natural Language Processingamx_int8436.95 tokens/s  15
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 2016/32Natural Language Processingamx_int8283.91 tokens/s  15
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 1024/128Natural Language Processingamx_bf1647.18 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 2016/32Natural Language Processingamx_bf1644.70 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 1024/128Natural Language Processingamx_bf16367.51 tokens/s  30
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 2016/32Natural Language Processingamx_bf16245.02 tokens/s  18
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 1024/128Natural Language Processingamx_fp1648.45 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 2016/32Natural Language Processingamx_fp1646.06 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 1024/128Natural Language Processingamx_fp16380.90 tokens/s  24
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 2016/32Natural Language Processingamx_fp16288.17 tokens/s  18
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 1024/128Natural Language Processingamx_bf3224.20 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 2016/32Natural Language Processingamx_bf3223.39 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 1024/128Natural Language Processingamx_bf32134.45 tokens/s  10
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-13B Token size 2016/32Natural Language Processingamx_bf3289.23 tokens/s  6
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 2016/32Natural Language Processingavx_fp3223.40 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 1024/128Natural Language Processingavx_fp3222.78 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 2016/32Natural Language Processingavx_fp32132.09 tokens/s  8
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 1024/128Natural Language Processingavx_fp32103.81 tokens/s  16
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 2016/32Natural Language Processingamx_int880.59 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 1024/128Natural Language Processingamx_int872.12 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 2016/32Natural Language Processingamx_int8477.90 tokens/s  8
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 1024/128Natural Language Processingamx_int8266.37 tokens/s  16
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 2016/32Natural Language Processingamx_bf1647.34 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 1024/128Natural Language Processingamx_bf1644.76 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 2016/32Natural Language Processingamx_bf16387.62 tokens/s  8
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 1024/128Natural Language Processingamx_bf16231.78 tokens/s  16
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 2016/32Natural Language Processingamx_bf3247.60 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 1024/128Natural Language Processingamx_bf3244.36 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 2016/32Natural Language Processingamx_bf32395.06 tokens/s  8
OpenVINO 2024.4.0 Inf LLMLLaMA2-13B Token size 1024/128Natural Language Processingamx_bf32226.38 tokens/s  16
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 1024/128Natural Language Processingavx_fp3245.23 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 2016/32Natural Language Processingavx_fp3243.55 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 1024/128Natural Language Processingavx_fp32268.14 tokens/s  23
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 2016/32Natural Language Processingavx_fp32201.32 tokens/s  17
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 1024/128Natural Language Processingamx_int8143.24 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 2016/32Natural Language Processingamx_int8132.19 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 1024/128Natural Language Processingamx_int8660.96 tokens/s  15
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 2016/32Natural Language Processingamx_int8430.51 tokens/s  15
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 1024/128Natural Language Processingamx_bf1686.96 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 2016/32Natural Language Processingamx_bf1682.07 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 1024/128Natural Language Processingamx_bf16604.93 tokens/s  25
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 2016/32Natural Language Processingamx_bf16432.21 tokens/s  25
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 1024/128Natural Language Processingamx_fp1688.89 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 2016/32Natural Language Processingamx_fp1684.11 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 1024/128Natural Language Processingamx_fp16638.30 tokens/s  25
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 2016/32Natural Language Processingamx_fp16470.77 tokens/s  22
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 1024/128Natural Language Processingamx_bf3245.26 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 2016/32Natural Language Processingamx_bf3243.51 tokens/s  1
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 1024/128Natural Language Processingamx_bf32307.81 tokens/s  23
Intel PyTorch 2.6.0+ IPEX Inf LLMsLLaMA2-7B Token size 2016/32Natural Language Processingamx_bf32224.92 tokens/s  17
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 2016/32Natural Language Processingavx_fp3245.94 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 1024/128Natural Language Processingavx_fp3244.23 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 2016/32Natural Language Processingavx_fp32262.65 tokens/s  16
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 1024/128Natural Language Processingavx_fp32190.16 tokens/s  32
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 2016/32Natural Language Processingamx_int8138.41 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 1024/128Natural Language Processingamx_int8126.22 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 2016/32Natural Language Processingamx_int8703.42 tokens/s  16
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 1024/128Natural Language Processingamx_int8481.97 tokens/s  32
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 2016/32Natural Language Processingamx_bf1684.21 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 1024/128Natural Language Processingamx_bf1679.43 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 2016/32Natural Language Processingamx_bf16618.63 tokens/s  16
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 1024/128Natural Language Processingamx_bf16424.50 tokens/s  32
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 2016/32Natural Language Processingamx_bf3285.19 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 1024/128Natural Language Processingamx_bf3280.02 tokens/s  1
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 2016/32Natural Language Processingamx_bf32613.47 tokens/s  16
OpenVINO 2024.4.0 Inf LLMLLaMA2-7B Token size 1024/128Natural Language Processingamx_bf32439.74 tokens/s  32
OpenVINO 2024.4.0Stable-DiffusionImage Generationfp320.09 samp/s  1
OpenVINO 2024.4.0Stable-DiffusionImage Generationamx_int80.25 samp/s  1
OpenVINO 2024.4.0Stable-DiffusionImage Generationamx_bf160.25 samp/s  1
OpenVINO 2024.4.0Stable-DiffusionImage Generationamx_fp160.25 samp/s  1
OpenVINO 2024.4.0BERTLargeNatural Language Processingfp32121.65 sent/s  1
OpenVINO 2024.4.0BERTLargeNatural Language Processingfp32113.96 sent/s  16
OpenVINO 2024.4.0BERTLargeNatural Language Processingamx_int8733.89 sent/s  1
OpenVINO 2024.4.0BERTLargeNatural Language Processingamx_int8761.71 sent/s  32
OpenVINO 2024.4.0BERTLargeNatural Language Processingamx_bf16456.85 sent/s  1
OpenVINO 2024.4.0BERTLargeNatural Language Processingamx_bf16462.06 sent/s  16
OpenVINO 2024.4.0BERTLargeNatural Language Processingamx_fp16457.07 sent/s  1
OpenVINO 2024.4.0BERTLargeNatural Language Processingamx_fp16407.68 sent/s  16
Intel PyTorch 2.6.0 + IPEXBERT LargeNatural Language Processingavx_fp32113.44 sent/s  1
Intel PyTorch 2.6.0 + IPEXBERT LargeNatural Language Processingavx_fp32109.88 sent/s  40
Intel PyTorch 2.6.0 + IPEXBERT LargeNatural Language Processingamx_int8829.34 sent/s  1
Intel PyTorch 2.6.0 + IPEXBERT LargeNatural Language Processingamx_int81030.82 sent/s  64
Intel PyTorch 2.6.0 + IPEXBERT LargeNatural Language Processingamx_bf16473.94 sent/s  1
Intel PyTorch 2.6.0 + IPEXBERT LargeNatural Language Processingamx_bf16554.99 sent/s  32
Intel PyTorch 2.6.0 + IPEXBERT LargeNatural Language Processingamx_fp16441.32 sent/s  1
Intel PyTorch 2.6.0 + IPEXBERT LargeNatural Language Processingamx_fp16460.63 sent/s  88
Intel PyTorch 2.6.0 + IPEXBERT LargeNatural Language Processingamx_bf32212.01 sent/s  1
Intel PyTorch 2.6.0 + IPEXBERT LargeNatural Language Processingamx_bf32212.02 sent/s  88
Intel Tensor Flow 2.19.0BERT LargeNatural Language Processingfp32108.62 sent/s  1
Intel Tensor Flow 2.19.0BERT LargeNatural Language Processingfp3299.07 sent/s  32
Intel Tensor Flow 2.19.0BERT LargeNatural Language Processingamx_int8484.98 sent/s  1
Intel Tensor Flow 2.19.0BERT LargeNatural Language Processingamx_int8569.70 sent/s  16
Intel Tensor Flow 2.19.0BERT LargeNatural Language Processingamx_bf16403.19 sent/s  1
Intel Tensor Flow 2.19.0BERT LargeNatural Language Processingamx_bf16438.23 sent/s  32
Intel Tensor Flow 2.19.0BERT LargeNatural Language Processingamx_fp16405.00 sent/s  1
Intel Tensor Flow 2.19.0BERT LargeNatural Language Processingamx_fp16432.31 sent/s  32
Intel Tensor Flow 2.19.0BERT LargeNatural Language Processingamx_bf32202.29 sent/s  1
Intel Tensor Flow 2.19.0BERT LargeNatural Language Processingamx_bf32190.89 sent/s  32
Intel PyTorch 2.6.0 + IPEXDLRM-v2Recommenderavx_fp32844726.49 rec/s  128
Intel PyTorch 2.6.0 + IPEXDLRM-v2Recommenderamx_int86,676,543.49 rec/s  128
Intel PyTorch 2.6.0 + IPEXDLRM-v2Recommenderamx_bf164,481,704.53 rec/s  128
Intel PyTorch 2.6.0 + IPEXDLRM-v2Recommenderamx_fp164,321,739.37 rec/s  128
Intel PyTorch 2.6.0 + IPEXDLRM-v2Recommenderamx_bf321.588,266.49 rec/s  128
Intel PyTorch 2.6.0 + IPEXStable-DiffusionImage Generationavx_fp320.12 img/s  1
Intel PyTorch 2.6.0 + IPEXStable-DiffusionImage Generationamx_int80.41 img/s  1
Intel PyTorch 2.6.0 + IPEXStable-DiffusionImage Generationamx_bf160.35 img/s  1
Intel PyTorch 2.6.0 + IPEXStable-DiffusionImage Generationamx_fp160.37 img/s  1
Intel PyTorch 2.6.0 + IPEXStable-DiffusionImage Generationamx_bf320.15 img/s  1
Intel PyTorch 2.6.0 + IPEXVision-TransformerImage Recognitionavx_fp32779.13 fps  1
Intel PyTorch 2.6.0 + IPEXVision-TransformerImage Recognitionavx_fp32807.66 fps  160
Intel PyTorch 2.6.0 + IPEXVision-TransformerImage Recognitionamx_int84490.99 fps  1
Intel PyTorch 2.6.0 + IPEXVision-TransformerImage Recognitionamx_int86277.16 fps  94
Intel PyTorch 2.6.0 + IPEXVision-TransformerImage Recognitionamx_bf162624.42 fps  1
Intel PyTorch 2.6.0 + IPEXVision-TransformerImage Recognitionamx_bf163570.05 fps  96
Intel PyTorch 2.6.0 + IPEXVision-TransformerImage Recognitionamx_fp162558.35 fps  1
Intel PyTorch 2.6.0 + IPEXVision-TransformerImage Recognitionamx_fp163442.89 fps  256
Intel PyTorch 2.6.0 + IPEXVision-TransformerImage Recognitionamx_bf321352.02 fps  1
Intel PyTorch 2.6.0 + IPEXVision-TransformerImage Recognitionamx_bf321572.22 fps  256
Intel Tensor Flow 2.19.0Vision-TransformerImage Recognitionfp32744.63 fps  1
Intel Tensor Flow 2.19.0Vision-TransformerImage Recognitionfp32771.33 fps  252
Intel Tensor Flow 2.19.0Vision-TransformerImage Recognitionamx_int82876.37 fps  1
Intel Tensor Flow 2.19.0Vision-TransformerImage Recognitionamx_int84085.75 fps  252
Intel Tensor Flow 2.19.0Vision-TransformerImage Recognitionamx_bf162332.85 fps  1
Intel Tensor Flow 2.19.0Vision-TransformerImage Recognitionamx_bf163143.31 fps  159
Intel Tensor Flow 2.19.0Vision-TransformerImage Recognitionamx_fp162379.54 fps  1
Intel Tensor Flow 2.19.0Vision-TransformerImage Recognitionamx_fp163058.30 fps  159
Intel Tensor Flow 2.19.0Vision-TransformerImage Recognitionamx_bf321641.15 fps  1
Intel Tensor Flow 2.19.0Vision-TransformerImage Recognitionamx_bf321891.57 fps  239
OpenVINO 2024.4.0Vision-TransformerImage Recognitionfp32812.05 fps  1
OpenVINO 2024.4.0Vision-TransformerImage Recognitionfp32847.73 fps  32
OpenVINO 2024.4.0Vision-TransformerImage Recognitionamx_int83997.38 fps  1
OpenVINO 2024.4.0Vision-TransformerImage Recognitionamx_int84198.79 fps  32
OpenVINO 2024.4.0Vision-TransformerImage Recognitionamx_bf162406.63 fps  1
OpenVINO 2024.4.0Vision-TransformerImage Recognitionamx_bf162609.63 fps  64
OpenVINO 2024.4.0Vision-TransformerImage Recognitionamx_fp162358.47 fps  1
OpenVINO 2024.4.0Vision-TransformerImage Recognitionamx_fp162537.90 fps  64
OpenVINO 2024.4.0ResNet50-v1-5Image Classificationfp323776.73 fps  1
OpenVINO 2024.4.0ResNet50-v1-5Image Classificationfp323800.48 fps  64
OpenVINO 2024.4.0ResNet50-v1-5Image Classificationamx_int821,118.56 fps  1
OpenVINO 2024.4.0ResNet50-v1-5Image Classificationamx_int829,484 fps  64
OpenVINO 2024.4.0ResNet50-v1-5Image Classificationamx_bf1614,487.85 fps  1
OpenVINO 2024.4.0ResNet50-v1-5Image Classificationamx_bf1617,805.47 fps  32
OpenVINO 2024.4.0ResNet50-v1-5Image Classificationamx_fp1614,475.74 fps  1
OpenVINO 2024.4.0ResNet50-v1-5Image Classificationamx_fp1617,687.28 fps  32
Intel PyTorch 2.6.0 + IPEXLCMReasoning and Understandingavx_fp321.78  1
Intel PyTorch 2.6.0 + IPEXLCMReasoning and Understandingamx_int86.43  1
Intel PyTorch 2.6.0 + IPEXLCMReasoning and Understandingamx_bf164.96  1
Intel PyTorch 2.6.0 + IPEXLCMReasoning and Understandingamx_fp165.1  1
Intel PyTorch 2.6.0 + IPEXLCMReasoning and Understandingamx_bf322.07  1
OpenVINO 2024.4.0LCMReasoning and Understandingfp321.4  1
OpenVINO 2024.4.0LCMReasoning and Understandingamx_int83.6  1
OpenVINO 2024.4.0LCMReasoning and Understandingamx_bf163.7  1
OpenVINO 2024.4.0LCMReasoning and Understandingamx_fp163.58  1
Intel PyTorch 2.6.0 + IPEXYolo-v7Object Detectionavx_fp32282.23 fps  1
Intel PyTorch 2.6.0 + IPEXYolo-v7Object Detectionavx_fp32283.59 fps  21
Intel PyTorch 2.6.0 + IPEXYolo-v7Object Detectionamx_int81403.46 fps  1
Intel PyTorch 2.6.0 + IPEXYolo-v7Object Detectionamx_int81038.72 fps  10
Intel PyTorch 2.6.0 + IPEXYolo-v7Object Detectionamx_bf161058.43 fps  1
Intel PyTorch 2.6.0 + IPEXYolo-v7Object Detectionamx_bf161011.21 fps  21
Intel PyTorch 2.6.0 + IPEXYolo-v7Object Detectionamx_fp16994.78 fps  1
Intel PyTorch 2.6.0 + IPEXYolo-v7Object Detectionamx_fp16961.03 fps  21
Intel PyTorch 2.6.0 + IPEXYolo-v7Object Detectionamx_bf32400.53 fps  1
Intel PyTorch 2.6.0 + IPEXYolo-v7Object Detectionamx_bf32376.71 fps  21
Intel Tensor Flow 2.19.0Yolo-v5Object Detectionfp321415.87 img/s  1
Intel Tensor Flow 2.19.0Yolo-v5Object Detectionfp321509.53 img/s  94
Intel Tensor Flow 2.19.0Yolo-v5Object Detectionamx_bf162726.87 img/s  1
Intel Tensor Flow 2.19.0Yolo-v5Object Detectionamx_bf163986.90 img/s  94
Intel Tensor Flow 2.19.0Yolo-v5Object Detectionamx_fp162882.10 img/s  1
Intel Tensor Flow 2.19.0Yolo-v5Object Detectionamx_fp164199.40 img/s  84
Intel Tensor Flow 2.19.0Yolo-v5Object Detectionamx_bf321587.33 img/s  1
Intel Tensor Flow 2.19.0Yolo-v5Object Detectionamx_bf321879.03 img/s  94
OpenVINO 2024.4.0Yolov-5sObject Detectionfp321570.38 img/s  1
OpenVINO 2024.4.0Yolov-5sObject Detectionfp321388.07 img/s  16
OpenVINO 2024.4.0Yolov-5sObject Detectionamx_int86151.55 img/s  1
OpenVINO 2024.4.0Yolov-5sObject Detectionamx_int85170.74 img/s  16
OpenVINO 2024.4.0Yolov-5sObject Detectionamx_bf164738.79 img/s  1
OpenVINO 2024.4.0Yolov-5sObject Detectionamx_bf163825.40 img/s  16
OpenVINO 2024.4.0Yolov-5sObject Detectionamx_fp164585.38 img/s  1
OpenVINO 2024.4.0Yolov-5sObject Detectionamx_fp163505.09 img/s  16
Intel Tensor Flow 2.19.0R-GATMulti-Relational GraphsFP3215,749.10  1
Intel Tensor Flow 2.19.0R-GATMulti-Relational GraphsFP3215,927.51  2625
Intel Tensor Flow 2.19.0R-GATMulti-Relational Graphsamx_bf1631,608.73  1
Intel Tensor Flow 2.19.0R-GATMulti-Relational Graphsamx_bf1640,945.58  2625
Intel Tensor Flow 2.19.0R-GATMulti-Relational Graphsamx_fp1629,505.19  1
Intel Tensor Flow 2.19.0R-GATMulti-Relational Graphsamx_fp1632,624.98  2625
Intel Tensor Flow 2.19.0R-GATMulti-Relational Graphsamx_bf3216,486.88  1
Intel Tensor Flow 2.19.0R-GATMulti-Relational Graphsamx_bf3222,650.06  2625

Hardware and software configuration (measured March 13, 2025):

1-node, 2x Intel® Xeon® 6980P processors, 128 cores, hyperthreading on, turbo on, non-uniform memory access (NUMA) 6.

Integrated accelerators available (used): DLB 8 [0], DSA 8 [0], IAA 8 [0], QAT 8 [0].

Total memory: 1536 GB (24 x 64 GB DDR5 8800 MT/s [8800 MT/s]), BIOS BHSDCRB1.IPC.0033.D57.2406240014, microcode 0x81000290, 1x Ethernet controller I225-LM, 1x 3.5T SSDPF2KX038TZ from Intel, 1x 894.3G Micron_7450_MTFDKBG960TFR, CentOS* Stream 9, 6.6.43. TensorFlow*: 2.19.0, Intel® oneAPI Deep Neural Network Library (oneDNN): e34cb13, PyTorch*: 2.6.0.dev20241124+cpu, Intel® Extension for PyTorch*: 2.6.0+gitc5a2330, oneDNN: v3.6.2, OpenVINO™ toolkit: 2024.4.0, oneDNN: 3.5.0. Test by Intel as of March 13, 2025, 10:45:43 a.m. UTC.