Model rankings

A balanced intelligence index averages each model's per-category scores. Drill into a category for individual benchmarks, or sort by speed, price, and context.

Updated May 25, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter

Top intelligence: Sonar Reasoning Pro (95.7 index)
Top reasoning: Claude Opus 4.7 (94.2)
Top math: Grok-4 Heavy (100)
Fastest: Llama 3.3 70B Instruct (2220 tok/s)
Cheapest: Ling-2.6-flash ($0.01/M)
Longest context: Llama 4 Scout (10M)
Best open-weights: DeepSeek V3.2 Speciale (89.9 index)

Price vs. intelligence

Intelligence index vs. input price — up and to the left is better value.
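
One way to read "up and to the left" is as a Pareto frontier: a model is good value if no other model is both cheaper and smarter. The sketch below is only an illustration of that idea, not the site's method. The function name `pareto_frontier` is hypothetical, and the six (index, input price) pairs are a small sample taken from this chart.

```python
def pareto_frontier(models):
    """Return the models that no cheaper-or-equal model beats on index.

    `models` is a list of (name, index, price) tuples. When two models tie
    on both price and index, only the first one encountered is kept.
    """
    frontier = []
    best_index = float("-inf")
    # Walk from cheapest to most expensive; at equal price, higher index first.
    for name, index, price in sorted(models, key=lambda m: (m[2], -m[1])):
        if index > best_index:
            frontier.append((name, index, price))
            best_index = index
    return frontier


# Sample points from the chart: (model, intelligence index, input $/M tokens)
models = [
    ("Sonar Reasoning Pro", 95.7, 2.0),
    ("Gemini 3.5 Flash", 92.2, 1.5),
    ("Claude Opus 4.7", 90.9, 5.0),
    ("Gemini 3 Flash", 90.2, 0.5),
    ("DeepSeek-V4-Flash", 89.4, 0.1),
    ("GPT-5 Pro", 88.4, 15.0),
]

for name, index, price in pareto_frontier(models):
    print(f"{name}: {index} @ ${price}")
```

In this sample, Claude Opus 4.7 and GPT-5 Pro fall off the frontier because Sonar Reasoning Pro scores higher at a lower input price. Input price alone ignores output pricing and token usage, so this is a rough screen at best.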

[Scatter plot: intelligence index (y-axis, 20 to 100) against input price in $/M tokens (x-axis, $0 to $20), one point per model. Labeled points: Sonar Reasoning Pro (95.7 @ $2), Gemini 3 Flash (90.2 @ $0.5), DeepSeek V3.2 Speciale (89.9 @ $0.287), DeepSeek-V4-Flash (89.4 @ $0.1), Qwen3.5 397B A17B (89.3 @ $0.39), MiMo-V2-Flash (88 @ $0.1).]

Speed vs. intelligence

Intelligence index vs. output speed — up and to the right is fast and smart.

[Scatter plot: intelligence index (y-axis, 20 to 100) against output speed in tokens/s (x-axis, 0 to 400), one point per model. Labeled points: Mercury 2 (77 @ 790), gpt-oss-20b (73.6 @ 1000), Llama 4 Scout (58.9 @ 776), Llama 3.1 70B Instruct (56 @ 1204), Llama 3.3 70B Instruct (50.6 @ 2220), Llama 3.1 8B Instruct (45 @ 2047).]

# | Model | General idx | Multi-IF | LiveBench | Arena Hard | Humanity’s Last Exam | IFEval | SimpleQA | MMLU-Pro | MMLU | Context | Speed (tok/s) | In $/M
1DeepSeek V3.2 Exp91.119.897.185164K100$0.27
2Nemotron Nano 9B V290.390.3131K$0.04
3Grok 4 Fast902095852M90$0.20
4Gemini 3 Pro89.837.589.81M141$2.00
5Claude Opus 4.589.528.489.5200K58$5.00
6Gemini 3 Flash8934.7891M191$0.50
7DeepSeek-R1-052888.717.792.385131K45$0.55
8DeepSeek-V3.188.615.993.483.7164K$0.21
9Claude 3.7 Sonnet88.510.393.283.786.1200K101$3.00
10Claude Opus 4.18811.988200K120$15.00
11DeepSeek-V4-Pro87.535.987.51M30$0.44
12MiniMax M2.187.522.287.5205K92$0.29
13Claude Sonnet 4.587.517.387.51M42$3.00
14GPT-5.287.435.487.4400K73$1.75
15Claude Opus 487.311.787.388.8200K120$15.00
16Kimi-k1.587.287.287.4
17GPT-587.124.887.192.5400K100$1.25
18GPT-5.18726.587400K115$1.25
19Grok 486.64086.6256K100$3.00
20GPT-5 Codex86.525.686.5400K180$1.25
21DeepSeek V3.2 Speciale86.326.186.3164K$0.29
22DeepSeek-V3.286.222.286.2131K$0.25
23GPT-5.1-Codex8623.486400K188$1.25
24Llama 3.1 Nemotron Ultra 253B v1868.189.582.542$0.60
25GLM 4.785.625.185.6203K98$0.40
26Grok 4.1 Fast85.417.685.4$0.00
27Doubao Seed Code85.413.385.4$0.00
28o385.324.385.3200K50$2.00
29DeepSeek V3.1 Terminus85.115.285.1164K$0.27
30Cogito v2.184.91184.956$1.30
31Kimi K2 Thinking84.822.384.8262K100$0.60
32GLM-4.584.614.484.6131K85$0.60
33DeepSeek-R184.49.384.490.8128K189$0.55
34MiMo-V2-Flash84.321.184.3262K145$0.10
35Qwen3 235B A22B 250784.31584.359$0.40
36Qwen3-235B-A22B-Thinking-250784.380.618.287.884.4256K$0.30
37Gemini 2.5 Flash84.212.784.2$0.00
38Claude Sonnet 484.29.684.2881M101$3.00
39Qwen3 Max84.111.184.1262K45$0.78
40K-EXAONE83.813.183.8$0.00
41GPT-5 mini83.716.783.7400K200$0.25
42Qwen3 VL 235B A22B83.610.183.634$0.80
43Llama-3.3 Nemotron Super 49B v183.488.36.578.5$0.00
44o4-mini83.214.783.2200K115$1.10
45Qwen3 Next 80B A3B Thinking83.177.888.982.7262K$0.10
46ERNIE 5.0 Thinking8312.783$0.00
47Nova 2.0 Pro838.983149$1.30
48Hermes 4 - Llama-3.1 405B82.910.382.934$1.00
49GLM-4.682.917.282.9203K85$0.43
50Grok 3 mini Reasoning82.811.182.833$0.30
51Qwen3 32B82.874.993.88.379.8131K328$0.08
52Kimi K2 090582.56.382.590.2262K16$0.60
53Qwen3-Next-80B-A3B82.411.782.4262K147$0.50
54Qwen3 Max Thinking82.426.282.4262K45$0.78
55Kimi K282.4782.489.5131K26$0.57
56Qwen3 VL 235B A22B Instruct82.36.382.3262K51$0.20
57INTELLECT-382.212.182.2131K$0.20
58Ling-1T82.27.282.2$0.00
59GPT-5.1-Codex-Mini8216.982400K175$0.25
60MiniMax-M28212.582205K91$0.26
61Nova 2 Lite81.810.981.81M229$0.30
62EXAONE 4.0 32B81.810.581.8$0.00
63Qwen3 VL 32B81.89.681.893$0.70
64MiniMax M1 80k81.68.281.6$0.60
65Seed-OSS-36B-Instruct81.59.181.537$0.20
66Magistral Medium 1.281.59.681.542$2.00
67Llama Nemotron Super 49B v1.581.46.881.451$0.10
68GLM 4.5 Air81.410.681.4131K63$0.13
69KAT-Coder-Pro V181.333.481.3108$0.30
70Mi:dm K 2.5 Pro81.38.881.3$0.00
71Qwen3 Next 80B A3B Instruct81.375.87.387.680.6262K161$0.09
72DeepSeek-V3 032481.25.281.2164K$0.28
73Hermes 4 - Llama-3.1 70B81.17.981.160$0.10
74Nova 2.0 Omni80.96.880.9$0.30
75Llama 3.1 405B Instruct80.94.288.673.387.3128K100$0.89
76gpt-oss-120b80.81980.890131K500$0.04
77Gemini 2.5 Flash-Lite80.86.680.8$0.10
78MiniMax M1 40k80.87.580.8$0.00
79Qwen3 VL 30B A3B80.78.780.7122$0.20
80Mistral Large 380.74.180.7262K54$0.50
81Ring-1T80.610.280.6$0.00
82Nova Pro80.63.492.169.185.9300K100$0.80
83Qwen3 30B A3B 250780.59.880.5151$0.30
84Solar Pro 280.5780.5$0.00
85Gemini 2.0 Pro80.56.880.5$0.00
86Llama 4 Maverick80.54.880.585.51M639$0.15
87Llama 3.3 70B Instruct80.5492.168.986131K2220$0.10
88Qwen3 235B A22B80.377.195.611.768.287.8131K68$0.46
89Grok-3805.180128K100$3.00
90Qwen38080128K
91Claude Haiku 4.5809.780200K100$1.00
92Phi 4 Reasoning Plus807984.976
93GLM 4.6V79.98.979.9131K44$0.30
94Gemini 2.0 Flash Thinking79.87.179.8$0.00
95Motif-2-12.7B-Reasoning79.68.279.6$0.00
96GPT-4.179.670.85.487.480.690.21M100$2.00
97DeepSeek R1 Distill Llama 70B79.56.179.5128K37$0.10
98NVIDIA Nemotron 3 Nano 30B A3B79.410.279.4148$0.10
99Ring-flash-2.079.38.979.3$0.10
100Llama 3.1 Nemotron Nano 8B V179.379.3
101Grok Code Fast 179.37.579.3$0.00
102Qwen3 Omni 30B A3B79.27.379.2102$0.30
103Qwen3 VL 32B Instruct79.16.379.1262K76$0.10
104Apriel-v1.6-15B-Thinker799.879$0.00
105Mistral Small 3 24B Instruct78.987.682.966.332K134$0.10
106Qwen3 30B A3B78.872.274.3916.677.7131K122$0.09
107GLM 4.5V78.85.978.866K85$0.60
108Qwen3 Coder 480B A35B Instruct78.84.478.869$0.30
109K2-V278.69.878.6$0.00
110HyperCLOVA X SEED Think78.55.578.5$0.00
111GPT-5 nano788.778400K500$0.05
112QwQ-32B77.873.18.283.976.431$0.70
113Qwen3 30B A3B 2507 Instruct77.76.877.7122$0.20
114Ling-flash-2.077.76.377.791$0.10
115Claude 3.5 Sonnet77.63.977.690.4200K101$3.00
116ERNIE 4.5 300B A47B77.63.577.6131K24$0.28
117Qwen3 14B77.44.377.4132K62$0.10
118Apriel-v1.5-15B-Thinker77.31277.3$0.00
119Phi 4 Reasoning7773.383.474.3
120Llama 3.1 70B Instruct774.687.566.483.6131K1204$0.40
121Magistral Small 1.276.86.176.8106$0.50
122Qwen3 VL 30B A3B Instruct76.46.476.4262K123$0.13
123Gemini 2.0 Flash76.45.376.4871M183$0.10
124GPT-4.1 Mini76.4673.784.178.187.51M150$0.40
125Olmo 3.1 32B Think76.3676.3$0.00
126Qwen2.5 Max76.24.576.250$1.60
127DeepSeek-V2.576.276.280.48K100$0.14
128Devstral 276.23.676.2262K51$0.40
129Mistral Medium 3764.376131K32$0.40
130Qwen3-235B-A22B-Instruct-250775.977.510.688.754.383131K63$0.15
131Olmo 3 32B Think75.95.975.966K$0.15
132NVIDIA Nemotron Nano 12B v2 VL75.95.375.9244$0.20
133Gemini 1.5 Pro75.84.975.885.92M85$1.25
134Grok-275.53.875.587.5128K85$2.00
135Sonar Pro75.57.975.5200K$3.00
136Magistral Medium 175.39.575.3$0.00
137Qwen3 VL 8B74.93.374.9120$0.20
138gpt-oss-20b74.817.374.885.3131K1000$0.03
139Magistral Small 174.67.274.6$0.00
140Nova Lite74.44.689.75980.5300K100$0.06
141Qwen3 4B 250774.35.974.3$0.00
142Llama 4 Scout74.34.374.379.610M776$0.08
143Qwen3 8B74.34.274.3131K69$0.05
144o1-mini74.24.974.285.2128K115$3.00
145NVIDIA Nemotron Nano 9B V274.24.674.2129$0.00
146DeepSeek R1 Distill Qwen 14B744.474$0.00
147DeepSeek R1 Distill Qwen 32B73.95.573.9128K37$0.12
148DeepSeek R1 0528 Qwen3 8B73.95.673.9$0.00
149GPT-4.573.870.888.262.590.8128K50$75.00
150Nova Premier73.34.773.340$2.50
151Falcon-H1R-7B72.510.872.5$0.00
152Qwen3 Omni 30B A3B Instruct72.55.172.5103$0.30
153Qwen2.5 72B Instruct72.252.381.24.284.171.1131K100$0.36
154Grok-2 mini727286.2
155Llama 3.1 Tulu3 405B71.63.571.6$0.00
156Command A71.211.471.2256K203$2.50
157Ministral 8B Instruct70.970.965128K0$0.10
158Devstral Medium70.83.870.8131K72$0.40
159o3-mini70.679.584.612.393.91580.286.9200K115$1.10
160Qwen3 Coder 30B A3B Instruct70.6470.6160K97$0.07
161Grok70.34.770.3$0.00
162Nova Micro70.24.787.253.177.6128K100$0.03
163Pixtral Large70.13.670.1131K0$2.00
164Qwen3 VL 4B704.470$0.00
165Mistral Large 269.7469.784128K42$2.00
166Kimi K2 Instruct69.676.44.789.83181.189.5131K45$0.57
167Kimi K2-Instruct-090569.676.44.789.83181.189.5
168Qwen3 4B69.65.169.6103$0.10
169Sarvam M69.63.369.6136$0.00
170GPT-4 Turbo69.43.369.486.5128K100$10.00
171Ministral 3 14B69.34.669.3262K67$0.20
172Qwen2.5 32B Instruct693.86983.3$0.00
173Llama 3.1 Nemotron 70B Instruct694.66980.2292$1.20
174Sonar68.97.368.9127K$1.00
175Qwen2.5 VL 32B Instruct68.868.878.4
176Qwen3 VL 8B Instruct68.62.968.6256K145$0.08
177Claude 3 Opus68.53.168.586.8200K120$15.00
178Gemini 2.5 Pro68.417.850.8861M85$1.25
179Mistral Medium 3.168.34.468.3131K47$0.40
180Mistral Small 3.268.14.368.1100$0.10
181Devstral Small 267.83.467.862$0.00
182Gemini 1.5 Flash67.34.267.378.91M150$0.15
183Qwen3 4B 2507 Instruct67.24.767.2$0.00
184Llama 3.2 90B Instruct67.14.967.186128K100$0.35
185Ling-mini-2.067.1567.1$0.00
186Reka Flash 366.95.166.966K93$0.10
187Gemma 3 27B Instruct66.94.766.9$0.10
188Granite 3.3 8B Instruct66.257.674.865.5
189Granite 3.3 8B Base66.257.674.863.9
190o166677.74784.192200K66$15.00
191Mistral Small 3.165.94.865.9134$0.10
192GPT-4.1 Nano65.857.23.974.565.780.11M200$0.10
193Olmo 3 7B Think65.55.765.5$0.00
194Mistral Small 365.24.165.233K136$0.05
195Claude 3.5 Haiku653.56580.9200K104$0.80
196QwQ-32B-Preview64.84.864.833K99$0.15
197GPT-4o-mini64.8464.882128K92$0.15
198Qwen2 72B Instruct64.43.764.482.3$0.00
199Llama 3.1 8B Instruct64.45.180.448.369.4131K2047$0.02
200Ministral 3 8B64.24.364.2262K86$0.15
201Qwen2.5 14B Instruct63.763.779.7
202GPT-4o63.760.95.38138.274.788.7128K132$2.50
203Qwen3 VL 4B Instruct63.43.763.4$0.00
204Qwen2.5 Turbo63.34.263.367$0.10
205Devstral Small63.2463.2190$0.10
206Granite 4.0 H Small62.43.762.4524$0.10
207DeepSeek-V362.33.686.124.975.988.5131K100$0.23
208Pixtral-12B61.361.369.2128K0$0.15
209Mistral Saba61.14.161.1$0.00
210Jamba 1.5 Large59.565.4453.581.2256K100$2.00
211Gemma 3 12B Instruct59.54.859.5$0.10
212Exaone 4.0 1.2B58.85.858.8$0.00
213Gemini 1.5 Flash 8B58.74.558.71M150$0.07
214Kimi Linear 48B A3B Instruct58.52.758.5$0.00
215DeepHermes 3 - Mistral 24B583.958$0.00
216Jamba Large 1.757.73.857.7256K48$2.00
217Jamba Reasoning 3B57.74.657.7$0.00
218Llama 3 70B Instruct57.44.457.48K45$0.51
219Hermes 3 - Llama-3.1 70B57.14.157.132$0.30
220Qwen3 1.7B575.257138$0.10
221Claude 3 Sonnet56.83.856.879200K120$3.00
222Jamba 1.6 Large56.5456.552$2.00
223Llama 3.2 3B Instruct56.15.277.434.763.4131K172$0.05
224Gemma 3 27B5690.41067.5131K33$0.08
225Mistral Small 3.1 24B Base565681128K137$0.10
226Llama 3.1 Nemotron Nano 4B v1.155.65.155.6$0.00
227Gemini 2.5 Flash55.11126.983.21M85$0.30
228Mistral Small 3 24B Base54.454.480.7
229DeepSeek R1 Distill Llama 8B54.34.254.3$0.00
230Gemini 2.5 Pro Preview 06-055421.6541M85$1.25
231Qwen2.5 7B Instruct53.935.95271.256.3131K138$0.04
232Mixtral 8x22B Instruct53.74.153.766K$2.00
233Mistral Small52.94.452.9134$0.20
234Ministral 3 3B52.45.352.4131K154$0.10
235Kimi K2 Base52.335.369.287.8
236Olmo 3 7B Instruct52.25.852.2$0.10
237Gemma 3 12B51.988.96.360.6131K33$0.04
238Phi 451.947.675.44.163370.484.816K33$0.07
239Mistral Large51.53.451.5128K$2.00
240OLMo 2 32B51.13.751.1$0.00
241Grok-1.5515181.3
242Gemma 3n E4B Instructed LiteRT Preview50.650.664.9
243Gemma 3n E4B Instructed50.650.664.932K42$20.00
244LFM2 8B A1B50.54.950.5$0.00
245Qwen2.5 Coder 32B Instruct50.43.850.475.1128K110$0.66
246Claude 2.149.54.249.5$0.00
247Mistral Medium49.13.449.145$2.80
248Gemma 3n E4B Instruct48.84.948.856$0.00
249Claude 248.648.678.5100K$0.00
250Phi-4-multimodal-instruct48.54.448.5128K25$0.05
251o1-preview47.352.342.490.8128K66$15.00
252Granite 3.3 8B46.84.246.8376$0.00
253Gemini 2.0 Flash Lite46.74.421.771.61M85$0.08
254Phi 4 Mini Instruct46.54.246.5131K$0.08
255Llama 3.2 11B Instruct46.45.246.473128K168$0.05
256GPT-3.5 Turbo46.246.27016K100$0.50
257Gemma 3 4B45.990.2443.6131K33$0.04
258IBM Granite 4.0 Tiny Preview44.926.76360.4
259Granite 4.0 Micro44.75.144.7131K$0.02
260Jamba 1.5 Mini44.346.15.142.569.7256K100$0.20
261Qwen2 7B Instruct44.144.170.5
262Phi-3 Mini Instruct 3.8B43.54.443.5$0.00
263Claude Instant43.43.843.4$0.00
264Gemini 2.5 Flash Lite43.35.110.775.91M6$0.10
265Command R+43.24.843.275.7128K100$0.15
266Gemini 1.0 Pro43.14.643.171.833K120$0.50
267DeepSeek Coder V2 Lite Instruct42.95.342.9$0.00
268Phi 4 Mini42.832.852.867.3
269LFM 40B42.54.942.5$0.00
270Phi-3.5-mini-instruct42.23747.469128K23$0.10
271Gemma 3 4B Instruct41.75.241.7$0.00
272Phi-3.5-MoE-instruct41.637.945.378.9
273Mistral Small 3.2 24B Instruct41.443.112.169.180.5
274Llama 2 Chat 13B40.64.740.6$0.00
275Llama 2 Chat 70B40.6540.6$0.00
276Llama 3 8B Instruct40.55.140.58K81$0.04
277Gemma 3n E2B Instructed LiteRT (Preview)40.540.560.1
278Gemma 3n E2B Instructed40.540.560.1
279Qwen2.5-Coder 7B Instruct40.14.840.167.6$0.00
280DBRX Instruct39.76.639.7$0.00
281Jamba 1.7 Mini38.84.538.8$0.00
282Mixtral 8x7B Instruct38.74.538.7$0.50
283Mistral Small 3.1 24B Instruct38.610.466.880.6
284Qwen2.5-Omni-7B38.329.647
285Gemma 3n E2B Instruct37.8437.8$0.00
286Molmo 7B-D37.15.137.1$0.00
287Jamba 1.6 Mini36.74.636.7183$0.20
288DeepHermes 3 - Llama-3.1 8B36.54.336.5$0.00
289Qwen3 0.6B34.75.734.7225$0.10
290Granite 4.0 1B32.55.132.5$0.00
291Gemma 3 1B32.480.22.214.7
292OpenChat 3.5314.831$0.00
293LFM2 2.6B29.85.229.8$0.00
294OLMo 2 7B28.25.528.2$0.00
295Granite 4.0 H 1B27.7527.7$0.00
296DeepSeek R1 Distill Qwen 1.5B26.93.326.9$0.00
297LFM2 1.2B25.75.725.7$0.00
298Mistral 7B Instruct24.54.324.590$0.20
299Llama 3.2 1B Instruct205.320131K91$0.03
300Llama 2 Chat 7B16.45.816.4113$0.10
301Gemma 3 1B Instruct13.55.213.5$0.00
302Granite 4.0 H 350M12.76.412.7$0.00
303Granite 4.0 350M12.45.712.4$0.00
304Gemma 3 270M5.54.25.5$0.00

304 models, ranked by the General index. The intelligence index is a balanced mean of per-category scores; category columns average the benchmarks within each. Scores are curated approximations; see each model for sources.
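
As a rough illustration of the two-level averaging described above, the sketch below first averages the benchmarks within each category, then takes an unweighted mean across categories. The function names, category names, benchmark names, and scores are all hypothetical. The page does not state its weights or how it treats missing scores, so skipping empty categories is an assumption.

```python
from statistics import mean


def category_scores(benchmarks_by_category):
    """Average the benchmark scores within each category.

    Categories with no scores are skipped (an assumption, not the site's rule).
    """
    return {
        category: mean(list(scores.values()))
        for category, scores in benchmarks_by_category.items()
        if scores
    }


def balanced_index(benchmarks_by_category):
    """Unweighted mean of the per-category averages: each category counts once."""
    per_category = category_scores(benchmarks_by_category)
    if not per_category:
        raise ValueError("no benchmark scores available")
    return mean(list(per_category.values()))


# Hypothetical model with made-up scores.
example = {
    "general": {"bench_a": 90.0, "bench_b": 80.0},
    "math": {"bench_c": 70.0},
    "coding": {},  # no scores reported
}

print(category_scores(example))
print(balanced_index(example))
```

Here the category averages are 85.0 and 70.0, so the balanced index is 77.5. A flat mean over all three benchmarks would give 80.0, which shows how balancing keeps a category with many benchmarks from dominating the index.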