AI Hub
Leaderboards

Model rankings

A balanced intelligence index averages each model's per-category scores. Drill into a category for individual benchmarks, or sort by speed, price, and context. See what changed → How this is calculated → Embed this leaderboard →

Updated May 25, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Report an error

Top intelligence
Sonar Reasoning Pro
95.7 index
Top reasoning
Claude Opus 4.7
94.2
Top math
Grok-4 Heavy
100
Fastest
Llama 3.3 70B Instruct
2220 tok/s
Cheapest
Ling-2.6-flash
$0.01/M
Longest context
Llama 4 Scout
10M
Best open-weights
DeepSeek V3.2 Speciale
89.9 index

Price vs. intelligence

Intelligence index vs. input price — up and to the left is better value.

1008468523620$0$4$8$12$16$20Input price ($/M tokens)Intelligence indexSonar Reasoning Pro: 95.7 @ $2Sonar Reasoning ProQwen3.7 Max: 92.3 @ $2.5Gemini 3.5 Flash: 92.2 @ $1.5GPT-5.3-Codex: 91.5 @ $1.75Grok 4.20 0309 v2: 91.1 @ $2Claude Opus 4.7: 90.9 @ $5Gemini 3 Flash: 90.2 @ $0.5Gemini 3 FlashGrok 4.3: 90.1 @ $1.25DeepSeek V3.2 Speciale: 89.9 @ $0.287DeepSeek V3.2 SpecialeGPT-5.2-Codex: 89.9 @ $1.75DeepSeek-V4-Flash: 89.4 @ $0.1DeepSeek-V4-FlashQwen3.5 397B A17B: 89.3 @ $0.39Qwen3.5 397B A17BGLM 4.7: 89 @ $0.4GPT-5.1: 89 @ $1.25Qwen3.6 Max: 88.8 @ $1.04Grok 4.20 0309: 88.5 @ $2GPT-5 Pro: 88.4 @ $15GPT-5.1-Codex: 88.2 @ $1.25Qwen3.6 Plus: 88.2 @ $0.325DeepSeek-V4-Pro: 88.2 @ $0.435MiMo-V2-Flash: 88 @ $0.1MiMo-V2-FlashClaude Opus 4.5: 88 @ $5Kimi K2.5: 87.9 @ $0.4GPT-5.4 mini: 87.5 @ $0.75MiniMax M2.7: 87.4 @ $0.279GPT-5 Codex: 87.1 @ $1.25DeepSeek-V3.2: 87.1 @ $0.252MiMo-V2-Pro: 87 @ $1GLM 5.1: 86.8 @ $0.98Hy3: 86.7 @ $0.066MiMo-V2.5-Pro: 86.6 @ $1GPT-5.2: 86.2 @ $1.75Grok-3 Mini: 85.9 @ $0.3Qwen3.5-27B: 85.8 @ $0.195Qwen3.5-122B-A10B: 85.7 @ $0.26Gemma 4 31B: 85.7 @ $0.12Ring-2.6-1T: 85.7 @ $0.075Kimi K2 Thinking: 85.6 @ $0.6MiMo-V2-Omni-0327: 85.5 @ $0.4KAT-Coder-Pro V2: 85.5 @ $0.3MiMo-V2.5: 84.9 @ $0.4MiniMax M2.5: 84.8 @ $0.15GPT-5.1-Codex-Mini: 84.7 @ $0.25GLM 5 Turbo: 84.7 @ $1.2o3 Pro: 84.5 @ $20Qwen3.5-35B-A3B: 84.5 @ $0.139Qwen3 235B A22B 2507: 84.2 @ $0.4Qwen3.6 27B: 84.2 @ $0.3Qwen3.6 35B A3B: 84.1 @ $0.15MiniMax M2.1: 83.6 @ $0.29DeepSeek V3.1 Terminus: 83.5 @ $0.27Gemini 3.1 Pro: 83.2 @ $2Step 3.5 Flash: 83.1 @ $0.09MiMo-V2-Omni: 82.8 @ $0.4Gemini 3 Pro: 82.8 @ $2Qwen3.5 Omni Plus: 82.6 @ $0.4Grok-3: 82.6 @ $3o1-pro: 82.5 @ $150Gemini 3.1 Flash Lite: 82.2 @ $0.25Nova 2 Lite: 82.1 @ $0.3GLM-5: 81.9 @ $0.6KAT-Coder-Pro V1: 81.8 @ $0.3GPT-5.4 nano: 81.7 @ $0.2INTELLECT-3: 81 @ $0.2Nova 2.0 Pro: 80.9 @ $1.3Grok 3 mini Reasoning: 80.9 @ $0.3GLM 5V Turbo: 80.9 @ $1.2Qwen3.5-9B: 80.6 @ $0.04GPT-5: 80.5 @ $1.25Claude Sonnet 4.5: 80.4 @ $3Qwen3-Next-80B-A3B: 80.3 @ $0.5NVIDIA Nemotron 3 Nano 30B A3B: 80.1 @ $0.1NVIDIA Nemotron 3 Super 120B A12B: 80 @ $0.3Qwen3-235B-A22B-Thinking-2507: 79.6 @ $0.3gpt-oss-120b: 79.6 @ $0.039Qwen3 Max: 79.5 @ $0.78Llama Nemotron Super 49B v1.5: 79.4 @ $0.1Claude Opus 4.6: 79.4 @ $5Gemma 4 26B A4B: 79.2 @ $0.06GPT-5 mini: 79.2 @ $0.25Qwen2.5 VL 72B Instruct: 79.1 @ $0.25Seed-OSS-36B-Instruct: 78.8 @ $0.2Grok 4 Fast: 78.7 @ $0.2Qwen3 VL 235B A22B: 78.4 @ $0.8Qwen3 VL 32B: 78.4 @ $0.7o4-mini: 78.4 @ $1.1Llama 3.1 Nemotron Ultra 253B v1: 78.3 @ $0.6Nova 2.0 Omni: 78.2 @ $0.3Grok 4: 78.2 @ $3Magistral Medium 1.2: 78.1 @ $2Nemotron Nano 9B V2: 77.6 @ $0.04Qwen3 Next 80B A3B Thinking: 77.5 @ $0.098Mercury 2: 77 @ $0.25Mistral Small 4: 76.9 @ $0.15Gemini 2.5 Pro Preview 06-05: 76.6 @ $1.25Claude Sonnet 4.6: 76.3 @ $3Qwen3 VL 30B A3B: 76.2 @ $0.2Qwen3 Max Thinking: 76.1 @ $0.78GPT-5.5: 76.1 @ $5MiniMax-M2: 76 @ $0.255Cogito v2.1: 75.8 @ $1.3MiniMax M1 80k: 75.5 @ $0.6Claude Opus 4.1: 75.4 @ $15Claude Haiku 4.5: 75.3 @ $1Trinity Large Thinking: 75.2 @ $0.22Ling-2.6-1T: 75.2 @ $0.075DeepSeek-R1: 75 @ $0.55DeepSeek VL2: 74.9 @ $9.5Qwen3 235B A22B: 74.9 @ $0.455Kimi K2.6: 74.9 @ $0.73GPT-5.4: 74.9 @ $2.5Mistral Medium 3.5: 74.8 @ $1.5Qwen3 30B A3B 2507: 74.7 @ $0.3Claude 3.7 Sonnet: 74.7 @ $3Ring-flash-2.0: 74.6 @ $0.1Claude Sonnet 4: 74.5 @ $3Qwen3.5 Omni Flash: 74.2 @ $0.1Magistral Small 1.2: 73.9 @ $0.5Qwen3 32B: 73.8 @ $0.08Qwen3 Coder Next: 73.7 @ $0.11gpt-oss-20b: 73.6 @ $0.03Kimi K2: 73.6 @ $0.57Hermes 4 - Llama-3.1 405B: 73.5 @ $1Qwen3 Omni 30B A3B: 73.4 @ $0.3Gemini 2.5 Flash: 73.1 @ $0.3GLM-4.5: 73 @ $0.6Solar Pro 3: 72.4 @ $0.15GLM-4.6: 72.4 @ $0.43Gemini 2.5 Flash-Lite: 72.3 @ $0.1Qwen3-235B-A22B-Instruct-2507: 72.2 @ $0.15DeepSeek V3.2 Exp: 72.2 @ $0.27Qwen3 30B A3B: 71.7 @ $0.09Gemini 2.5 Pro: 71.6 @ $1.25o3: 71.6 @ $2Hermes 4 - Llama-3.1 70B: 71.3 @ $0.1GPT-5 nano: 71.2 @ $0.05Kimi K2 0905: 71 @ $0.6Ministral 8B Instruct: 70.9 @ $0.1Qwen3 VL 235B A22B Instruct: 70.9 @ $0.2o1-mini: 70.5 @ $3GLM 4.5 Air: 70.4 @ $0.13Claude 3.5 Sonnet: 70.3 @ $3DeepSeek R1 Distill Llama 70B: 70.1 @ $0.1GLM 4.5V: 70.1 @ $0.6GLM 4.6V: 69.6 @ $0.3Olmo 3 32B Think: 69.5 @ $0.15NVIDIA Nemotron Nano 12B v2 VL: 69.4 @ $0.2Claude Opus 4: 69.4 @ $15Qwen3 30B A3B 2507 Instruct: 69.3 @ $0.2Qwen3 Next 80B A3B Instruct: 68.9 @ $0.09DeepSeek R1 Distill Qwen 32B: 68.4 @ $0.12ERNIE 4.5 300B A47B: 68.2 @ $0.28QwQ-32B: 67.8 @ $0.7Gemini 1.5 Pro: 67.3 @ $1.25Ling-flash-2.0: 66.9 @ $0.1Qwen3 14B: 66.8 @ $0.1Qwen3 Coder 480B A35B Instruct: 66.5 @ $0.3Kimi K2 Instruct: 66.5 @ $0.57Qwen3 VL 30B A3B Instruct: 66.5 @ $0.13Qwen3 VL 32B Instruct: 66.5 @ $0.104Pixtral-12B: 66.1 @ $0.15DeepSeek-V3 0324: 65.9 @ $0.28o1: 65.4 @ $15o3-mini: 64.1 @ $1.1Llama 4 Maverick: 63.9 @ $0.15GPT-4.1: 63.8 @ $2Qwen2.5 Max: 63.6 @ $1.6DeepSeek-V2.5: 63.4 @ $0.14DeepSeek-R1-0528: 63.3 @ $0.55QwQ-32B-Preview: 62.6 @ $0.15Grok-2: 62.4 @ $2Mistral Small 3 24B Instruct: 62.1 @ $0.1Gemini 1.5 Flash: 61.9 @ $0.15Nova Pro: 61.6 @ $0.8MiniMax-M1: 61.5 @ $0.4Llama 3.1 405B Instruct: 60.9 @ $0.89Gemini 2.0 Flash: 60.3 @ $0.1DeepSeek-V3.1: 59.8 @ $0.21GPT-4 Turbo: 59.8 @ $10GPT-4.5: 59.4 @ $75Ling-2.6-flash: 59.3 @ $0.01Qwen2.5 72B Instruct: 59.1 @ $0.36Llama 4 Scout: 58.9 @ $0.08Sonar Pro: 58.8 @ $3Mistral Medium 3: 58.6 @ $0.4Claude 3 Opus: 58.5 @ $15Gemma 3 27B: 58.4 @ $0.08Mistral Large 3: 58.3 @ $0.5GPT-4: 58.3 @ $30GPT-4.1 Mini: 58.2 @ $0.4GLM 4.7 Flash: 58.1 @ $0.06DeepSeek-V3: 58.1 @ $0.229Qwen3 8B: 57.8 @ $0.05Nova Lite: 57.7 @ $0.06Qwen3 Omni 30B A3B Instruct: 57.3 @ $0.3o1-preview: 57.3 @ $15Gemini 2.5 Flash Lite: 57 @ $0.1Sonar: 56.8 @ $1Qwen3 4B: 56.5 @ $0.1GPT-4o: 56.4 @ $2.5Reka Flash 3: 56.2 @ $0.1Llama 3.1 70B Instruct: 56 @ $0.4Command A: 55.9 @ $2.5Gemma 3 12B: 55.5 @ $0.04Qwen3 Coder 30B A3B Instruct: 55.4 @ $0.07Claude 3.5 Haiku: 54.5 @ $0.8Gemini 2.0 Flash Lite: 54.4 @ $0.075Devstral 2: 54.3 @ $0.4Llama 3.2 90B Instruct: 54 @ $0.35Nova Premier: 53.1 @ $2.5Pixtral Large: 53.1 @ $2Reka Flash: 52.9 @ $0.2Mistral Medium 3.1: 51.5 @ $0.4Mistral Small 3.2: 51 @ $0.1Mistral Small 3.1 24B Base: 50.9 @ $0.1Llama 3.3 70B Instruct: 50.6 @ $0.1Qwen2.5 Turbo: 50.3 @ $0.1Qwen2.5 Coder 32B Instruct: 50.1 @ $0.66Qwen3 VL 8B: 49.7 @ $0.2GPT-4o-mini: 49.1 @ $0.15Nova Micro: 49 @ $0.03Gemini 1.5 Flash 8B: 48.4 @ $0.07Ministral 3 14B: 47.9 @ $0.2Mistral Large 2: 47.9 @ $2Devstral Medium: 47.8 @ $0.4Phi 4: 47.6 @ $0.065LFM2-24B-A2B: 47.4 @ $0.03Qwen3 1.7B: 46.9 @ $0.1Nemotron 3 Nano Omni 30B A3B Reasoning: 46.9 @ $0.1Qwen2.5 7B Instruct: 46.6 @ $0.04Phi-4-multimodal-instruct: 46.2 @ $0.05Phi-3.5-mini-instruct: 46 @ $0.1Claude 3 Sonnet: 45.8 @ $3Gemma 3 4B: 45.8 @ $0.04Devstral Small: 45.3 @ $0.1Llama 3.1 8B Instruct: 45 @ $0.02Gemma 3 27B Instruct: 44.5 @ $0.1Llama 3.1 Nemotron 70B Instruct: 43.7 @ $1.2Mistral Small 3: 43.6 @ $0.05Ministral 3 8B: 43.3 @ $0.15Granite 4.1 8B: 43.3 @ $0.05Qwen3 VL 8B Instruct: 43 @ $0.08Jamba 1.5 Large: 42.8 @ $2Jamba 1.6 Large: 42.6 @ $2Hermes 3 - Llama-3.1 70B: 42.5 @ $0.3Mistral Small 3.1: 42.4 @ $0.1GPT-4.1 Nano: 42.1 @ $0.1Llama 3 70B Instruct: 40.8 @ $0.51Mistral Small: 40.3 @ $0.2Gemma 3 12B Instruct: 40 @ $0.1Olmo 3 7B Instruct: 40 @ $0.1Mistral Large: 39.3 @ $2Mixtral 8x22B Instruct: 39.1 @ $2Claude 3 Haiku: 38.9 @ $0.25Llama 3.2 11B Instruct: 36.7 @ $0.05Jamba Large 1.7: 36.5 @ $2Granite 4.0 H Small: 35.7 @ $0.1GPT-3.5 Turbo: 35.2 @ $0.5Gemini 1.0 Pro: 34 @ $0.5Ministral 3 3B: 33.7 @ $0.1Mistral Medium: 33.6 @ $2.8Solar Mini: 33.1 @ $0.2Phi 4 Mini Instruct: 32.6 @ $0.08Llama 3 8B Instruct: 32.4 @ $0.04Llama 3.2 3B Instruct: 30.8 @ $0.051Jamba 1.5 Mini: 29.6 @ $0.2Qwen3 0.6B: 29.3 @ $0.1Command R+: 28.9 @ $0.15Apertus 70B Instruct: 27.2 @ $0.8Mixtral 8x7B Instruct: 26.1 @ $0.5Apertus 8B Instruct: 25.6 @ $0.1Granite 4.0 Micro: 25.6 @ $0.017Jamba 1.6 Mini: 24.9 @ $0.2Gemma 3n E4B Instructed: 24.8 @ $20Mistral 7B Instruct: 14.7 @ $0.2Llama 3.2 1B Instruct: 12.1 @ $0.027Llama 2 Chat 7B: 11.3 @ $0.1

Speed vs. intelligence

Intelligence index vs. output speed — up and to the right is fast and smart.

1008468523620080160240320400Output speed (tokens/s)Intelligence indexQwen3.7 Max: 92.3 @ 203Gemini 3.5 Flash: 92.2 @ 221GPT-5.3-Codex: 91.5 @ 73Grok 4.20 0309 v2: 91.1 @ 105Claude Opus 4.7: 90.9 @ 49Gemini 3 Flash: 90.2 @ 191Grok 4.3: 90.1 @ 88GPT-5.2-Codex: 89.9 @ 106DeepSeek-V4-Flash: 89.4 @ 109Qwen3.5 397B A17B: 89.3 @ 53GLM 4.7: 89 @ 98GPT-5.1: 89 @ 115Qwen3.6 Max: 88.8 @ 36Grok 4.20 0309: 88.5 @ 97GPT-5.1-Codex: 88.2 @ 188Qwen3.6 Plus: 88.2 @ 52DeepSeek-V4-Pro: 88.2 @ 30MiMo-V2-Flash: 88 @ 145Claude Opus 4.5: 88 @ 58Kimi K2.5: 87.9 @ 35GPT-5.4 mini: 87.5 @ 162MiniMax M2.7: 87.4 @ 50GPT-5 Codex: 87.1 @ 180MiMo-V2-Pro: 87 @ 60GLM 5.1: 86.8 @ 53Hy3: 86.7 @ 100MiMo-V2.5-Pro: 86.6 @ 58GPT-5.2: 86.2 @ 73Grok-3 Mini: 85.9 @ 100Qwen3.5-27B: 85.8 @ 91Qwen3.5-122B-A10B: 85.7 @ 129Gemma 4 31B: 85.7 @ 36Ring-2.6-1T: 85.7 @ 120Kimi K2 Thinking: 85.6 @ 100MiMo-V2-Omni-0327: 85.5 @ 110KAT-Coder-Pro V2: 85.5 @ 108MiMo-V2.5: 84.9 @ 92MiniMax M2.5: 84.8 @ 87GPT-5.1-Codex-Mini: 84.7 @ 175o3 Pro: 84.5 @ 25Qwen3.5-35B-A3B: 84.5 @ 121Qwen3 235B A22B 2507: 84.2 @ 59Qwen3.6 27B: 84.2 @ 64Qwen3.6 35B A3B: 84.1 @ 169MiniMax M2.1: 83.6 @ 92Gemini 3.1 Pro: 83.2 @ 142Step 3.5 Flash: 83.1 @ 194MiMo-V2-Omni: 82.8 @ 108Gemini 3 Pro: 82.8 @ 141Qwen3.5 Omni Plus: 82.6 @ 54Step 3.5 Flash 2603: 82.6 @ 197Grok-3: 82.6 @ 100Gemini 3.1 Flash Lite: 82.2 @ 342Nova 2 Lite: 82.1 @ 229GLM-5: 81.9 @ 67KAT-Coder-Pro V1: 81.8 @ 108GPT-5.4 nano: 81.7 @ 157Nova 2.0 Pro: 80.9 @ 149Grok 3 mini Reasoning: 80.9 @ 33Qwen3.5-9B: 80.6 @ 51GPT-5: 80.5 @ 100Claude Sonnet 4.5: 80.4 @ 42Qwen3-Next-80B-A3B: 80.3 @ 147NVIDIA Nemotron 3 Nano 30B A3B: 80.1 @ 148NVIDIA Nemotron 3 Super 120B A12B: 80 @ 211gpt-oss-120b: 79.6 @ 500Qwen3 Max: 79.5 @ 45Llama Nemotron Super 49B v1.5: 79.4 @ 51Claude Opus 4.6: 79.4 @ 48Gemma 4 26B A4B: 79.2 @ 66GPT-5 mini: 79.2 @ 200Seed-OSS-36B-Instruct: 78.8 @ 37Grok 4 Fast: 78.7 @ 90Qwen3 VL 235B A22B: 78.4 @ 34Qwen3 VL 32B: 78.4 @ 93o4-mini: 78.4 @ 115Llama 3.1 Nemotron Ultra 253B v1: 78.3 @ 42Grok 4: 78.2 @ 100Magistral Medium 1.2: 78.1 @ 42Qwen3.5 4B: 77.1 @ 164Mercury 2: 77 @ 790Mercury 2Mistral Small 4: 76.9 @ 145Gemini 2.5 Pro Preview 06-05: 76.6 @ 85Claude Sonnet 4.6: 76.3 @ 75Qwen3 VL 30B A3B: 76.2 @ 122Qwen3 Max Thinking: 76.1 @ 45GPT-5.5: 76.1 @ 67MiniMax-M2: 76 @ 91Cogito v2.1: 75.8 @ 56Claude Opus 4.1: 75.4 @ 120Claude Haiku 4.5: 75.3 @ 100Trinity Large Thinking: 75.2 @ 129DeepSeek-R1: 75 @ 189DeepSeek VL2: 74.9 @ 22Qwen3 235B A22B: 74.9 @ 68Kimi K2.6: 74.9 @ 57GPT-5.4: 74.9 @ 84Mistral Medium 3.5: 74.8 @ 140Qwen3 30B A3B 2507: 74.7 @ 151Claude 3.7 Sonnet: 74.7 @ 101Claude Sonnet 4: 74.5 @ 101Qwen3.5 Omni Flash: 74.2 @ 235Magistral Small 1.2: 73.9 @ 106Sarvam 105B: 73.8 @ 128Qwen3 32B: 73.8 @ 328Qwen3 Coder Next: 73.7 @ 92gpt-oss-20b: 73.6 @ 1000gpt-oss-20bKimi K2: 73.6 @ 26Hermes 4 - Llama-3.1 405B: 73.5 @ 34Qwen3 Omni 30B A3B: 73.4 @ 102Gemini 2.5 Flash: 73.1 @ 85GLM-4.5: 73 @ 85GLM-4.6: 72.4 @ 85Qwen3-235B-A22B-Instruct-2507: 72.2 @ 63DeepSeek V3.2 Exp: 72.2 @ 100Qwen3 30B A3B: 71.7 @ 122Gemini 2.5 Pro: 71.6 @ 85o3: 71.6 @ 50Hermes 4 - Llama-3.1 70B: 71.3 @ 60GPT-5 nano: 71.2 @ 500Kimi K2 0905: 71 @ 16Ministral 8B Instruct: 70.9 @ 0Qwen3 VL 235B A22B Instruct: 70.9 @ 51o1-mini: 70.5 @ 115GLM 4.5 Air: 70.4 @ 63Claude 3.5 Sonnet: 70.3 @ 101DeepSeek R1 Distill Llama 70B: 70.1 @ 37GLM 4.5V: 70.1 @ 85GLM 4.6V: 69.6 @ 44NVIDIA Nemotron Nano 12B v2 VL: 69.4 @ 244Claude Opus 4: 69.4 @ 120Qwen3 30B A3B 2507 Instruct: 69.3 @ 122Qwen3 Next 80B A3B Instruct: 68.9 @ 161DeepSeek R1 Distill Qwen 32B: 68.4 @ 37NVIDIA Nemotron Nano 9B V2: 68.3 @ 129ERNIE 4.5 300B A47B: 68.2 @ 24QwQ-32B: 67.8 @ 31Gemini 1.5 Pro: 67.3 @ 85Ling-flash-2.0: 66.9 @ 91Qwen3 14B: 66.8 @ 62Qwen3 Coder 480B A35B Instruct: 66.5 @ 69Kimi K2 Instruct: 66.5 @ 45Qwen3 VL 30B A3B Instruct: 66.5 @ 123Qwen3 VL 32B Instruct: 66.5 @ 76Pixtral-12B: 66.1 @ 0o1: 65.4 @ 66o3-mini: 64.1 @ 115Llama 4 Maverick: 63.9 @ 639GPT-4.1: 63.8 @ 100Qwen2.5 Max: 63.6 @ 50LongCat Flash Lite: 63.6 @ 110DeepSeek-V2.5: 63.4 @ 100Sarvam 30B: 63.3 @ 214DeepSeek-R1-0528: 63.3 @ 45QwQ-32B-Preview: 62.6 @ 99Grok-2: 62.4 @ 85Mistral Small 3 24B Instruct: 62.1 @ 134Gemini 1.5 Flash: 61.9 @ 150Nova Pro: 61.6 @ 100Llama 3.1 405B Instruct: 60.9 @ 100Gemini 2.0 Flash: 60.3 @ 183GPT-4 Turbo: 59.8 @ 100GPT-4.5: 59.4 @ 50Qwen2.5 72B Instruct: 59.1 @ 100Llama 4 Scout: 58.9 @ 776Llama 4 ScoutMistral Medium 3: 58.6 @ 32Claude 3 Opus: 58.5 @ 120Gemma 3 27B: 58.4 @ 33Mistral Large 3: 58.3 @ 54GPT-4: 58.3 @ 104GPT-4.1 Mini: 58.2 @ 150GLM 4.7 Flash: 58.1 @ 113DeepSeek-V3: 58.1 @ 100Qwen3 8B: 57.8 @ 69Nova Lite: 57.7 @ 100Qwen3 Omni 30B A3B Instruct: 57.3 @ 103o1-preview: 57.3 @ 66Gemini 2.5 Flash Lite: 57 @ 6Qwen3 4B: 56.5 @ 103Sarvam M: 56.4 @ 136GPT-4o: 56.4 @ 132Reka Flash 3: 56.2 @ 93Llama 3.1 70B Instruct: 56 @ 1204Llama 3.1 70B InstructCommand A: 55.9 @ 203Gemma 3 12B: 55.5 @ 33Qwen3 Coder 30B A3B Instruct: 55.4 @ 97Claude 3.5 Haiku: 54.5 @ 104Gemini 2.0 Flash Lite: 54.4 @ 85Devstral 2: 54.3 @ 51Llama 3.2 90B Instruct: 54 @ 100Nova Premier: 53.1 @ 40Pixtral Large: 53.1 @ 0Reka Flash: 52.9 @ 85Mistral Medium 3.1: 51.5 @ 47Mistral Small 3.2: 51 @ 100Mistral Small 3.1 24B Base: 50.9 @ 137Llama 3.3 70B Instruct: 50.6 @ 2220Llama 3.3 70B InstructQwen2.5 Turbo: 50.3 @ 67Qwen2.5 Coder 32B Instruct: 50.1 @ 110Qwen3 VL 8B: 49.7 @ 120GPT-4o-mini: 49.1 @ 92Nova Micro: 49 @ 100Gemini 1.5 Flash 8B: 48.4 @ 150Ministral 3 14B: 47.9 @ 67Mistral Large 2: 47.9 @ 42Devstral Medium: 47.8 @ 72Phi 4: 47.6 @ 33Devstral Small 2: 47.5 @ 62LFM2-24B-A2B: 47.4 @ 208Qwen3 1.7B: 46.9 @ 138Nemotron 3 Nano Omni 30B A3B Reasoning: 46.9 @ 301Qwen2.5 7B Instruct: 46.6 @ 138Phi-4-multimodal-instruct: 46.2 @ 25Phi-3.5-mini-instruct: 46 @ 23Claude 3 Sonnet: 45.8 @ 120Gemma 3 4B: 45.8 @ 33Qwen3.5 2B: 45.6 @ 328Devstral Small: 45.3 @ 190Llama 3.1 8B Instruct: 45 @ 2047Llama 3.1 8B InstructLlama 3.1 Nemotron 70B Instruct: 43.7 @ 292Mistral Small 3: 43.6 @ 136Ministral 3 8B: 43.3 @ 86Granite 4.1 8B: 43.3 @ 133Qwen3 VL 8B Instruct: 43 @ 145Jamba 1.5 Large: 42.8 @ 100Jamba 1.6 Large: 42.6 @ 52Hermes 3 - Llama-3.1 70B: 42.5 @ 32Mistral Small 3.1: 42.4 @ 134GPT-4.1 Nano: 42.1 @ 200Llama 3 70B Instruct: 40.8 @ 45Mistral Small: 40.3 @ 134Claude 3 Haiku: 38.9 @ 104Llama 3.2 11B Instruct: 36.7 @ 168Jamba Large 1.7: 36.5 @ 48Granite 4.0 H Small: 35.7 @ 524GPT-3.5 Turbo: 35.2 @ 100Gemma 3n E4B Instruct: 34.7 @ 56Gemini 1.0 Pro: 34 @ 120Ministral 3 3B: 33.7 @ 154Mistral Medium: 33.6 @ 45Solar Mini: 33.1 @ 63Granite 3.3 8B: 32.5 @ 376Llama 3 8B Instruct: 32.4 @ 81Llama 3.2 3B Instruct: 30.8 @ 172Tiny Aya Global: 30.5 @ 126Jamba 1.5 Mini: 29.6 @ 100Qwen3 0.6B: 29.3 @ 225Command R+: 28.9 @ 100Jamba 1.6 Mini: 24.9 @ 183Gemma 3n E4B Instructed: 24.8 @ 42Qwen3.5 0.8B: 23.6 @ 120Mistral 7B Instruct: 14.7 @ 90Llama 3.2 1B Instruct: 12.1 @ 91Llama 2 Chat 7B: 11.3 @ 113
#ModelMath idxMATH-500FrontierMathHMMT 2025GSM8KMGSMAIME 2024AIME 2025MATHContextSpeedIn $/M
1Grok-4 Heavy100100
2GPT-5.2100100400K73$1.75
3GPT-5 Codex98.798.7400K180$1.25
4Gemini 3 Flash97971M191$0.50
5DeepSeek V3.2 Speciale96.796.7164K$0.29
6MiMo-V2-Flash96.396.3262K145$0.10
7Claude Haiku 4.596.396.3200K100$1.00
8Sonar Reasoning Pro95.795.7128K$2.00
9GPT-5.1-Codex95.795.7400K188$1.25
10Gemini 3 Pro95.795.71M141$2.00
11R1 177695.495.4$0.00
12Grok 495.49991.7256K100$3.00
13GLM 4.79595203K98$0.40
14o4-mini9598.993.492.7200K115$1.10
15Kimi K2 Thinking94.794.7262K100$0.60
16Qwen3 235B A22B 250794.798.49159$0.40
17KAT-Coder-Pro V194.794.7108$0.30
18Phi 4 Mini Reasoning94.694.6
19Nova 2 Lite94.394.31M229$0.30
20GPT-5.19494400K115$1.25
21GLM-4.693.993.9203K85$0.43
22gpt-oss-120b93.493.4131K500$0.04
23Grok-3 Mini93.395.890.8128K100$0.30
24Grok 4 Fast92.793.3922M90$0.20
25Qwen3-235B-A22B-Thinking-250792.392.3256K$0.30
26Gemini 2.0 Pro92.392.3$0.00
27Gemini 2.5 Pro92.296.79288921M85$1.25
28Sonar Reasoning92.192.1$0.00
29DeepSeek-V3.29292131K$0.25
30Grok 3 mini Reasoning9299.284.733$0.30
31GPT-5.1-Codex-Mini91.791.7400K175$0.25
32Claude Opus 4.591.391.3200K58$5.00
33DeepSeek R1 Zero91.395.986.7
34Grok-391.28793.393.3128K100$3.00
35NVIDIA Nemotron 3 Nano 30B A3B9191148$0.10
36K-EXAONE90.390.3$0.00
37o1-mini9090128K115$3.00
38DeepSeek V3.1 Terminus89.789.7164K$0.27
39Nova 2.0 Omni89.789.7$0.30
40GLM 4.5 Air89.498.189.480.7131K63$0.13
41Grok 4.1 Fast89.389.3$0.00
42Ring-1T89.389.3$0.00
43gpt-oss-20b89.389.3131K1000$0.03
44DeepSeek-R1-052889.298.379.491.487.5131K45$0.55
45Nova 2.0 Pro8989149$1.30
46EXAONE 4.0 32B88.997.780$0.00
47Qwen3 VL 235B A22B88.388.334$0.80
48DeepSeek R1 Distill Qwen 7B88.192.883.3
49INTELLECT-38888131K$0.20
50Apriel-v1.6-15B-Thinker8888$0.00
51Gemini 2.5 Pro Preview 06-0588881M85$1.25
52Qwen3 Next 80B A3B Thinking87.887.8262K$0.10
53GLM-4.587.698.29173.7131K85$0.60
54Gemini 1.5 Pro87.687.690.887.586.52M85$1.25
55Llama Nemotron Super 49B v1.587.598.376.751$0.10
56Apriel-v1.5-15B-Thinker87.587.5$0.00
57Gemini 2.0 Flash Lite87.387.386.81M85$0.08
58Claude Sonnet 4.587871M42$3.00
59Kimi-k1.586.996.277.5
60Claude Opus 486.998.275.5200K120$15.00
61Qwen3 235B A22B86.79394.483.585.781.571.8131K68$0.46
62DeepSeek V3.2 Exp86.483.689.3164K100$0.27
63o1-pro8686200K$150.00
64Gemini 2.5 Flash8698.188721M85$0.30
65GLM 4.6V85.385.3131K44$0.30
66ERNIE 5.0 Thinking8585$0.00
67Nemotron Nano 9B V284.997.872.1131K$0.04
68Llama 3.1 Nemotron Ultra 253B v184.89772.542$0.60
69Claude Sonnet 484.899.170.51M101$3.00
70Seed-OSS-36B-Instruct84.784.737$0.20
71Qwen3 VL 32B84.784.793$0.70
72Sarvam M84.784.7136$0.00
73Qwen3-Next-80B-A3B84.384.3262K147$0.50
74Qwen3-235B-A22B-Instruct-250784.29870.3131K63$0.15
75Gemini 2.0 Flash Thinking83.994.473.3$0.00
76Ring-flash-2.083.783.7$0.10
77Qwen3 32B83.596.181.472.9131K328$0.08
78Qwen2.5 Max83.583.550$1.60
79MiniMax M2.182.782.7205K92$0.29
80Qwen3 4B 250782.782.7$0.00
81Gemini 1.5 Flash82.782.786.282.677.91M150$0.15
82Qwen3 30B A3B82.495.980.470.9131K122$0.09
83Qwen3 VL 30B A3B82.382.3122$0.20
84Qwen3 Max Thinking82.382.3262K45$0.78
85DeepSeek-R182.396.668128K189$0.55
86Magistral Medium 1.2828242$2.00
87Qwen3 30B A3B 2507 Instruct81.997.566.3122$0.20
88Sonar81.781.7127K$1.00
89Qwen381.581.5128K
90Qwen3 Max80.780.7262K45$0.78
91Qwen2.5 32B Instruct80.580.595.983.1$0.00
92Qwen2.5 Turbo80.580.567$0.10
93Magistral Small 1.280.380.3106$0.50
94Motif-2-12.7B-Reasoning80.380.3$0.00
95DeepSeek R1 Distill Qwen 32B80.294.383.363128K37$0.12
96Falcon-H1R-7B8080$0.00
97Phi 4 Reasoning Plus79.781.378
98MiniMax M1 80k79.59861$0.60
99Doubao Seed Code79.379.3$0.00
100Claude 3.7 Sonnet79.196.2806182200K101$3.00
101Solar Pro 27996.761.3$0.00
102Mi:dm K 2.5 Pro78.778.7$0.00
103DeepSeek R1 0528 Qwen3 8B78.593.263.7$0.00
104GPT-578.499.426.393.394.684.7400K100$1.25
105Gemini 2.5 Flash78.378.3$0.00
106MiniMax-M278.378.3205K91$0.26
107K2-V278.378.3$0.00
108DeepSeek R1 Distill Llama 70B78.394.586.753.7128K37$0.10
109Claude Opus 4.17878200K120$15.00
110Grok-277.877.876.1128K85$2.00
111Llama 3.1 Tulu3 405B77.877.8$0.00
112Llama-3.3 Nemotron Super 49B v177.596.658.4$0.00
113Olmo 3.1 32B Think77.377.3$0.00
114Claude 3.5 Sonnet77.177.196.491.678.3200K101$3.00
115Qwen3 14B77.196.158132K62$0.10
116Qwen3 30B A3B 250776.997.656.3151$0.30
117Qwen2.5 Coder 32B Instruct76.776.791.157.2128K110$0.66
118DeepSeek R1 Distill Qwen 14B76.593.98055.7$0.00
119DeepSeek-V2.576.376.395.174.78K100$0.14
120Granite 3.3 8B Instruct75.16980.981.2
121Granite 3.3 8B Base75.1695981.2
122NVIDIA Nemotron Nano 12B v2 VL7575244$0.20
123Kimi K274.697.169.657131K26$0.57
124Sonar Pro74.574.5200K$3.00
125DeepSeek-Coder-V274.374.3$0.00
126Qwen3 Omni 30B A3B7474102$0.30
127Olmo 3 32B Think73.773.766K$0.15
128GPT-4 Turbo73.773.788.572.6128K100$10.00
129Grok73.773.7$0.00
130Gemini 2.5 Flash Lite73.496.949.81M6$0.10
131o373.399.215.891.686.4200K50$2.00
132GLM 4.5V737366K85$0.60
133Cogito v2.172.772.756$1.30
134Llama 3.1 Nemotron Nano 4B v1.172.494.750$0.00
135Qwen3 VL 30B A3B Instruct72.372.3262K123$0.13
136Claude 3.5 Haiku72.172.185.669.4200K104$0.80
137Ling-1T71.371.3$0.00
138Llama 3.1 Nemotron Nano 8B V171.395.447.1
139Qwen3 VL 235B A22B Instruct70.770.7262K51$0.20
140Olmo 3 7B Think70.770.7$0.00
141QwQ-32B-Preview70.390.65033K99$0.15
142Qwen2 72B Instruct70.170.191.159.7$0.00
143DeepSeek R1 Distill Llama 8B70.189.18041.3$0.00
144Hermes 4 - Llama-3.1 405B69.769.734$1.00
145NVIDIA Nemotron Nano 9B V269.769.7129$0.00
146Qwen3 Next 80B A3B Instruct69.569.5262K161$0.09
147Magistral Medium69.373.664.9
148Phi-4-multimodal-instruct69.369.3128K25$0.05
149Phi 4 Reasoning69.175.362.9
150Gemini 1.5 Flash 8B68.968.958.71M150$0.07
151Magistral Small 168.896.341.3$0.00
152Gemini 2.5 Flash-Lite68.768.7$0.10
153Hermes 4 - Llama-3.1 70B68.768.760$0.10
154Qwen3 VL 32B Instruct68.368.3262K76$0.10
155Mistral Saba67.767.7$0.00
156ERNIE 4.5 300B A47B67.293.141.3131K24$0.28
157o1-preview67.292.490.84285.5128K66$15.00
158GPT-5 mini6722.187.891.1400K200$0.25
159Qwen3 Coder 480B A35B Instruct66.894.239.369$0.30
160Magistral Small 250666.870.762.8
161QwQ-32B66.490.679.52931$0.70
162Magistral Medium 16691.740.3$0.00
163Qwen2.5-Coder 7B Instruct666683.946.6$0.00
164Ling-flash-2.065.365.391$0.10
165o3-mini6598.59.29287.397.9200K115$1.10
166DeepSeek-V3 032464.89459.441164K$0.28
167Kimi K2 090564.77257.389.1262K16$0.60
168Claude 3 Opus64.164.19590.760.1200K120$15.00
169Qwen3 1.7B64.189.438.7138$0.10
170Kimi K2 Instruct63.897.438.897.369.649.5131K45$0.57
171Kimi K2-Instruct-090563.897.438.869.649.5
172Llama 3.2 90B Instruct62.962.986.968128K100$0.35
173Reka Flash 361.589.333.766K93$0.10
174Jamba 1.5 Large60.660.687256K100$2.00
175Mistral Medium 360.590.730.3131K32$0.40
176DeepHermes 3 - Mistral 24B59.559.5$0.00
177Qwen3 Coder 30B A3B Instruct59.289.329160K97$0.07
178HyperCLOVA X SEED Think5959$0.00
179o158.9975.597.189.374.396.4200K66$15.00
180Jamba 1.6 Large585852$2.00
181Qwen3 4B57.893.322.3103$0.10
182Mistral Small 3.257.788.327100$0.10
183Gemini 2.0 Flash57.49321.789.71M183$0.10
184Qwen3 8B57.490.424.3131K69$0.05
185GPT-5 nano56.89.675.685.2400K500$0.05
186Mistral Small56.356.3134$0.20
187MiniMax M1 40k55.597.213.7$0.00
188Gemma 3 27B Instruct54.588.320.7$0.10
189Mixtral 8x22B Instruct54.554.566K$2.00
190GPT-4.1 Mini54.392.53549.640.21M150$0.40
191Llama 4 Maverick54.188.992.319.361.21M639$0.15
192Hermes 3 - Llama-3.1 70B53.853.832$0.30
193GPT-4.153.791.328.948.146.4871M100$2.00
194Reka Flash52.952.985$0.20
195DeepSeek R1 Distill Qwen 1.5B52.983.952.722$0.00
196Mistral Large52.752.7128K$2.00
197Qwen3 Omni 30B A3B Instruct52.352.3103$0.30
198Qwen3 4B 2507 Instruct52.352.3$0.00
199DeepSeek-V351.890.239.226131K100$0.23
200Gemma 3 12B Instruct51.885.318.3$0.10
201Nova Premier50.683.917.340$2.50
202Exaone 4.0 1.2B50.350.3$0.00
203DeepSeek-V3.149.933.566.349.8164K$0.21
204Qwen2.5 72B Instruct49.985.895.81483.1131K100$0.36
205Llama 3 8B Instruct49.949.98K81$0.04
206Phi 449.58180.61880.416K33$0.07
207Ling-mini-2.049.349.3$0.00
208Llama 4 Scout49.284.490.61450.310M776$0.08
209Devstral Small48.968.429.3190$0.10
210Llama 3 70B Instruct48.348.38K45$0.51
211LFM 40B4848$0.00
212Command A47.581.913256K203$2.50
213GPT-4o-mini46.878.98714.770.2128K92$0.15
214Qwen3 0.6B46.57518225$0.10
215GPT-4.1 Nano46.184.829.4241M200$0.10
216Gemma 3n E4B Instruct45.777.114.356$0.00
217Gemma 3 4B Instruct44.776.612.7$0.00
218GPT-3.5 Turbo44.144.156.343.116K100$0.50
219Mistral Large 243.873.69314128K42$2.00
220Grok Code Fast 143.343.3$0.00
221Nova Pro42.878.694.8776.6300K100$0.80
222GPT-4o42.789.313.125.7128K132$2.50
223Llama 3.3 70B Instruct42.577.391.17.777131K2220$0.10
224Llama 3.1 Nemotron 70B Instruct42.273.391.411292$1.20
225Nova Lite41.876.594.5773.3300K100$0.06
226Claude 3 Sonnet41.441.492.383.543.1200K120$3.00
227Olmo 3 7B Instruct41.341.3$0.10
228Mistral Medium40.540.545$2.80
229Gemini 1.0 Pro40.340.332.633K120$0.50
230Gemma 3n E2B Instruct39.769.110.3$0.00
231Claude 3 Haiku39.439.488.975.138.9200K104$0.25
232Mistral Medium 3.138.338.3131K47$0.40
233Nova Micro38.270.392.3669.3128K100$0.03
234Phi 4 Mini Instruct38.269.66.7131K$0.08
235Mistral Large 33838262K54$0.50
236Mistral Small 337.971.54.333K136$0.05
237Devstral Medium37.770.74.7131K72$0.40
238Claude 2.137.437.4$0.00
239Mistral Small 3.137.270.73.7134$0.10
240Qwen3 VL 4B Instruct3737$0.00
241Pixtral Large36.971.42.3131K0$2.00
242Llama 3.1 405B Instruct36.770.396.8373.8128K100$0.89
243GPT-4.536.79736.785128K50$75.00
244Devstral 236.736.7262K51$0.40
245Granite 3.3 8B36.666.56.7376$0.00
246Kimi Linear 48B A3B Instruct36.336.3$0.00
247Jamba 1.5 Mini35.735.775.8256K100$0.20
248Llama 3.1 70B Instruct34.564.94131K1204$0.40
249Devstral Small 234.334.362$0.00
250Solar Mini33.133.163$0.20
251Llama 2 Chat 13B32.932.9$0.00
252Llama 2 Chat 70B32.332.3$0.00
253Ministral 3 8B31.731.7262K86$0.15
254Jamba Large 1.731.2602.3256K48$2.00
255Qwen3 VL 8B30.730.7120$0.20
256OpenChat 3.530.730.7$0.00
257Ministral 3 14B3030262K67$0.20
258Mixtral 8x7B Instruct29.929.9$0.50
259Llama 3.1 8B Instruct28.151.94.3131K2047$0.02
260Command R+27.927.970.7128K100$0.15
261DBRX Instruct27.927.9$0.00
262Qwen3 VL 8B Instruct27.327.3256K145$0.08
263Llama 3.2 11B Instruct26.751.668.91.751.9128K168$0.05
264Claude Instant26.426.4$0.00
265Llama 3.2 3B Instruct26.148.977.758.23.348131K172$0.05
266Gemma 3 1B Instruct25.948.43.3$0.00
267Qwen3 VL 4B25.725.7$0.00
268Jamba 1.6 Mini25.725.7183$0.20
269LFM2 8B A1B25.325.3$0.00
270Gemini Diffusion23.323.3
271Phi-3 Mini Instruct 3.8B2345.70.3$0.00
272Ministral 3 3B2222131K154$0.10
273DeepHermes 3 - Llama-3.1 8B21.821.8$0.00
274Granite 4.0 H Small13.713.7524$0.10
275Jamba 1.7 Mini13.125.80.3$0.00
276Mistral 7B Instruct12.112.190$0.20
277Gemma 3n E4B Instructed LiteRT Preview11.660.711.6
278Gemma 3n E4B Instructed11.66711.632K42$20.00
279Jamba Reasoning 3B10.710.7$0.00
280LFM2 2.6B8.38.3$0.00
281Llama 3.2 1B Instruct7140131K91$0.03
282Gemma 3n E2B Instructed LiteRT (Preview)6.753.16.7
283Gemma 3n E2B Instructed6.753.16.7
284Granite 4.0 H 1B6.36.3$0.00
285Granite 4.0 1B6.36.3$0.00
286Granite 4.0 Micro66131K$0.02
287Llama 2 Chat 7B5.95.9113$0.10
288OLMo 2 32B3.33.3$0.00
289LFM2 1.2B3.33.3$0.00
290Gemma 3 270M2.32.3$0.00
291Granite 4.0 H 350M1.31.3$0.00
292OLMo 2 7B0.70.7$0.00
293Molmo 7B-D00$0.00
294Granite 4.0 350M00$0.00

294 models ranked on Math. The intelligence index is a balanced mean of per-category scores; category columns average the benchmarks within each. Scores are curated approximations — see each model for sources. Click any column to sort.