AI Hub
Leaderboards

Model rankings

A balanced intelligence index averages each model's per-category scores. Drill into a category for individual benchmarks, or sort by speed, price, and context. See what changed → How this is calculated → Embed this leaderboard →

Updated May 25, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Report an error

Top intelligence
Sonar Reasoning Pro
95.7 index
Top reasoning
Claude Opus 4.7
94.2
Top math
Grok-4 Heavy
100
Fastest
Llama 3.3 70B Instruct
2220 tok/s
Cheapest
Ling-2.6-flash
$0.01/M
Longest context
Llama 4 Scout
10M
Best open-weights
DeepSeek V3.2 Speciale
89.9 index

Price vs. intelligence

Intelligence index vs. input price — up and to the left is better value.

1008468523620$0$4$8$12$16$20Input price ($/M tokens)Intelligence indexSonar Reasoning Pro: 95.7 @ $2Sonar Reasoning ProQwen3.7 Max: 92.3 @ $2.5Gemini 3.5 Flash: 92.2 @ $1.5GPT-5.3-Codex: 91.5 @ $1.75Grok 4.20 0309 v2: 91.1 @ $2Claude Opus 4.7: 90.9 @ $5Gemini 3 Flash: 90.2 @ $0.5Gemini 3 FlashGrok 4.3: 90.1 @ $1.25DeepSeek V3.2 Speciale: 89.9 @ $0.287DeepSeek V3.2 SpecialeGPT-5.2-Codex: 89.9 @ $1.75DeepSeek-V4-Flash: 89.4 @ $0.1DeepSeek-V4-FlashQwen3.5 397B A17B: 89.3 @ $0.39Qwen3.5 397B A17BGLM 4.7: 89 @ $0.4GPT-5.1: 89 @ $1.25Qwen3.6 Max: 88.8 @ $1.04Grok 4.20 0309: 88.5 @ $2GPT-5 Pro: 88.4 @ $15GPT-5.1-Codex: 88.2 @ $1.25Qwen3.6 Plus: 88.2 @ $0.325DeepSeek-V4-Pro: 88.2 @ $0.435MiMo-V2-Flash: 88 @ $0.1MiMo-V2-FlashClaude Opus 4.5: 88 @ $5Kimi K2.5: 87.9 @ $0.4GPT-5.4 mini: 87.5 @ $0.75MiniMax M2.7: 87.4 @ $0.279GPT-5 Codex: 87.1 @ $1.25DeepSeek-V3.2: 87.1 @ $0.252MiMo-V2-Pro: 87 @ $1GLM 5.1: 86.8 @ $0.98Hy3: 86.7 @ $0.066MiMo-V2.5-Pro: 86.6 @ $1GPT-5.2: 86.2 @ $1.75Grok-3 Mini: 85.9 @ $0.3Qwen3.5-27B: 85.8 @ $0.195Qwen3.5-122B-A10B: 85.7 @ $0.26Gemma 4 31B: 85.7 @ $0.12Ring-2.6-1T: 85.7 @ $0.075Kimi K2 Thinking: 85.6 @ $0.6MiMo-V2-Omni-0327: 85.5 @ $0.4KAT-Coder-Pro V2: 85.5 @ $0.3MiMo-V2.5: 84.9 @ $0.4MiniMax M2.5: 84.8 @ $0.15GPT-5.1-Codex-Mini: 84.7 @ $0.25GLM 5 Turbo: 84.7 @ $1.2o3 Pro: 84.5 @ $20Qwen3.5-35B-A3B: 84.5 @ $0.139Qwen3 235B A22B 2507: 84.2 @ $0.4Qwen3.6 27B: 84.2 @ $0.3Qwen3.6 35B A3B: 84.1 @ $0.15MiniMax M2.1: 83.6 @ $0.29DeepSeek V3.1 Terminus: 83.5 @ $0.27Gemini 3.1 Pro: 83.2 @ $2Step 3.5 Flash: 83.1 @ $0.09MiMo-V2-Omni: 82.8 @ $0.4Gemini 3 Pro: 82.8 @ $2Qwen3.5 Omni Plus: 82.6 @ $0.4Grok-3: 82.6 @ $3o1-pro: 82.5 @ $150Gemini 3.1 Flash Lite: 82.2 @ $0.25Nova 2 Lite: 82.1 @ $0.3GLM-5: 81.9 @ $0.6KAT-Coder-Pro V1: 81.8 @ $0.3GPT-5.4 nano: 81.7 @ $0.2INTELLECT-3: 81 @ $0.2Nova 2.0 Pro: 80.9 @ $1.3Grok 3 mini Reasoning: 80.9 @ $0.3GLM 5V Turbo: 80.9 @ $1.2Qwen3.5-9B: 80.6 @ $0.04GPT-5: 80.5 @ $1.25Claude Sonnet 4.5: 80.4 @ $3Qwen3-Next-80B-A3B: 80.3 @ $0.5NVIDIA Nemotron 3 Nano 30B A3B: 80.1 @ $0.1NVIDIA Nemotron 3 Super 120B A12B: 80 @ $0.3Qwen3-235B-A22B-Thinking-2507: 79.6 @ $0.3gpt-oss-120b: 79.6 @ $0.039Qwen3 Max: 79.5 @ $0.78Llama Nemotron Super 49B v1.5: 79.4 @ $0.1Claude Opus 4.6: 79.4 @ $5Gemma 4 26B A4B: 79.2 @ $0.06GPT-5 mini: 79.2 @ $0.25Qwen2.5 VL 72B Instruct: 79.1 @ $0.25Seed-OSS-36B-Instruct: 78.8 @ $0.2Grok 4 Fast: 78.7 @ $0.2Qwen3 VL 235B A22B: 78.4 @ $0.8Qwen3 VL 32B: 78.4 @ $0.7o4-mini: 78.4 @ $1.1Llama 3.1 Nemotron Ultra 253B v1: 78.3 @ $0.6Nova 2.0 Omni: 78.2 @ $0.3Grok 4: 78.2 @ $3Magistral Medium 1.2: 78.1 @ $2Nemotron Nano 9B V2: 77.6 @ $0.04Qwen3 Next 80B A3B Thinking: 77.5 @ $0.098Mercury 2: 77 @ $0.25Mistral Small 4: 76.9 @ $0.15Gemini 2.5 Pro Preview 06-05: 76.6 @ $1.25Claude Sonnet 4.6: 76.3 @ $3Qwen3 VL 30B A3B: 76.2 @ $0.2Qwen3 Max Thinking: 76.1 @ $0.78GPT-5.5: 76.1 @ $5MiniMax-M2: 76 @ $0.255Cogito v2.1: 75.8 @ $1.3MiniMax M1 80k: 75.5 @ $0.6Claude Opus 4.1: 75.4 @ $15Claude Haiku 4.5: 75.3 @ $1Trinity Large Thinking: 75.2 @ $0.22Ling-2.6-1T: 75.2 @ $0.075DeepSeek-R1: 75 @ $0.55DeepSeek VL2: 74.9 @ $9.5Qwen3 235B A22B: 74.9 @ $0.455Kimi K2.6: 74.9 @ $0.73GPT-5.4: 74.9 @ $2.5Mistral Medium 3.5: 74.8 @ $1.5Qwen3 30B A3B 2507: 74.7 @ $0.3Claude 3.7 Sonnet: 74.7 @ $3Ring-flash-2.0: 74.6 @ $0.1Claude Sonnet 4: 74.5 @ $3Qwen3.5 Omni Flash: 74.2 @ $0.1Magistral Small 1.2: 73.9 @ $0.5Qwen3 32B: 73.8 @ $0.08Qwen3 Coder Next: 73.7 @ $0.11gpt-oss-20b: 73.6 @ $0.03Kimi K2: 73.6 @ $0.57Hermes 4 - Llama-3.1 405B: 73.5 @ $1Qwen3 Omni 30B A3B: 73.4 @ $0.3Gemini 2.5 Flash: 73.1 @ $0.3GLM-4.5: 73 @ $0.6Solar Pro 3: 72.4 @ $0.15GLM-4.6: 72.4 @ $0.43Gemini 2.5 Flash-Lite: 72.3 @ $0.1Qwen3-235B-A22B-Instruct-2507: 72.2 @ $0.15DeepSeek V3.2 Exp: 72.2 @ $0.27Qwen3 30B A3B: 71.7 @ $0.09Gemini 2.5 Pro: 71.6 @ $1.25o3: 71.6 @ $2Hermes 4 - Llama-3.1 70B: 71.3 @ $0.1GPT-5 nano: 71.2 @ $0.05Kimi K2 0905: 71 @ $0.6Ministral 8B Instruct: 70.9 @ $0.1Qwen3 VL 235B A22B Instruct: 70.9 @ $0.2o1-mini: 70.5 @ $3GLM 4.5 Air: 70.4 @ $0.13Claude 3.5 Sonnet: 70.3 @ $3DeepSeek R1 Distill Llama 70B: 70.1 @ $0.1GLM 4.5V: 70.1 @ $0.6GLM 4.6V: 69.6 @ $0.3Olmo 3 32B Think: 69.5 @ $0.15NVIDIA Nemotron Nano 12B v2 VL: 69.4 @ $0.2Claude Opus 4: 69.4 @ $15Qwen3 30B A3B 2507 Instruct: 69.3 @ $0.2Qwen3 Next 80B A3B Instruct: 68.9 @ $0.09DeepSeek R1 Distill Qwen 32B: 68.4 @ $0.12ERNIE 4.5 300B A47B: 68.2 @ $0.28QwQ-32B: 67.8 @ $0.7Gemini 1.5 Pro: 67.3 @ $1.25Ling-flash-2.0: 66.9 @ $0.1Qwen3 14B: 66.8 @ $0.1Qwen3 Coder 480B A35B Instruct: 66.5 @ $0.3Kimi K2 Instruct: 66.5 @ $0.57Qwen3 VL 30B A3B Instruct: 66.5 @ $0.13Qwen3 VL 32B Instruct: 66.5 @ $0.104Pixtral-12B: 66.1 @ $0.15DeepSeek-V3 0324: 65.9 @ $0.28o1: 65.4 @ $15o3-mini: 64.1 @ $1.1Llama 4 Maverick: 63.9 @ $0.15GPT-4.1: 63.8 @ $2Qwen2.5 Max: 63.6 @ $1.6DeepSeek-V2.5: 63.4 @ $0.14DeepSeek-R1-0528: 63.3 @ $0.55QwQ-32B-Preview: 62.6 @ $0.15Grok-2: 62.4 @ $2Mistral Small 3 24B Instruct: 62.1 @ $0.1Gemini 1.5 Flash: 61.9 @ $0.15Nova Pro: 61.6 @ $0.8MiniMax-M1: 61.5 @ $0.4Llama 3.1 405B Instruct: 60.9 @ $0.89Gemini 2.0 Flash: 60.3 @ $0.1DeepSeek-V3.1: 59.8 @ $0.21GPT-4 Turbo: 59.8 @ $10GPT-4.5: 59.4 @ $75Ling-2.6-flash: 59.3 @ $0.01Qwen2.5 72B Instruct: 59.1 @ $0.36Llama 4 Scout: 58.9 @ $0.08Sonar Pro: 58.8 @ $3Mistral Medium 3: 58.6 @ $0.4Claude 3 Opus: 58.5 @ $15Gemma 3 27B: 58.4 @ $0.08Mistral Large 3: 58.3 @ $0.5GPT-4: 58.3 @ $30GPT-4.1 Mini: 58.2 @ $0.4GLM 4.7 Flash: 58.1 @ $0.06DeepSeek-V3: 58.1 @ $0.229Qwen3 8B: 57.8 @ $0.05Nova Lite: 57.7 @ $0.06Qwen3 Omni 30B A3B Instruct: 57.3 @ $0.3o1-preview: 57.3 @ $15Gemini 2.5 Flash Lite: 57 @ $0.1Sonar: 56.8 @ $1Qwen3 4B: 56.5 @ $0.1GPT-4o: 56.4 @ $2.5Reka Flash 3: 56.2 @ $0.1Llama 3.1 70B Instruct: 56 @ $0.4Command A: 55.9 @ $2.5Gemma 3 12B: 55.5 @ $0.04Qwen3 Coder 30B A3B Instruct: 55.4 @ $0.07Claude 3.5 Haiku: 54.5 @ $0.8Gemini 2.0 Flash Lite: 54.4 @ $0.075Devstral 2: 54.3 @ $0.4Llama 3.2 90B Instruct: 54 @ $0.35Nova Premier: 53.1 @ $2.5Pixtral Large: 53.1 @ $2Reka Flash: 52.9 @ $0.2Mistral Medium 3.1: 51.5 @ $0.4Mistral Small 3.2: 51 @ $0.1Mistral Small 3.1 24B Base: 50.9 @ $0.1Llama 3.3 70B Instruct: 50.6 @ $0.1Qwen2.5 Turbo: 50.3 @ $0.1Qwen2.5 Coder 32B Instruct: 50.1 @ $0.66Qwen3 VL 8B: 49.7 @ $0.2GPT-4o-mini: 49.1 @ $0.15Nova Micro: 49 @ $0.03Gemini 1.5 Flash 8B: 48.4 @ $0.07Ministral 3 14B: 47.9 @ $0.2Mistral Large 2: 47.9 @ $2Devstral Medium: 47.8 @ $0.4Phi 4: 47.6 @ $0.065LFM2-24B-A2B: 47.4 @ $0.03Qwen3 1.7B: 46.9 @ $0.1Nemotron 3 Nano Omni 30B A3B Reasoning: 46.9 @ $0.1Qwen2.5 7B Instruct: 46.6 @ $0.04Phi-4-multimodal-instruct: 46.2 @ $0.05Phi-3.5-mini-instruct: 46 @ $0.1Claude 3 Sonnet: 45.8 @ $3Gemma 3 4B: 45.8 @ $0.04Devstral Small: 45.3 @ $0.1Llama 3.1 8B Instruct: 45 @ $0.02Gemma 3 27B Instruct: 44.5 @ $0.1Llama 3.1 Nemotron 70B Instruct: 43.7 @ $1.2Mistral Small 3: 43.6 @ $0.05Ministral 3 8B: 43.3 @ $0.15Granite 4.1 8B: 43.3 @ $0.05Qwen3 VL 8B Instruct: 43 @ $0.08Jamba 1.5 Large: 42.8 @ $2Jamba 1.6 Large: 42.6 @ $2Hermes 3 - Llama-3.1 70B: 42.5 @ $0.3Mistral Small 3.1: 42.4 @ $0.1GPT-4.1 Nano: 42.1 @ $0.1Llama 3 70B Instruct: 40.8 @ $0.51Mistral Small: 40.3 @ $0.2Gemma 3 12B Instruct: 40 @ $0.1Olmo 3 7B Instruct: 40 @ $0.1Mistral Large: 39.3 @ $2Mixtral 8x22B Instruct: 39.1 @ $2Claude 3 Haiku: 38.9 @ $0.25Llama 3.2 11B Instruct: 36.7 @ $0.05Jamba Large 1.7: 36.5 @ $2Granite 4.0 H Small: 35.7 @ $0.1GPT-3.5 Turbo: 35.2 @ $0.5Gemini 1.0 Pro: 34 @ $0.5Ministral 3 3B: 33.7 @ $0.1Mistral Medium: 33.6 @ $2.8Solar Mini: 33.1 @ $0.2Phi 4 Mini Instruct: 32.6 @ $0.08Llama 3 8B Instruct: 32.4 @ $0.04Llama 3.2 3B Instruct: 30.8 @ $0.051Jamba 1.5 Mini: 29.6 @ $0.2Qwen3 0.6B: 29.3 @ $0.1Command R+: 28.9 @ $0.15Apertus 70B Instruct: 27.2 @ $0.8Mixtral 8x7B Instruct: 26.1 @ $0.5Apertus 8B Instruct: 25.6 @ $0.1Granite 4.0 Micro: 25.6 @ $0.017Jamba 1.6 Mini: 24.9 @ $0.2Gemma 3n E4B Instructed: 24.8 @ $20Mistral 7B Instruct: 14.7 @ $0.2Llama 3.2 1B Instruct: 12.1 @ $0.027Llama 2 Chat 7B: 11.3 @ $0.1

Speed vs. intelligence

Intelligence index vs. output speed — up and to the right is fast and smart.

1008468523620080160240320400Output speed (tokens/s)Intelligence indexQwen3.7 Max: 92.3 @ 203Gemini 3.5 Flash: 92.2 @ 221GPT-5.3-Codex: 91.5 @ 73Grok 4.20 0309 v2: 91.1 @ 105Claude Opus 4.7: 90.9 @ 49Gemini 3 Flash: 90.2 @ 191Grok 4.3: 90.1 @ 88GPT-5.2-Codex: 89.9 @ 106DeepSeek-V4-Flash: 89.4 @ 109Qwen3.5 397B A17B: 89.3 @ 53GLM 4.7: 89 @ 98GPT-5.1: 89 @ 115Qwen3.6 Max: 88.8 @ 36Grok 4.20 0309: 88.5 @ 97GPT-5.1-Codex: 88.2 @ 188Qwen3.6 Plus: 88.2 @ 52DeepSeek-V4-Pro: 88.2 @ 30MiMo-V2-Flash: 88 @ 145Claude Opus 4.5: 88 @ 58Kimi K2.5: 87.9 @ 35GPT-5.4 mini: 87.5 @ 162MiniMax M2.7: 87.4 @ 50GPT-5 Codex: 87.1 @ 180MiMo-V2-Pro: 87 @ 60GLM 5.1: 86.8 @ 53Hy3: 86.7 @ 100MiMo-V2.5-Pro: 86.6 @ 58GPT-5.2: 86.2 @ 73Grok-3 Mini: 85.9 @ 100Qwen3.5-27B: 85.8 @ 91Qwen3.5-122B-A10B: 85.7 @ 129Gemma 4 31B: 85.7 @ 36Ring-2.6-1T: 85.7 @ 120Kimi K2 Thinking: 85.6 @ 100MiMo-V2-Omni-0327: 85.5 @ 110KAT-Coder-Pro V2: 85.5 @ 108MiMo-V2.5: 84.9 @ 92MiniMax M2.5: 84.8 @ 87GPT-5.1-Codex-Mini: 84.7 @ 175o3 Pro: 84.5 @ 25Qwen3.5-35B-A3B: 84.5 @ 121Qwen3 235B A22B 2507: 84.2 @ 59Qwen3.6 27B: 84.2 @ 64Qwen3.6 35B A3B: 84.1 @ 169MiniMax M2.1: 83.6 @ 92Gemini 3.1 Pro: 83.2 @ 142Step 3.5 Flash: 83.1 @ 194MiMo-V2-Omni: 82.8 @ 108Gemini 3 Pro: 82.8 @ 141Qwen3.5 Omni Plus: 82.6 @ 54Step 3.5 Flash 2603: 82.6 @ 197Grok-3: 82.6 @ 100Gemini 3.1 Flash Lite: 82.2 @ 342Nova 2 Lite: 82.1 @ 229GLM-5: 81.9 @ 67KAT-Coder-Pro V1: 81.8 @ 108GPT-5.4 nano: 81.7 @ 157Nova 2.0 Pro: 80.9 @ 149Grok 3 mini Reasoning: 80.9 @ 33Qwen3.5-9B: 80.6 @ 51GPT-5: 80.5 @ 100Claude Sonnet 4.5: 80.4 @ 42Qwen3-Next-80B-A3B: 80.3 @ 147NVIDIA Nemotron 3 Nano 30B A3B: 80.1 @ 148NVIDIA Nemotron 3 Super 120B A12B: 80 @ 211gpt-oss-120b: 79.6 @ 500Qwen3 Max: 79.5 @ 45Llama Nemotron Super 49B v1.5: 79.4 @ 51Claude Opus 4.6: 79.4 @ 48Gemma 4 26B A4B: 79.2 @ 66GPT-5 mini: 79.2 @ 200Seed-OSS-36B-Instruct: 78.8 @ 37Grok 4 Fast: 78.7 @ 90Qwen3 VL 235B A22B: 78.4 @ 34Qwen3 VL 32B: 78.4 @ 93o4-mini: 78.4 @ 115Llama 3.1 Nemotron Ultra 253B v1: 78.3 @ 42Grok 4: 78.2 @ 100Magistral Medium 1.2: 78.1 @ 42Qwen3.5 4B: 77.1 @ 164Mercury 2: 77 @ 790Mercury 2Mistral Small 4: 76.9 @ 145Gemini 2.5 Pro Preview 06-05: 76.6 @ 85Claude Sonnet 4.6: 76.3 @ 75Qwen3 VL 30B A3B: 76.2 @ 122Qwen3 Max Thinking: 76.1 @ 45GPT-5.5: 76.1 @ 67MiniMax-M2: 76 @ 91Cogito v2.1: 75.8 @ 56Claude Opus 4.1: 75.4 @ 120Claude Haiku 4.5: 75.3 @ 100Trinity Large Thinking: 75.2 @ 129DeepSeek-R1: 75 @ 189DeepSeek VL2: 74.9 @ 22Qwen3 235B A22B: 74.9 @ 68Kimi K2.6: 74.9 @ 57GPT-5.4: 74.9 @ 84Mistral Medium 3.5: 74.8 @ 140Qwen3 30B A3B 2507: 74.7 @ 151Claude 3.7 Sonnet: 74.7 @ 101Claude Sonnet 4: 74.5 @ 101Qwen3.5 Omni Flash: 74.2 @ 235Magistral Small 1.2: 73.9 @ 106Sarvam 105B: 73.8 @ 128Qwen3 32B: 73.8 @ 328Qwen3 Coder Next: 73.7 @ 92gpt-oss-20b: 73.6 @ 1000gpt-oss-20bKimi K2: 73.6 @ 26Hermes 4 - Llama-3.1 405B: 73.5 @ 34Qwen3 Omni 30B A3B: 73.4 @ 102Gemini 2.5 Flash: 73.1 @ 85GLM-4.5: 73 @ 85GLM-4.6: 72.4 @ 85Qwen3-235B-A22B-Instruct-2507: 72.2 @ 63DeepSeek V3.2 Exp: 72.2 @ 100Qwen3 30B A3B: 71.7 @ 122Gemini 2.5 Pro: 71.6 @ 85o3: 71.6 @ 50Hermes 4 - Llama-3.1 70B: 71.3 @ 60GPT-5 nano: 71.2 @ 500Kimi K2 0905: 71 @ 16Ministral 8B Instruct: 70.9 @ 0Qwen3 VL 235B A22B Instruct: 70.9 @ 51o1-mini: 70.5 @ 115GLM 4.5 Air: 70.4 @ 63Claude 3.5 Sonnet: 70.3 @ 101DeepSeek R1 Distill Llama 70B: 70.1 @ 37GLM 4.5V: 70.1 @ 85GLM 4.6V: 69.6 @ 44NVIDIA Nemotron Nano 12B v2 VL: 69.4 @ 244Claude Opus 4: 69.4 @ 120Qwen3 30B A3B 2507 Instruct: 69.3 @ 122Qwen3 Next 80B A3B Instruct: 68.9 @ 161DeepSeek R1 Distill Qwen 32B: 68.4 @ 37NVIDIA Nemotron Nano 9B V2: 68.3 @ 129ERNIE 4.5 300B A47B: 68.2 @ 24QwQ-32B: 67.8 @ 31Gemini 1.5 Pro: 67.3 @ 85Ling-flash-2.0: 66.9 @ 91Qwen3 14B: 66.8 @ 62Qwen3 Coder 480B A35B Instruct: 66.5 @ 69Kimi K2 Instruct: 66.5 @ 45Qwen3 VL 30B A3B Instruct: 66.5 @ 123Qwen3 VL 32B Instruct: 66.5 @ 76Pixtral-12B: 66.1 @ 0o1: 65.4 @ 66o3-mini: 64.1 @ 115Llama 4 Maverick: 63.9 @ 639GPT-4.1: 63.8 @ 100Qwen2.5 Max: 63.6 @ 50LongCat Flash Lite: 63.6 @ 110DeepSeek-V2.5: 63.4 @ 100Sarvam 30B: 63.3 @ 214DeepSeek-R1-0528: 63.3 @ 45QwQ-32B-Preview: 62.6 @ 99Grok-2: 62.4 @ 85Mistral Small 3 24B Instruct: 62.1 @ 134Gemini 1.5 Flash: 61.9 @ 150Nova Pro: 61.6 @ 100Llama 3.1 405B Instruct: 60.9 @ 100Gemini 2.0 Flash: 60.3 @ 183GPT-4 Turbo: 59.8 @ 100GPT-4.5: 59.4 @ 50Qwen2.5 72B Instruct: 59.1 @ 100Llama 4 Scout: 58.9 @ 776Llama 4 ScoutMistral Medium 3: 58.6 @ 32Claude 3 Opus: 58.5 @ 120Gemma 3 27B: 58.4 @ 33Mistral Large 3: 58.3 @ 54GPT-4: 58.3 @ 104GPT-4.1 Mini: 58.2 @ 150GLM 4.7 Flash: 58.1 @ 113DeepSeek-V3: 58.1 @ 100Qwen3 8B: 57.8 @ 69Nova Lite: 57.7 @ 100Qwen3 Omni 30B A3B Instruct: 57.3 @ 103o1-preview: 57.3 @ 66Gemini 2.5 Flash Lite: 57 @ 6Qwen3 4B: 56.5 @ 103Sarvam M: 56.4 @ 136GPT-4o: 56.4 @ 132Reka Flash 3: 56.2 @ 93Llama 3.1 70B Instruct: 56 @ 1204Llama 3.1 70B InstructCommand A: 55.9 @ 203Gemma 3 12B: 55.5 @ 33Qwen3 Coder 30B A3B Instruct: 55.4 @ 97Claude 3.5 Haiku: 54.5 @ 104Gemini 2.0 Flash Lite: 54.4 @ 85Devstral 2: 54.3 @ 51Llama 3.2 90B Instruct: 54 @ 100Nova Premier: 53.1 @ 40Pixtral Large: 53.1 @ 0Reka Flash: 52.9 @ 85Mistral Medium 3.1: 51.5 @ 47Mistral Small 3.2: 51 @ 100Mistral Small 3.1 24B Base: 50.9 @ 137Llama 3.3 70B Instruct: 50.6 @ 2220Llama 3.3 70B InstructQwen2.5 Turbo: 50.3 @ 67Qwen2.5 Coder 32B Instruct: 50.1 @ 110Qwen3 VL 8B: 49.7 @ 120GPT-4o-mini: 49.1 @ 92Nova Micro: 49 @ 100Gemini 1.5 Flash 8B: 48.4 @ 150Ministral 3 14B: 47.9 @ 67Mistral Large 2: 47.9 @ 42Devstral Medium: 47.8 @ 72Phi 4: 47.6 @ 33Devstral Small 2: 47.5 @ 62LFM2-24B-A2B: 47.4 @ 208Qwen3 1.7B: 46.9 @ 138Nemotron 3 Nano Omni 30B A3B Reasoning: 46.9 @ 301Qwen2.5 7B Instruct: 46.6 @ 138Phi-4-multimodal-instruct: 46.2 @ 25Phi-3.5-mini-instruct: 46 @ 23Claude 3 Sonnet: 45.8 @ 120Gemma 3 4B: 45.8 @ 33Qwen3.5 2B: 45.6 @ 328Devstral Small: 45.3 @ 190Llama 3.1 8B Instruct: 45 @ 2047Llama 3.1 8B InstructLlama 3.1 Nemotron 70B Instruct: 43.7 @ 292Mistral Small 3: 43.6 @ 136Ministral 3 8B: 43.3 @ 86Granite 4.1 8B: 43.3 @ 133Qwen3 VL 8B Instruct: 43 @ 145Jamba 1.5 Large: 42.8 @ 100Jamba 1.6 Large: 42.6 @ 52Hermes 3 - Llama-3.1 70B: 42.5 @ 32Mistral Small 3.1: 42.4 @ 134GPT-4.1 Nano: 42.1 @ 200Llama 3 70B Instruct: 40.8 @ 45Mistral Small: 40.3 @ 134Claude 3 Haiku: 38.9 @ 104Llama 3.2 11B Instruct: 36.7 @ 168Jamba Large 1.7: 36.5 @ 48Granite 4.0 H Small: 35.7 @ 524GPT-3.5 Turbo: 35.2 @ 100Gemma 3n E4B Instruct: 34.7 @ 56Gemini 1.0 Pro: 34 @ 120Ministral 3 3B: 33.7 @ 154Mistral Medium: 33.6 @ 45Solar Mini: 33.1 @ 63Granite 3.3 8B: 32.5 @ 376Llama 3 8B Instruct: 32.4 @ 81Llama 3.2 3B Instruct: 30.8 @ 172Tiny Aya Global: 30.5 @ 126Jamba 1.5 Mini: 29.6 @ 100Qwen3 0.6B: 29.3 @ 225Command R+: 28.9 @ 100Jamba 1.6 Mini: 24.9 @ 183Gemma 3n E4B Instructed: 24.8 @ 42Qwen3.5 0.8B: 23.6 @ 120Mistral 7B Instruct: 14.7 @ 90Llama 3.2 1B Instruct: 12.1 @ 91Llama 2 Chat 7B: 11.3 @ 113
#ModelCoding idxAider Polyglot EditMultiPL-EMBPPSWE-bench ProAider PolyglotLiveCodeBenchTerminal-BenchSWE-bench VerifiedHumanEvalContextSpeedIn $/M
1DeepSeek V3.2 Speciale89.689.6164K$0.29
2GLM 4.789.489.4203K98$0.40
3Claude Opus 4.787.687.61M49$5.00
4DeepSeek-V4-Pro87.193.580.61M30$0.44
5GPT-5.186.886.8400K115$1.25
6MiMo-V2-Flash86.886.8262K145$0.10
7DeepSeek-V3.286.286.2131K$0.25
8GPT-5.1-Codex84.984.9400K188$1.25
9GPT-5.284.789.480400K73$1.75
10Gemini 3 Flash84.490.8781M191$0.50
11Claude Opus 4.58487.180.9200K58$5.00
12Gemini 3 Pro8491.776.21M141$2.00
13GPT-5 mini83.883.8400K200$0.25
14GPT-5.1-Codex-Mini83.683.6400K175$0.25
15GPT-582.58884.674.993.4400K100$1.25
16Grok 4.1 Fast82.282.2$0.00
17ERNIE 5.0 Thinking81.281.2$0.00
18MiniMax M2.18181205K92$0.29
19Claude Opus 4.680.880.8951M48$5.00
20Apriel-v1.6-15B-Thinker80.780.7$0.00
21Gemini 3.1 Pro80.680.61M142$2.00
22Grok-3 Mini80.480.4128K100$0.30
23Grok 4 Fast80802M90$0.20
24DeepSeek V3.1 Terminus79.879.8164K$0.27
25Claude Sonnet 4.679.679.61M75$3.00
26Grok-4 Heavy79.479.4
27Grok-379.479.4128K100$3.00
28GPT-5 Codex79.38474.5400K180$1.25
29Grok 47979256K100$3.00
30GPT-5 nano78.978.9400K500$0.05
31Qwen3 235B A22B 250778.878.859$0.40
32Qwen3-Next-80B-A3B78.478.4262K147$0.50
33Kimi K2 Thinking78.385.371.3262K100$0.60
34GLM-577.877.8203K67$0.60
35INTELLECT-377.777.7131K$0.20
36gpt-oss-20b77.777.7131K1000$0.03
37o377.181.380.869.1200K50$2.00
38K-EXAONE76.876.8$0.00
39Qwen3 Max76.776.7262K45$0.78
40Doubao Seed Code76.676.6$0.00
41Seed-OSS-36B-Instruct76.576.537$0.20
42gpt-oss-120b75.187.862.4131K500$0.04
43Magistral Medium 1.2757542$2.00
44KAT-Coder-Pro V174.774.7108$0.30
45EXAONE 4.0 32B74.774.7$0.00
46NVIDIA Nemotron 3 Nano 30B A3B74.174.1148$0.10
47Qwen3 VL 32B73.873.893$0.70
48Llama Nemotron Super 49B v1.573.773.751$0.10
49Gemini 2.5 Pro73.372.776.580.163.81M85$1.25
50Nova 2.0 Pro7373149$1.30
51Apriel-v1.5-15B-Thinker72.872.8$0.00
52Gemini 2.5 Pro Preview 06-0572.882.26967.21M85$1.25
53Qwen2.5 14B Instruct72.872.88283.5
54Falcon-H1R-7B72.472.4$0.00
55NVIDIA Nemotron Nano 9B V272.472.4129$0.00
56Magistral Small 1.272.372.3106$0.50
57Gemini 2.5 Flash71.371.3$0.00
58Nova 2 Lite71.171.11M229$0.30
59Nemotron Nano 9B V271.171.1131K$0.04
60MiniMax M1 80k71.171.1$0.60
61Qwen3 30B A3B 250770.770.7151$0.30
62o4-mini70.358.268.985.968.1200K115$1.10
63Qwen3 VL 30B A3B69.769.7122$0.20
64Grok 3 mini Reasoning69.669.633$0.30
65Olmo 3.1 32B Think69.569.5$0.00
66K2-V269.469.4$0.00
67NVIDIA Nemotron Nano 12B v2 VL69.469.4244$0.20
68Cogito v2.168.868.856$1.30
69Gemini 2.5 Flash-Lite68.868.8$0.10
70Qwen3 Next 80B A3B Instruct68.787.849.868.4262K161$0.09
71Hermes 4 - Llama-3.1 405B68.668.634$1.00
72Qwen3 235B A22B68.365.981.470.7131K68$0.46
73Qwen3 Omni 30B A3B67.967.9102$0.30
74Ling-1T67.767.7$0.00
75Olmo 3 32B Think67.267.266K$0.15
76Llama 3.1 Nemotron Ultra 253B v166.366.342$0.60
77Claude Sonnet 4.566.271.45077.21M42$3.00
78MiniMax-M266.182.646.369.4205K91$0.26
79Nova 2.0 Omni6666$0.30
80Qwen3-235B-A22B-Instruct-250765.987.957.352.4131K63$0.15
81Qwen2.5-Omni-7B65.865.873.278.7
82Qwen3 32B65.765.7131K328$0.08
83MiniMax M1 40k65.765.7$0.00
84Grok Code Fast 165.765.7$0.00
85Mi:dm K 2.5 Pro65.665.6$0.00
86Hermes 4 - Llama-3.1 70B65.365.360$0.10
87Qwen2.5 72B Instruct65.375.188.255.586.6131K100$0.36
88Motif-2-12.7B-Reasoning65.165.1$0.00
89Qwen3 VL 235B A22B64.664.634$0.80
90Ring-1T64.364.3$0.00
91Qwen3 4B 250764.164.1$0.00
92DeepSeek V3.2 Exp63.574.574.137.767.8164K100$0.27
93QwQ-32B63.463.431$0.70
94HyperCLOVA X SEED Think62.962.9$0.00
95Ring-flash-2.062.862.8$0.10
96Qwen3 30B A3B62.662.6131K122$0.09
97o3-mini62.560.466.773.449.3200K115$1.10
98Gemini 2.5 Flash62.156.761.969.560.41M85$0.30
99DeepSeek-R161.761.7128K189$0.55
100Olmo 3 7B Think61.761.7$0.00
101Solar Pro 261.661.6$0.00
102Claude Opus 4.161.165.443.374.5200K120$15.00
103Kimi K2 0905616194.5262K16$0.60
104Kimi K260.755.665.8131K26$0.57
105GLM 4.5V60.460.466K85$0.60
106Kimi K2 Instruct60.485.7603065.893.3131K45$0.57
107Qwen3 VL 235B A22B Instruct59.459.4262K51$0.20
108GLM-4.659.369.540.568203K85$0.43
109Ling-flash-2.058.958.991$0.10
110GPT-5.558.658.61.1M67$5.00
111Kimi K2.658.658.6262K57$0.73
112Qwen3 Coder 480B A35B Instruct58.558.569$0.30
113Claude Opus 458.463.639.272.5200K120$15.00
114GLM-4.558.272.937.564.2131K85$0.60
115Kimi K2-Instruct-09055885.76053.72565.8
116Claude Sonnet 457.965.535.572.71M101$3.00
117GPT-5.457.757.71.1M84$2.50
118o1-mini57.657.692.4128K115$3.00
119DeepSeek R1 Distill Llama 70B57.557.5128K37$0.10
120DeepSeek R1 Distill Qwen 32B57.257.2128K37$0.12
121DeepSeek-V3.155.568.456.431.366164K$0.21
122Qwen3-Coder55.455.4262K
123o154.567.94188.1200K66$15.00
124Claude Haiku 4.553.839.561.54173.3200K100$1.00
125Phi 4 Reasoning53.853.892.9
126Qwen3 Max Thinking53.553.5262K45$0.78
127Phi 4 Reasoning Plus53.153.192.3
128DeepSeek R1 Distill Qwen 14B53.153.1$0.00
129GLM 4.5 Air52.870.73057.6131K63$0.13
130Magistral Medium 152.752.7$0.00
131Qwen3 14B52.352.3132K62$0.10
132DeepSeek-V352.279.749.637.642131K100$0.23
133Exaone 4.0 1.2B51.651.6$0.00
134Qwen3 30B A3B 2507 Instruct51.551.5122$0.20
135Qwen3 VL 32B Instruct51.451.4262K76$0.10
136Magistral Small 151.451.4$0.00
137DeepSeek R1 0528 Qwen3 8B51.351.3$0.00
138Magistral Small 250651.351.3
139GPT-4.151.252.951.645.754.6941M100$2.00
140Claude 3.7 Sonnet50.947.335.270.3200K101$3.00
141Qwen2.5 32B Instruct50.175.48424.888.4$0.00
142DeepSeek R1 Zero5050
143QwQ-32B-Preview505033K99$0.15
144Qwen2.5 7B Instruct49.670.479.228.784.8131K138$0.04
145Llama 3.1 Nemotron Nano 4B v1.149.349.3$0.00
146DeepSeek-V3 032449.249.2164K$0.28
147DeepSeek-R1-052848.871.673.35.744.6131K45$0.55
148Magistral Medium48.747.150.3
149Qwen3 VL 30B A3B Instruct47.647.6262K123$0.13
150ERNIE 4.5 300B A47B46.746.7131K24$0.28
151Mistral Large 346.546.5262K54$0.50
152Qwen3 4B46.546.5103$0.10
153Devstral 244.844.8262K51$0.40
154Claude 3.5 Sonnet43.638.14993.7200K101$3.00
155Reka Flash 343.543.566K93$0.10
156Ling-mini-2.042.942.9$0.00
157Qwen2 7B Instruct42.959.167.226.679.9
158Qwen2 72B Instruct42.669.280.215.986$0.00
159Qwen3 Omni 30B A3B Instruct42.242.2103$0.30
160GPT-4.541.544.93888128K50$75.00
161o1-preview41.341.3128K66$15.00
162GLM 4.6V41.141.1131K44$0.30
163Qwen3 8B40.640.6131K69$0.05
164Mistral Medium 3.140.640.6131K47$0.40
165Qwen3 Coder 30B A3B Instruct40.340.3160K97$0.07
166Mistral Medium 34040131K32$0.40
167DeepSeek R1 Distill Llama 8B39.639.6$0.00
168Kimi Linear 48B A3B Instruct37.837.8$0.00
169Qwen3 4B 2507 Instruct37.737.7$0.00
170DeepSeek R1 Distill Qwen 7B37.637.6
171Llama 4 Maverick36.777.643.4301M639$0.15
172Claude 3.5 Haiku3631.440.688.1200K104$0.80
173Qwen2.5 Max35.935.950$1.60
174Qwen3 VL 8B35.335.3120$0.20
175Gemini 2.0 Flash35.135.11M183$0.10
176Ministral 3 14B35.135.1262K67$0.20
177Devstral Small 234.834.862$0.00
178Gemini 2.0 Pro34.734.7$0.00
179GPT-4.1 Mini34.631.634.748.323.61M150$0.40
180Devstral Medium33.733.7131K72$0.40
181Qwen3 VL 8B Instruct33.233.2256K145$0.08
182Llama 4 Scout32.867.832.810M776$0.08
183Gemini 2.0 Flash Thinking32.132.1$0.00
184Qwen3 VL 4B3232$0.00
185Nova Premier31.731.740$2.50
186Gemini 1.5 Pro31.631.684.12M85$1.25
187Qwen2.5 Coder 32B Instruct31.490.231.492.7128K110$0.66
188GPT-4o31.218.230.742.533.290.2128K132$2.50
189Qwen3 1.7B30.830.8138$0.10
190Gemini 2.5 Flash Lite30.726.733.731.61M6$0.10
191Llama 3.1 405B Instruct30.530.589128K100$0.89
192Ministral 3 8B30.330.3262K86$0.15
193Gemma 3 27B29.774.429.787.8131K33$0.08
194Sonar29.529.5127K$1.00
195Sarvam M29.529.5136$0.00
196Mistral Large 229.329.392128K42$2.00
197GPT-4 Turbo29.129.187.1128K100$10.00
198Llama 3.1 Tulu3 405B29.129.1$0.00
199Qwen3 VL 4B Instruct2929$0.00
200Llama 3.3 70B Instruct28.828.888.4131K2220$0.10
201Command A28.728.7256K203$2.50
202Llama-3.3 Nemotron Super 49B v12891.328$0.00
203Claude 3 Opus27.927.984.9200K120$15.00
204Sonar Pro27.527.5200K$3.00
205Mistral Small 3.227.527.5100$0.10
206Gemini 1.5 Flash27.327.374.31M150$0.15
207Gemini Diffusion26.97630.922.989.6
208Grok-226.726.788.4128K85$2.00
209Olmo 3 7B Instruct26.626.6$0.10
210Pixtral Large26.126.1131K0$2.00
211Devstral Small25.825.8190$0.10
212Mistral Small 325.225.233K136$0.05
213Granite 4.0 H Small25.125.1524$0.10
214Ministral 3 3B24.724.7131K154$0.10
215Gemma 3 12B24.67324.685.4131K33$0.04
216Grok24.124.1$0.00
217Nova Pro23.323.389300K100$0.80
218Llama 3.1 70B Instruct23.223.280.5131K1204$0.40
219Phi 423.123.182.816K33$0.07
220Gemini 1.5 Flash 8B21.721.71M150$0.07
221Llama 3.2 90B Instruct21.421.4128K100$0.35
222Mistral Small 3.121.221.2134$0.10
223Jamba Reasoning 3B2121$0.00
224Llama 3 70B Instruct19.819.88K45$0.51
225DeepHermes 3 - Mistral 24B19.519.5$0.00
226Claude 2.119.519.5$0.00
227Hermes 3 - Llama-3.1 70B18.818.832$0.30
228Gemini 2.0 Flash Lite18.518.51M85$0.08
229Qwen2.5-Coder 7B Instruct18.283.518.288.4$0.00
230Jamba Large 1.718.118.1256K48$2.00
231Granite 4.0 Micro1818131K$0.02
232Mistral Large17.817.8128K$2.00
233Claude 3 Sonnet17.517.573200K120$3.00
234Jamba 1.6 Large17.217.252$2.00
235Claude 217.117.171.2100K$0.00
236Llama 3.1 Nemotron 70B Instruct16.916.9292$1.20
237DeepSeek R1 Distill Qwen 1.5B16.916.9$0.00
238DeepSeek-V2.516.816.8898K100$0.14
239Nova Lite16.716.785.4300K100$0.06
240Qwen2.5 Turbo16.316.367$0.10
241GPT-4.1 Nano16.26.29.832.61M200$0.10
242GPT-4o-mini1623.48.787.2128K92$0.15
243DeepSeek Coder V2 Lite Instruct15.815.8$0.00
244Claude 3 Haiku15.415.475.9200K104$0.25
245LFM2 8B A1B15.115.1$0.00
246Mixtral 8x22B Instruct14.814.866K$2.00
247Gemma 3n E4B Instruct14.614.656$0.00
248Jamba 1.5 Large14.314.3256K100$2.00
249Mistral Small14.114.1134$0.20
250Nova Micro141481.1128K100$0.03
251Gemma 3 27B Instruct13.713.7$0.10
252Gemma 3 12B Instruct13.713.7$0.10
253Gemma 3n E4B Instructed LiteRT Preview13.263.613.275
254Gemma 3n E2B Instructed LiteRT (Preview)13.256.613.266.5
255Gemma 3n E4B Instructed13.263.613.27532K42$20.00
256Gemma 3n E2B Instructed13.256.613.266.5
257Phi-4-multimodal-instruct13.113.1128K25$0.05
258Granite 3.3 8B12.712.7376$0.00
259Gemma 3 4B12.663.212.671.3131K33$0.04
260Phi 4 Mini Instruct12.612.6131K$0.08
261Command R+12.212.2128K100$0.15
262Qwen3 0.6B12.112.1225$0.10
263Llama 3.1 8B Instruct11.611.672.6131K2047$0.02
264Gemini 1.0 Pro11.611.633K120$0.50
265Phi-3 Mini Instruct 3.8B11.611.6$0.00
266OpenChat 3.511.511.5$0.00
267Granite 4.0 H 1B11.511.5$0.00
268Gemma 3 4B Instruct11.211.2$0.00
269Llama 3.2 11B Instruct1111128K168$0.05
270Claude Instant10.910.9$0.00
271Mistral Medium9.99.945$2.80
272Llama 2 Chat 13B9.89.8$0.00
273Llama 2 Chat 70B9.89.8$0.00
274LFM 40B9.69.6$0.00
275Llama 3 8B Instruct9.69.68K81$0.04
276Gemma 3n E2B Instruct9.59.5$0.00
277DBRX Instruct9.39.3$0.00
278DeepHermes 3 - Llama-3.1 8B8.58.5$0.00
279Llama 3.2 3B Instruct8.38.3131K172$0.05
280LFM2 2.6B8.18.1$0.00
281Jamba 1.6 Mini7.17.1183$0.20
282OLMo 2 32B6.86.8$0.00
283Mixtral 8x7B Instruct6.66.6$0.50
284Jamba 1.5 Mini6.26.2256K100$0.20
285Jamba 1.7 Mini6.16.1$0.00
286Granite 4.0 1B4.74.7$0.00
287Mistral 7B Instruct4.64.690$0.20
288OLMo 2 7B4.14.1$0.00
289Molmo 7B-D3.93.9$0.00
290Granite 4.0 350M2.42.4$0.00
291LFM2 1.2B22$0.00
292Gemma 3 1B1.935.21.941.5
293Llama 3.2 1B Instruct1.91.9131K91$0.03
294Granite 4.0 H 350M1.91.9$0.00
295Gemma 3 1B Instruct1.71.7$0.00
296Gemma 3 270M0.30.3$0.00
297Llama 2 Chat 7B0.20.2113$0.10

297 models ranked on Coding. The intelligence index is a balanced mean of per-category scores; category columns average the benchmarks within each. Scores are curated approximations — see each model for sources. Click any column to sort.