AI Hub
Leaderboards

Model rankings

A balanced intelligence index averages each model's per-category scores. Drill into a category for individual benchmarks, or sort by speed, price, and context. See what changed → How this is calculated → Embed this leaderboard →

Updated May 25, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Report an error

Top intelligence
Sonar Reasoning Pro
95.7 index
Top reasoning
Claude Opus 4.7
94.2
Top math
Grok-4 Heavy
100
Fastest
Llama 3.3 70B Instruct
2220 tok/s
Cheapest
Ling-2.6-flash
$0.01/M
Longest context
Llama 4 Scout
10M
Best open-weights
DeepSeek V3.2 Speciale
89.9 index

Price vs. intelligence

Intelligence index vs. input price — up and to the left is better value.

1008468523620$0$4$8$12$16$20Input price ($/M tokens)Intelligence indexSonar Reasoning Pro: 95.7 @ $2Sonar Reasoning ProQwen3.7 Max: 92.3 @ $2.5Gemini 3.5 Flash: 92.2 @ $1.5GPT-5.3-Codex: 91.5 @ $1.75Grok 4.20 0309 v2: 91.1 @ $2Claude Opus 4.7: 90.9 @ $5Gemini 3 Flash: 90.2 @ $0.5Gemini 3 FlashGrok 4.3: 90.1 @ $1.25DeepSeek V3.2 Speciale: 89.9 @ $0.287DeepSeek V3.2 SpecialeGPT-5.2-Codex: 89.9 @ $1.75DeepSeek-V4-Flash: 89.4 @ $0.1DeepSeek-V4-FlashQwen3.5 397B A17B: 89.3 @ $0.39Qwen3.5 397B A17BGLM 4.7: 89 @ $0.4GPT-5.1: 89 @ $1.25Qwen3.6 Max: 88.8 @ $1.04Grok 4.20 0309: 88.5 @ $2GPT-5 Pro: 88.4 @ $15GPT-5.1-Codex: 88.2 @ $1.25Qwen3.6 Plus: 88.2 @ $0.325DeepSeek-V4-Pro: 88.2 @ $0.435MiMo-V2-Flash: 88 @ $0.1MiMo-V2-FlashClaude Opus 4.5: 88 @ $5Kimi K2.5: 87.9 @ $0.4GPT-5.4 mini: 87.5 @ $0.75MiniMax M2.7: 87.4 @ $0.279GPT-5 Codex: 87.1 @ $1.25DeepSeek-V3.2: 87.1 @ $0.252MiMo-V2-Pro: 87 @ $1GLM 5.1: 86.8 @ $0.98Hy3: 86.7 @ $0.066MiMo-V2.5-Pro: 86.6 @ $1GPT-5.2: 86.2 @ $1.75Grok-3 Mini: 85.9 @ $0.3Qwen3.5-27B: 85.8 @ $0.195Qwen3.5-122B-A10B: 85.7 @ $0.26Gemma 4 31B: 85.7 @ $0.12Ring-2.6-1T: 85.7 @ $0.075Kimi K2 Thinking: 85.6 @ $0.6MiMo-V2-Omni-0327: 85.5 @ $0.4KAT-Coder-Pro V2: 85.5 @ $0.3MiMo-V2.5: 84.9 @ $0.4MiniMax M2.5: 84.8 @ $0.15GPT-5.1-Codex-Mini: 84.7 @ $0.25GLM 5 Turbo: 84.7 @ $1.2o3 Pro: 84.5 @ $20Qwen3.5-35B-A3B: 84.5 @ $0.139Qwen3 235B A22B 2507: 84.2 @ $0.4Qwen3.6 27B: 84.2 @ $0.3Qwen3.6 35B A3B: 84.1 @ $0.15MiniMax M2.1: 83.6 @ $0.29DeepSeek V3.1 Terminus: 83.5 @ $0.27Gemini 3.1 Pro: 83.2 @ $2Step 3.5 Flash: 83.1 @ $0.09MiMo-V2-Omni: 82.8 @ $0.4Gemini 3 Pro: 82.8 @ $2Qwen3.5 Omni Plus: 82.6 @ $0.4Grok-3: 82.6 @ $3o1-pro: 82.5 @ $150Gemini 3.1 Flash Lite: 82.2 @ $0.25Nova 2 Lite: 82.1 @ $0.3GLM-5: 81.9 @ $0.6KAT-Coder-Pro V1: 81.8 @ $0.3GPT-5.4 nano: 81.7 @ $0.2INTELLECT-3: 81 @ $0.2Nova 2.0 Pro: 80.9 @ $1.3Grok 3 mini Reasoning: 80.9 @ $0.3GLM 5V Turbo: 80.9 @ $1.2Qwen3.5-9B: 80.6 @ $0.04GPT-5: 80.5 @ $1.25Claude Sonnet 4.5: 80.4 @ $3Qwen3-Next-80B-A3B: 80.3 @ $0.5NVIDIA Nemotron 3 Nano 30B A3B: 80.1 @ $0.1NVIDIA Nemotron 3 Super 120B A12B: 80 @ $0.3Qwen3-235B-A22B-Thinking-2507: 79.6 @ $0.3gpt-oss-120b: 79.6 @ $0.039Qwen3 Max: 79.5 @ $0.78Llama Nemotron Super 49B v1.5: 79.4 @ $0.1Claude Opus 4.6: 79.4 @ $5Gemma 4 26B A4B: 79.2 @ $0.06GPT-5 mini: 79.2 @ $0.25Qwen2.5 VL 72B Instruct: 79.1 @ $0.25Seed-OSS-36B-Instruct: 78.8 @ $0.2Grok 4 Fast: 78.7 @ $0.2Qwen3 VL 235B A22B: 78.4 @ $0.8Qwen3 VL 32B: 78.4 @ $0.7o4-mini: 78.4 @ $1.1Llama 3.1 Nemotron Ultra 253B v1: 78.3 @ $0.6Nova 2.0 Omni: 78.2 @ $0.3Grok 4: 78.2 @ $3Magistral Medium 1.2: 78.1 @ $2Nemotron Nano 9B V2: 77.6 @ $0.04Qwen3 Next 80B A3B Thinking: 77.5 @ $0.098Mercury 2: 77 @ $0.25Mistral Small 4: 76.9 @ $0.15Gemini 2.5 Pro Preview 06-05: 76.6 @ $1.25Claude Sonnet 4.6: 76.3 @ $3Qwen3 VL 30B A3B: 76.2 @ $0.2Qwen3 Max Thinking: 76.1 @ $0.78GPT-5.5: 76.1 @ $5MiniMax-M2: 76 @ $0.255Cogito v2.1: 75.8 @ $1.3MiniMax M1 80k: 75.5 @ $0.6Claude Opus 4.1: 75.4 @ $15Claude Haiku 4.5: 75.3 @ $1Trinity Large Thinking: 75.2 @ $0.22Ling-2.6-1T: 75.2 @ $0.075DeepSeek-R1: 75 @ $0.55DeepSeek VL2: 74.9 @ $9.5Qwen3 235B A22B: 74.9 @ $0.455Kimi K2.6: 74.9 @ $0.73GPT-5.4: 74.9 @ $2.5Mistral Medium 3.5: 74.8 @ $1.5Qwen3 30B A3B 2507: 74.7 @ $0.3Claude 3.7 Sonnet: 74.7 @ $3Ring-flash-2.0: 74.6 @ $0.1Claude Sonnet 4: 74.5 @ $3Qwen3.5 Omni Flash: 74.2 @ $0.1Magistral Small 1.2: 73.9 @ $0.5Qwen3 32B: 73.8 @ $0.08Qwen3 Coder Next: 73.7 @ $0.11gpt-oss-20b: 73.6 @ $0.03Kimi K2: 73.6 @ $0.57Hermes 4 - Llama-3.1 405B: 73.5 @ $1Qwen3 Omni 30B A3B: 73.4 @ $0.3Gemini 2.5 Flash: 73.1 @ $0.3GLM-4.5: 73 @ $0.6Solar Pro 3: 72.4 @ $0.15GLM-4.6: 72.4 @ $0.43Gemini 2.5 Flash-Lite: 72.3 @ $0.1Qwen3-235B-A22B-Instruct-2507: 72.2 @ $0.15DeepSeek V3.2 Exp: 72.2 @ $0.27Qwen3 30B A3B: 71.7 @ $0.09Gemini 2.5 Pro: 71.6 @ $1.25o3: 71.6 @ $2Hermes 4 - Llama-3.1 70B: 71.3 @ $0.1GPT-5 nano: 71.2 @ $0.05Kimi K2 0905: 71 @ $0.6Ministral 8B Instruct: 70.9 @ $0.1Qwen3 VL 235B A22B Instruct: 70.9 @ $0.2o1-mini: 70.5 @ $3GLM 4.5 Air: 70.4 @ $0.13Claude 3.5 Sonnet: 70.3 @ $3DeepSeek R1 Distill Llama 70B: 70.1 @ $0.1GLM 4.5V: 70.1 @ $0.6GLM 4.6V: 69.6 @ $0.3Olmo 3 32B Think: 69.5 @ $0.15NVIDIA Nemotron Nano 12B v2 VL: 69.4 @ $0.2Claude Opus 4: 69.4 @ $15Qwen3 30B A3B 2507 Instruct: 69.3 @ $0.2Qwen3 Next 80B A3B Instruct: 68.9 @ $0.09DeepSeek R1 Distill Qwen 32B: 68.4 @ $0.12ERNIE 4.5 300B A47B: 68.2 @ $0.28QwQ-32B: 67.8 @ $0.7Gemini 1.5 Pro: 67.3 @ $1.25Ling-flash-2.0: 66.9 @ $0.1Qwen3 14B: 66.8 @ $0.1Qwen3 Coder 480B A35B Instruct: 66.5 @ $0.3Kimi K2 Instruct: 66.5 @ $0.57Qwen3 VL 30B A3B Instruct: 66.5 @ $0.13Qwen3 VL 32B Instruct: 66.5 @ $0.104Pixtral-12B: 66.1 @ $0.15DeepSeek-V3 0324: 65.9 @ $0.28o1: 65.4 @ $15o3-mini: 64.1 @ $1.1Llama 4 Maverick: 63.9 @ $0.15GPT-4.1: 63.8 @ $2Qwen2.5 Max: 63.6 @ $1.6DeepSeek-V2.5: 63.4 @ $0.14DeepSeek-R1-0528: 63.3 @ $0.55QwQ-32B-Preview: 62.6 @ $0.15Grok-2: 62.4 @ $2Mistral Small 3 24B Instruct: 62.1 @ $0.1Gemini 1.5 Flash: 61.9 @ $0.15Nova Pro: 61.6 @ $0.8MiniMax-M1: 61.5 @ $0.4Llama 3.1 405B Instruct: 60.9 @ $0.89Gemini 2.0 Flash: 60.3 @ $0.1DeepSeek-V3.1: 59.8 @ $0.21GPT-4 Turbo: 59.8 @ $10GPT-4.5: 59.4 @ $75Ling-2.6-flash: 59.3 @ $0.01Qwen2.5 72B Instruct: 59.1 @ $0.36Llama 4 Scout: 58.9 @ $0.08Sonar Pro: 58.8 @ $3Mistral Medium 3: 58.6 @ $0.4Claude 3 Opus: 58.5 @ $15Gemma 3 27B: 58.4 @ $0.08Mistral Large 3: 58.3 @ $0.5GPT-4: 58.3 @ $30GPT-4.1 Mini: 58.2 @ $0.4GLM 4.7 Flash: 58.1 @ $0.06DeepSeek-V3: 58.1 @ $0.229Qwen3 8B: 57.8 @ $0.05Nova Lite: 57.7 @ $0.06Qwen3 Omni 30B A3B Instruct: 57.3 @ $0.3o1-preview: 57.3 @ $15Gemini 2.5 Flash Lite: 57 @ $0.1Sonar: 56.8 @ $1Qwen3 4B: 56.5 @ $0.1GPT-4o: 56.4 @ $2.5Reka Flash 3: 56.2 @ $0.1Llama 3.1 70B Instruct: 56 @ $0.4Command A: 55.9 @ $2.5Gemma 3 12B: 55.5 @ $0.04Qwen3 Coder 30B A3B Instruct: 55.4 @ $0.07Claude 3.5 Haiku: 54.5 @ $0.8Gemini 2.0 Flash Lite: 54.4 @ $0.075Devstral 2: 54.3 @ $0.4Llama 3.2 90B Instruct: 54 @ $0.35Nova Premier: 53.1 @ $2.5Pixtral Large: 53.1 @ $2Reka Flash: 52.9 @ $0.2Mistral Medium 3.1: 51.5 @ $0.4Mistral Small 3.2: 51 @ $0.1Mistral Small 3.1 24B Base: 50.9 @ $0.1Llama 3.3 70B Instruct: 50.6 @ $0.1Qwen2.5 Turbo: 50.3 @ $0.1Qwen2.5 Coder 32B Instruct: 50.1 @ $0.66Qwen3 VL 8B: 49.7 @ $0.2GPT-4o-mini: 49.1 @ $0.15Nova Micro: 49 @ $0.03Gemini 1.5 Flash 8B: 48.4 @ $0.07Ministral 3 14B: 47.9 @ $0.2Mistral Large 2: 47.9 @ $2Devstral Medium: 47.8 @ $0.4Phi 4: 47.6 @ $0.065LFM2-24B-A2B: 47.4 @ $0.03Qwen3 1.7B: 46.9 @ $0.1Nemotron 3 Nano Omni 30B A3B Reasoning: 46.9 @ $0.1Qwen2.5 7B Instruct: 46.6 @ $0.04Phi-4-multimodal-instruct: 46.2 @ $0.05Phi-3.5-mini-instruct: 46 @ $0.1Claude 3 Sonnet: 45.8 @ $3Gemma 3 4B: 45.8 @ $0.04Devstral Small: 45.3 @ $0.1Llama 3.1 8B Instruct: 45 @ $0.02Gemma 3 27B Instruct: 44.5 @ $0.1Llama 3.1 Nemotron 70B Instruct: 43.7 @ $1.2Mistral Small 3: 43.6 @ $0.05Ministral 3 8B: 43.3 @ $0.15Granite 4.1 8B: 43.3 @ $0.05Qwen3 VL 8B Instruct: 43 @ $0.08Jamba 1.5 Large: 42.8 @ $2Jamba 1.6 Large: 42.6 @ $2Hermes 3 - Llama-3.1 70B: 42.5 @ $0.3Mistral Small 3.1: 42.4 @ $0.1GPT-4.1 Nano: 42.1 @ $0.1Llama 3 70B Instruct: 40.8 @ $0.51Mistral Small: 40.3 @ $0.2Gemma 3 12B Instruct: 40 @ $0.1Olmo 3 7B Instruct: 40 @ $0.1Mistral Large: 39.3 @ $2Mixtral 8x22B Instruct: 39.1 @ $2Claude 3 Haiku: 38.9 @ $0.25Llama 3.2 11B Instruct: 36.7 @ $0.05Jamba Large 1.7: 36.5 @ $2Granite 4.0 H Small: 35.7 @ $0.1GPT-3.5 Turbo: 35.2 @ $0.5Gemini 1.0 Pro: 34 @ $0.5Ministral 3 3B: 33.7 @ $0.1Mistral Medium: 33.6 @ $2.8Solar Mini: 33.1 @ $0.2Phi 4 Mini Instruct: 32.6 @ $0.08Llama 3 8B Instruct: 32.4 @ $0.04Llama 3.2 3B Instruct: 30.8 @ $0.051Jamba 1.5 Mini: 29.6 @ $0.2Qwen3 0.6B: 29.3 @ $0.1Command R+: 28.9 @ $0.15Apertus 70B Instruct: 27.2 @ $0.8Mixtral 8x7B Instruct: 26.1 @ $0.5Apertus 8B Instruct: 25.6 @ $0.1Granite 4.0 Micro: 25.6 @ $0.017Jamba 1.6 Mini: 24.9 @ $0.2Gemma 3n E4B Instructed: 24.8 @ $20Mistral 7B Instruct: 14.7 @ $0.2Llama 3.2 1B Instruct: 12.1 @ $0.027Llama 2 Chat 7B: 11.3 @ $0.1

Speed vs. intelligence

Intelligence index vs. output speed — up and to the right is fast and smart.

1008468523620080160240320400Output speed (tokens/s)Intelligence indexQwen3.7 Max: 92.3 @ 203Gemini 3.5 Flash: 92.2 @ 221GPT-5.3-Codex: 91.5 @ 73Grok 4.20 0309 v2: 91.1 @ 105Claude Opus 4.7: 90.9 @ 49Gemini 3 Flash: 90.2 @ 191Grok 4.3: 90.1 @ 88GPT-5.2-Codex: 89.9 @ 106DeepSeek-V4-Flash: 89.4 @ 109Qwen3.5 397B A17B: 89.3 @ 53GLM 4.7: 89 @ 98GPT-5.1: 89 @ 115Qwen3.6 Max: 88.8 @ 36Grok 4.20 0309: 88.5 @ 97GPT-5.1-Codex: 88.2 @ 188Qwen3.6 Plus: 88.2 @ 52DeepSeek-V4-Pro: 88.2 @ 30MiMo-V2-Flash: 88 @ 145Claude Opus 4.5: 88 @ 58Kimi K2.5: 87.9 @ 35GPT-5.4 mini: 87.5 @ 162MiniMax M2.7: 87.4 @ 50GPT-5 Codex: 87.1 @ 180MiMo-V2-Pro: 87 @ 60GLM 5.1: 86.8 @ 53Hy3: 86.7 @ 100MiMo-V2.5-Pro: 86.6 @ 58GPT-5.2: 86.2 @ 73Grok-3 Mini: 85.9 @ 100Qwen3.5-27B: 85.8 @ 91Qwen3.5-122B-A10B: 85.7 @ 129Gemma 4 31B: 85.7 @ 36Ring-2.6-1T: 85.7 @ 120Kimi K2 Thinking: 85.6 @ 100MiMo-V2-Omni-0327: 85.5 @ 110KAT-Coder-Pro V2: 85.5 @ 108MiMo-V2.5: 84.9 @ 92MiniMax M2.5: 84.8 @ 87GPT-5.1-Codex-Mini: 84.7 @ 175o3 Pro: 84.5 @ 25Qwen3.5-35B-A3B: 84.5 @ 121Qwen3 235B A22B 2507: 84.2 @ 59Qwen3.6 27B: 84.2 @ 64Qwen3.6 35B A3B: 84.1 @ 169MiniMax M2.1: 83.6 @ 92Gemini 3.1 Pro: 83.2 @ 142Step 3.5 Flash: 83.1 @ 194MiMo-V2-Omni: 82.8 @ 108Gemini 3 Pro: 82.8 @ 141Qwen3.5 Omni Plus: 82.6 @ 54Step 3.5 Flash 2603: 82.6 @ 197Grok-3: 82.6 @ 100Gemini 3.1 Flash Lite: 82.2 @ 342Nova 2 Lite: 82.1 @ 229GLM-5: 81.9 @ 67KAT-Coder-Pro V1: 81.8 @ 108GPT-5.4 nano: 81.7 @ 157Nova 2.0 Pro: 80.9 @ 149Grok 3 mini Reasoning: 80.9 @ 33Qwen3.5-9B: 80.6 @ 51GPT-5: 80.5 @ 100Claude Sonnet 4.5: 80.4 @ 42Qwen3-Next-80B-A3B: 80.3 @ 147NVIDIA Nemotron 3 Nano 30B A3B: 80.1 @ 148NVIDIA Nemotron 3 Super 120B A12B: 80 @ 211gpt-oss-120b: 79.6 @ 500Qwen3 Max: 79.5 @ 45Llama Nemotron Super 49B v1.5: 79.4 @ 51Claude Opus 4.6: 79.4 @ 48Gemma 4 26B A4B: 79.2 @ 66GPT-5 mini: 79.2 @ 200Seed-OSS-36B-Instruct: 78.8 @ 37Grok 4 Fast: 78.7 @ 90Qwen3 VL 235B A22B: 78.4 @ 34Qwen3 VL 32B: 78.4 @ 93o4-mini: 78.4 @ 115Llama 3.1 Nemotron Ultra 253B v1: 78.3 @ 42Grok 4: 78.2 @ 100Magistral Medium 1.2: 78.1 @ 42Qwen3.5 4B: 77.1 @ 164Mercury 2: 77 @ 790Mercury 2Mistral Small 4: 76.9 @ 145Gemini 2.5 Pro Preview 06-05: 76.6 @ 85Claude Sonnet 4.6: 76.3 @ 75Qwen3 VL 30B A3B: 76.2 @ 122Qwen3 Max Thinking: 76.1 @ 45GPT-5.5: 76.1 @ 67MiniMax-M2: 76 @ 91Cogito v2.1: 75.8 @ 56Claude Opus 4.1: 75.4 @ 120Claude Haiku 4.5: 75.3 @ 100Trinity Large Thinking: 75.2 @ 129DeepSeek-R1: 75 @ 189DeepSeek VL2: 74.9 @ 22Qwen3 235B A22B: 74.9 @ 68Kimi K2.6: 74.9 @ 57GPT-5.4: 74.9 @ 84Mistral Medium 3.5: 74.8 @ 140Qwen3 30B A3B 2507: 74.7 @ 151Claude 3.7 Sonnet: 74.7 @ 101Claude Sonnet 4: 74.5 @ 101Qwen3.5 Omni Flash: 74.2 @ 235Magistral Small 1.2: 73.9 @ 106Sarvam 105B: 73.8 @ 128Qwen3 32B: 73.8 @ 328Qwen3 Coder Next: 73.7 @ 92gpt-oss-20b: 73.6 @ 1000gpt-oss-20bKimi K2: 73.6 @ 26Hermes 4 - Llama-3.1 405B: 73.5 @ 34Qwen3 Omni 30B A3B: 73.4 @ 102Gemini 2.5 Flash: 73.1 @ 85GLM-4.5: 73 @ 85GLM-4.6: 72.4 @ 85Qwen3-235B-A22B-Instruct-2507: 72.2 @ 63DeepSeek V3.2 Exp: 72.2 @ 100Qwen3 30B A3B: 71.7 @ 122Gemini 2.5 Pro: 71.6 @ 85o3: 71.6 @ 50Hermes 4 - Llama-3.1 70B: 71.3 @ 60GPT-5 nano: 71.2 @ 500Kimi K2 0905: 71 @ 16Ministral 8B Instruct: 70.9 @ 0Qwen3 VL 235B A22B Instruct: 70.9 @ 51o1-mini: 70.5 @ 115GLM 4.5 Air: 70.4 @ 63Claude 3.5 Sonnet: 70.3 @ 101DeepSeek R1 Distill Llama 70B: 70.1 @ 37GLM 4.5V: 70.1 @ 85GLM 4.6V: 69.6 @ 44NVIDIA Nemotron Nano 12B v2 VL: 69.4 @ 244Claude Opus 4: 69.4 @ 120Qwen3 30B A3B 2507 Instruct: 69.3 @ 122Qwen3 Next 80B A3B Instruct: 68.9 @ 161DeepSeek R1 Distill Qwen 32B: 68.4 @ 37NVIDIA Nemotron Nano 9B V2: 68.3 @ 129ERNIE 4.5 300B A47B: 68.2 @ 24QwQ-32B: 67.8 @ 31Gemini 1.5 Pro: 67.3 @ 85Ling-flash-2.0: 66.9 @ 91Qwen3 14B: 66.8 @ 62Qwen3 Coder 480B A35B Instruct: 66.5 @ 69Kimi K2 Instruct: 66.5 @ 45Qwen3 VL 30B A3B Instruct: 66.5 @ 123Qwen3 VL 32B Instruct: 66.5 @ 76Pixtral-12B: 66.1 @ 0o1: 65.4 @ 66o3-mini: 64.1 @ 115Llama 4 Maverick: 63.9 @ 639GPT-4.1: 63.8 @ 100Qwen2.5 Max: 63.6 @ 50LongCat Flash Lite: 63.6 @ 110DeepSeek-V2.5: 63.4 @ 100Sarvam 30B: 63.3 @ 214DeepSeek-R1-0528: 63.3 @ 45QwQ-32B-Preview: 62.6 @ 99Grok-2: 62.4 @ 85Mistral Small 3 24B Instruct: 62.1 @ 134Gemini 1.5 Flash: 61.9 @ 150Nova Pro: 61.6 @ 100Llama 3.1 405B Instruct: 60.9 @ 100Gemini 2.0 Flash: 60.3 @ 183GPT-4 Turbo: 59.8 @ 100GPT-4.5: 59.4 @ 50Qwen2.5 72B Instruct: 59.1 @ 100Llama 4 Scout: 58.9 @ 776Llama 4 ScoutMistral Medium 3: 58.6 @ 32Claude 3 Opus: 58.5 @ 120Gemma 3 27B: 58.4 @ 33Mistral Large 3: 58.3 @ 54GPT-4: 58.3 @ 104GPT-4.1 Mini: 58.2 @ 150GLM 4.7 Flash: 58.1 @ 113DeepSeek-V3: 58.1 @ 100Qwen3 8B: 57.8 @ 69Nova Lite: 57.7 @ 100Qwen3 Omni 30B A3B Instruct: 57.3 @ 103o1-preview: 57.3 @ 66Gemini 2.5 Flash Lite: 57 @ 6Qwen3 4B: 56.5 @ 103Sarvam M: 56.4 @ 136GPT-4o: 56.4 @ 132Reka Flash 3: 56.2 @ 93Llama 3.1 70B Instruct: 56 @ 1204Llama 3.1 70B InstructCommand A: 55.9 @ 203Gemma 3 12B: 55.5 @ 33Qwen3 Coder 30B A3B Instruct: 55.4 @ 97Claude 3.5 Haiku: 54.5 @ 104Gemini 2.0 Flash Lite: 54.4 @ 85Devstral 2: 54.3 @ 51Llama 3.2 90B Instruct: 54 @ 100Nova Premier: 53.1 @ 40Pixtral Large: 53.1 @ 0Reka Flash: 52.9 @ 85Mistral Medium 3.1: 51.5 @ 47Mistral Small 3.2: 51 @ 100Mistral Small 3.1 24B Base: 50.9 @ 137Llama 3.3 70B Instruct: 50.6 @ 2220Llama 3.3 70B InstructQwen2.5 Turbo: 50.3 @ 67Qwen2.5 Coder 32B Instruct: 50.1 @ 110Qwen3 VL 8B: 49.7 @ 120GPT-4o-mini: 49.1 @ 92Nova Micro: 49 @ 100Gemini 1.5 Flash 8B: 48.4 @ 150Ministral 3 14B: 47.9 @ 67Mistral Large 2: 47.9 @ 42Devstral Medium: 47.8 @ 72Phi 4: 47.6 @ 33Devstral Small 2: 47.5 @ 62LFM2-24B-A2B: 47.4 @ 208Qwen3 1.7B: 46.9 @ 138Nemotron 3 Nano Omni 30B A3B Reasoning: 46.9 @ 301Qwen2.5 7B Instruct: 46.6 @ 138Phi-4-multimodal-instruct: 46.2 @ 25Phi-3.5-mini-instruct: 46 @ 23Claude 3 Sonnet: 45.8 @ 120Gemma 3 4B: 45.8 @ 33Qwen3.5 2B: 45.6 @ 328Devstral Small: 45.3 @ 190Llama 3.1 8B Instruct: 45 @ 2047Llama 3.1 8B InstructLlama 3.1 Nemotron 70B Instruct: 43.7 @ 292Mistral Small 3: 43.6 @ 136Ministral 3 8B: 43.3 @ 86Granite 4.1 8B: 43.3 @ 133Qwen3 VL 8B Instruct: 43 @ 145Jamba 1.5 Large: 42.8 @ 100Jamba 1.6 Large: 42.6 @ 52Hermes 3 - Llama-3.1 70B: 42.5 @ 32Mistral Small 3.1: 42.4 @ 134GPT-4.1 Nano: 42.1 @ 200Llama 3 70B Instruct: 40.8 @ 45Mistral Small: 40.3 @ 134Claude 3 Haiku: 38.9 @ 104Llama 3.2 11B Instruct: 36.7 @ 168Jamba Large 1.7: 36.5 @ 48Granite 4.0 H Small: 35.7 @ 524GPT-3.5 Turbo: 35.2 @ 100Gemma 3n E4B Instruct: 34.7 @ 56Gemini 1.0 Pro: 34 @ 120Ministral 3 3B: 33.7 @ 154Mistral Medium: 33.6 @ 45Solar Mini: 33.1 @ 63Granite 3.3 8B: 32.5 @ 376Llama 3 8B Instruct: 32.4 @ 81Llama 3.2 3B Instruct: 30.8 @ 172Tiny Aya Global: 30.5 @ 126Jamba 1.5 Mini: 29.6 @ 100Qwen3 0.6B: 29.3 @ 225Command R+: 28.9 @ 100Jamba 1.6 Mini: 24.9 @ 183Gemma 3n E4B Instructed: 24.8 @ 42Qwen3.5 0.8B: 23.6 @ 120Mistral 7B Instruct: 14.7 @ 90Llama 3.2 1B Instruct: 12.1 @ 91Llama 2 Chat 7B: 11.3 @ 113

Intelligence over time

Every scored model by release date; the line traces the rising state of the art (intelligence index).

10075502502023202420252026Claude Instant: 28.4 (2023-03-14)Claude 2: 33.4 (2023-07-11)Llama 2 Chat 13B: 28.9 (2023-07-18)Llama 2 Chat 70B: 28.9 (2023-07-18)Llama 2 Chat 7B: 11.3 (2023-07-18)Mistral 7B Instruct: 14.7 (2023-09-27)Claude 2.1: 34.6 (2023-11-21)Mistral Medium: 33.6 (2023-12-11)Mixtral 8x7B Instruct: 26.1 (2023-12-11)OpenChat 3.5: 24.1 (2023-12-18)Solar Mini: 33.1 (2024-01-25)Gemini 1.0 Pro: 34 (2024-02-15)Mistral Small: 40.3 (2024-02-26)Claude 3 Sonnet: 45.8 (2024-02-29)Claude 3 Opus: 58.5 (2024-03-04)Claude 3 Haiku: 38.9 (2024-03-13)Gemini 1.5 Flash 8B: 48.4 (2024-03-15)DBRX Instruct: 27.5 (2024-03-27)Grok-1.5: 50.3 (2024-03-28)Command R+: 28.9 (2024-04-04)Mixtral 8x22B Instruct: 39.1 (2024-04-17)Llama 3 70B Instruct: 40.8 (2024-04-18)Llama 3 8B Instruct: 32.4 (2024-04-18)Phi-3 Mini Instruct 3.8B: 27.5 (2024-04-23)Qwen1.5 Chat 110B: 28.9 (2024-04-25)Gemini 1.5 Flash: 61.9 (2024-05-01)DeepSeek-V2.5: 63.4 (2024-05-08)GPT-4o: 56.4 (2024-05-13)DeepSeek Coder V2 Lite Instruct: 30.2 (2024-06-17)Claude 3.5 Sonnet: 70.3 (2024-06-20)GPT-4o-mini: 49.1 (2024-07-18)Llama 3.1 405B Instruct: 60.9 (2024-07-23)Qwen2 72B Instruct: 59.9 (2024-07-23)Llama 3.1 70B Instruct: 56 (2024-07-23)Llama 3.1 8B Instruct: 45 (2024-07-23)Qwen2 7B Instruct: 37.4 (2024-07-23)Mistral Large 2: 47.9 (2024-07-24)Grok-2 mini: 65.9 (2024-08-13)Grok-2: 62.4 (2024-08-13)Grok: 53.8 (2024-08-13)Hermes 3 - Llama-3.1 70B: 42.5 (2024-08-15)Jamba 1.5 Large: 42.8 (2024-08-22)Jamba 1.5 Mini: 29.6 (2024-08-22)Phi-3.5-vision-instruct: 61.7 (2024-08-23)Phi-3.5-MoE-instruct: 49.8 (2024-08-23)Phi-3.5-mini-instruct: 46 (2024-08-23)Qwen2-VL-72B-Instruct: 67.3 (2024-08-29)o1-mini: 70.5 (2024-09-12)o1-preview: 57.3 (2024-09-12)Pixtral-12B: 66.1 (2024-09-17)Qwen2.5 32B Instruct: 66.7 (2024-09-19)Qwen2.5 14B Instruct: 66.1 (2024-09-19)Qwen2.5 72B Instruct: 59.1 (2024-09-19)Qwen2.5-Coder 7B Instruct: 39.6 (2024-09-19)Llama 3.2 90B Instruct: 54 (2024-09-25)Llama 3.2 11B Instruct: 36.7 (2024-09-25)Llama 3.2 3B Instruct: 30.8 (2024-09-25)Molmo 7B-D: 16.3 (2024-09-25)Llama 3.2 1B Instruct: 12.1 (2024-09-25)LFM 40B: 33.2 (2024-09-30)Llama 3.1 Nemotron 70B Instruct: 43.7 (2024-10-01)Reka Flash: 52.9 (2024-10-04)Ministral 8B Instruct: 70.9 (2024-10-16)Qwen2.5 7B Instruct: 46.6 (2024-10-16)Claude 3.5 Haiku: 54.5 (2024-11-04)Qwen2.5 Coder 32B Instruct: 50.1 (2024-11-11)Qwen2.5 Turbo: 50.3 (2024-11-18)Pixtral Large: 53.1 (2024-11-19)Mistral Large: 39.3 (2024-11-19)Nova Pro: 61.6 (2024-11-20)Nova Lite: 57.7 (2024-11-20)Nova Micro: 49 (2024-11-20)OLMo 2 7B: 15.5 (2024-11-26)QwQ-32B-Preview: 62.6 (2024-11-28)o1: 65.4 (2024-12-05)Llama 3.3 70B Instruct: 50.6 (2024-12-06)Gemini 2.0 Flash: 60.3 (2024-12-11)DeepSeek VL2 Small: 73.1 (2024-12-13)DeepSeek VL2 Tiny: 67.2 (2024-12-13)QvQ-72B-Preview: 70.9 (2024-12-25)DeepSeek-V3: 58.1 (2024-12-26)Phi 4: 47.6 (2025-01-10)DeepSeek-R1: 75 (2025-01-20)DeepSeek R1 Zero: 71.5 (2025-01-20)DeepSeek R1 Distill Llama 70B: 70.1 (2025-01-20)DeepSeek R1 Distill Qwen 32B: 68.4 (2025-01-20)DeepSeek R1 Distill Qwen 14B: 65.7 (2025-01-20)DeepSeek R1 Distill Qwen 7B: 58.3 (2025-01-20)DeepSeek R1 Distill Llama 8B: 53.3 (2025-01-20)DeepSeek R1 Distill Qwen 1.5B: 32.6 (2025-01-20)Gemini 2.0 Flash Thinking: 69.1 (2025-01-21)Qwen2.5 VL 7B Instruct: 70 (2025-01-26)Sonar: 56.8 (2025-01-27)Sonar Reasoning: 77.2 (2025-01-28)Qwen2.5 Max: 63.6 (2025-01-28)Mistral Small 3 24B Instruct: 62.1 (2025-01-30)Llama 3.1 Tulu3 405B: 57.5 (2025-01-30)Mistral Small 3 24B Base: 44.4 (2025-01-30)Mistral Small 3: 43.6 (2025-01-30)o3-mini: 64.1 (2025-01-31)Qwen2.5 VL 72B Instruct: 79.1 (2025-02-01)Phi-4-multimodal-instruct: 46.2 (2025-02-01)Phi 4 Mini: 45.3 (2025-02-01)Gemini 2.0 Pro: 67.4 (2025-02-05)DeepHermes 3 - Llama-3.1 8B: 23.5 (2025-02-13)Grok-3: 82.6 (2025-02-17)Mistral Saba: 57.1 (2025-02-17)Grok 3 mini Reasoning: 80.9 (2025-02-19)Claude 3.7 Sonnet: 74.7 (2025-02-24)Gemini 2.0 Flash Lite: 54.4 (2025-02-25)GPT-4.5: 59.4 (2025-02-27)Qwen2.5 VL 32B Instruct: 62.1 (2025-02-28)QwQ-32B: 67.8 (2025-03-05)Jamba 1.6 Large: 42.6 (2025-03-06)Jamba 1.6 Mini: 24.9 (2025-03-06)Sonar Pro: 58.8 (2025-03-07)Gemma 3 27B: 58.4 (2025-03-12)Reka Flash 3: 56.2 (2025-03-12)Gemma 3 27B Instruct: 44.5 (2025-03-12)Gemma 3 12B Instruct: 40 (2025-03-12)Gemma 3 4B Instruct: 31.7 (2025-03-12)Gemma 3 1B: 21.2 (2025-03-12)Command A: 55.9 (2025-03-13)Gemma 3 12B: 55.5 (2025-03-13)Gemma 3 4B: 45.8 (2025-03-13)DeepHermes 3 - Mistral 24B: 43.8 (2025-03-13)OLMo 2 32B: 23.5 (2025-03-13)Gemma 3 1B Instruct: 16.2 (2025-03-13)Mistral Small 3.1 24B Base: 50.9 (2025-03-17)Mistral Small 3.1 24B Instruct: 48 (2025-03-17)Mistral Small 3.1: 42.4 (2025-03-17)Llama 3.1 Nemotron Nano 8B V1: 68.2 (2025-03-18)Llama-3.3 Nemotron Super 49B v1: 63.9 (2025-03-18)o1-pro: 82.5 (2025-03-19)Gemini 2.5 Pro: 71.6 (2025-03-25)DeepSeek-V3 0324: 65.9 (2025-03-25)Qwen2.5-Omni-7B: 51.5 (2025-03-27)Llama 4 Maverick: 63.9 (2025-04-05)Llama 4 Scout: 58.9 (2025-04-05)Llama 3.1 Nemotron Ultra 253B v1: 78.3 (2025-04-07)GPT-4.1: 63.8 (2025-04-14)GPT-4.1 Mini: 58.2 (2025-04-14)GPT-4.1 Nano: 42.1 (2025-04-14)o4-mini: 78.4 (2025-04-16)o3: 71.6 (2025-04-16)Granite 3.3 8B Instruct: 68.5 (2025-04-16)Granite 3.3 8B Base: 64.6 (2025-04-16)Granite 3.3 8B: 32.5 (2025-04-16)Gemini 2.5 Flash: 73.1 (2025-04-17)Qwen3: 75.8 (2025-04-28)Qwen3 235B A22B: 74.9 (2025-04-28)Qwen3 32B: 73.8 (2025-04-28)Qwen3 30B A3B: 71.7 (2025-04-28)Qwen3 14B: 66.8 (2025-04-28)Qwen3 8B: 57.8 (2025-04-28)Qwen3 4B: 56.5 (2025-04-28)Qwen3 1.7B: 46.9 (2025-04-28)Qwen3 0.6B: 29.3 (2025-04-28)Phi 4 Mini Reasoning: 73.3 (2025-04-30)Phi 4 Reasoning Plus: 70.4 (2025-04-30)Phi 4 Reasoning: 66.4 (2025-04-30)Nova Premier: 53.1 (2025-04-30)IBM Granite 4.0 Tiny Preview: 48 (2025-05-02)Mistral Medium 3: 58.6 (2025-05-07)Solar Pro 2: 72.5 (2025-05-20)Llama 3.1 Nemotron Nano 4B v1.1: 54.5 (2025-05-20)Gemma 3n E4B Instruct: 34.7 (2025-05-20)Gemma 3n E4B Instructed LiteRT Preview: 30.3 (2025-05-20)Gemini Diffusion: 30.2 (2025-05-20)Gemma 3n E2B Instructed LiteRT (Preview): 25.4 (2025-05-20)Devstral Small: 45.3 (2025-05-21)Claude Sonnet 4: 74.5 (2025-05-22)Claude Opus 4: 69.4 (2025-05-22)Sarvam M: 56.4 (2025-05-23)DeepSeek-R1-0528: 63.3 (2025-05-28)DeepSeek R1 0528 Qwen3 8B: 66.2 (2025-05-29)Gemini 2.5 Pro Preview 06-05: 76.6 (2025-06-05)o3 Pro: 84.5 (2025-06-10)Magistral Medium 1: 65.5 (2025-06-10)Magistral Small 1: 64.7 (2025-06-10)Magistral Medium: 62.9 (2025-06-10)Magistral Small 2506: 62.1 (2025-06-10)MiniMax-M1: 61.5 (2025-06-16)MiniMax M1 80k: 75.5 (2025-06-17)MiniMax M1 40k: 67.6 (2025-06-17)Mistral Small 3.2 24B Instruct: 56.2 (2025-06-20)Mistral Small 3.2: 51 (2025-06-20)Gemma 3n E4B: 56.9 (2025-06-26)Gemma 3n E2B: 49.1 (2025-06-26)Gemma 3n E2B Instruct: 27.5 (2025-06-26)Gemma 3n E4B Instructed: 24.8 (2025-06-26)Gemma 3n E2B Instructed: 21.3 (2025-06-26)ERNIE 4.5 300B A47B: 68.2 (2025-06-30)Jamba 1.7 Mini: 22.5 (2025-07-07)Grok-4 Heavy: 89.3 (2025-07-09)Grok 4: 78.2 (2025-07-09)Devstral Medium: 47.8 (2025-07-10)LFM2 1.2B: 13.5 (2025-07-10)Kimi K2: 73.6 (2025-07-11)Kimi K2 Instruct: 66.5 (2025-07-11)Kimi K2 Base: 50.2 (2025-07-11)EXAONE 4.0 32B: 79.8 (2025-07-15)Exaone 4.0 1.2B: 53.1 (2025-07-15)Qwen3-235B-A22B-Instruct-2507: 72.2 (2025-07-22)Qwen3 Coder 480B A35B Instruct: 66.5 (2025-07-22)Gemini 2.5 Flash Lite: 57 (2025-07-22)Qwen3-Coder: 55.4 (2025-07-22)Qwen3 235B A22B 2507: 84.2 (2025-07-25)Qwen3-235B-A22B-Thinking-2507: 79.6 (2025-07-25)Llama Nemotron Super 49B v1.5: 79.4 (2025-07-25)GLM 4.5 Air: 70.4 (2025-07-25)GLM-4.5: 73 (2025-07-28)Qwen3 30B A3B 2507 Instruct: 69.3 (2025-07-29)Qwen3 30B A3B 2507: 74.7 (2025-07-30)Qwen3 Coder 30B A3B Instruct: 55.4 (2025-07-31)gpt-oss-120b: 79.6 (2025-08-05)Claude Opus 4.1: 75.4 (2025-08-05)gpt-oss-20b: 73.6 (2025-08-05)Qwen3 4B 2507: 71.9 (2025-08-06)Qwen3 4B 2507 Instruct: 52.2 (2025-08-06)GPT-5: 80.5 (2025-08-07)GPT-5 mini: 79.2 (2025-08-07)GPT-5 nano: 71.2 (2025-08-07)Jamba Large 1.7: 36.5 (2025-08-08)GLM 4.5V: 70.1 (2025-08-11)Mistral Medium 3.1: 51.5 (2025-08-13)Gemma 3 270M: 7.6 (2025-08-14)NVIDIA Nemotron Nano 9B V2: 68.3 (2025-08-18)Seed-OSS-36B-Instruct: 78.8 (2025-08-20)DeepSeek-V3.1: 59.8 (2025-08-21)Hermes 4 - Llama-3.1 405B: 73.5 (2025-08-27)Hermes 4 - Llama-3.1 70B: 71.3 (2025-08-27)Grok Code Fast 1: 65.3 (2025-08-28)Apertus 70B Instruct: 27.2 (2025-09-02)Apertus 8B Instruct: 25.6 (2025-09-02)Nemotron Nano 9B V2: 77.6 (2025-09-05)Kimi K2 0905: 71 (2025-09-05)Kimi K2-Instruct-0905: 66 (2025-09-05)Gemini 2.5 Flash-Lite: 72.3 (2025-09-08)Ling-mini-2.0: 53.9 (2025-09-09)Qwen3-Next-80B-A3B: 80.3 (2025-09-10)Qwen3 Next 80B A3B Thinking: 77.5 (2025-09-11)Qwen3 Next 80B A3B Instruct: 68.9 (2025-09-11)Magistral Small 1.2: 73.9 (2025-09-17)Ling-flash-2.0: 66.9 (2025-09-17)Magistral Medium 1.2: 78.1 (2025-09-18)Grok 4 Fast: 78.7 (2025-09-19)Ring-flash-2.0: 74.6 (2025-09-19)DeepSeek V3.1 Terminus: 83.5 (2025-09-22)Qwen3 Omni 30B A3B: 73.4 (2025-09-22)Qwen3 Omni 30B A3B Instruct: 57.3 (2025-09-22)Granite 4.0 H Small: 35.7 (2025-09-22)GPT-5 Codex: 87.1 (2025-09-23)Qwen3 Max: 79.5 (2025-09-23)Qwen3 VL 235B A22B: 78.4 (2025-09-23)Qwen3 VL 235B A22B Instruct: 70.9 (2025-09-23)LFM2 2.6B: 19.2 (2025-09-23)Gemini 2.5 Flash: 78.3 (2025-09-25)Claude Sonnet 4.5: 80.4 (2025-09-29)DeepSeek V3.2 Exp: 72.2 (2025-09-29)Apriel-v1.5-15B-Thinker: 77.2 (2025-09-30)GLM-4.6: 72.4 (2025-09-30)Qwen3 VL 30B A3B: 76.2 (2025-10-03)GPT-5 Pro: 88.4 (2025-10-06)Qwen3 VL 30B A3B Instruct: 66.5 (2025-10-06)LFM2 8B A1B: 31.3 (2025-10-07)Ling-1T: 73.3 (2025-10-08)Jamba Reasoning 3B: 30.7 (2025-10-08)Ring-1T: 77.9 (2025-10-13)Qwen3 VL 8B: 49.7 (2025-10-14)Qwen3 VL 4B: 44.3 (2025-10-14)Qwen3 VL 8B Instruct: 43 (2025-10-14)Qwen3 VL 4B Instruct: 41.6 (2025-10-14)Claude Haiku 4.5: 75.3 (2025-10-15)Phi 4 Mini Instruct: 32.6 (2025-10-17)Granite 4.0 Micro: 25.6 (2025-10-20)Qwen3 VL 32B: 78.4 (2025-10-21)Qwen3 VL 32B Instruct: 66.5 (2025-10-23)MiniMax-M2: 76 (2025-10-27)NVIDIA Nemotron Nano 12B v2 VL: 69.4 (2025-10-28)Granite 4.0 H 1B: 18 (2025-10-28)Granite 4.0 1B: 17.9 (2025-10-28)Granite 4.0 H 350M: 10.4 (2025-10-28)Granite 4.0 350M: 10.2 (2025-10-28)Kimi Linear 48B A3B Instruct: 43.5 (2025-10-30)Kimi K2 Thinking: 85.6 (2025-11-06)KAT-Coder-Pro V1: 81.8 (2025-11-11)Doubao Seed Code: 79.4 (2025-11-11)GPT-5.1: 89 (2025-11-12)GPT-5.1-Codex: 88.2 (2025-11-13)GPT-5.1-Codex-Mini: 84.7 (2025-11-13)ERNIE 5.0 Thinking: 81.7 (2025-11-13)Gemini 3 Pro: 82.8 (2025-11-18)Cogito v2.1: 75.8 (2025-11-18)Gemini 3 Deep Think: 69.5 (2025-11-18)Grok 4.1 Fast: 85.6 (2025-11-19)Olmo 3 7B Think: 62.4 (2025-11-20)Olmo 3 7B Instruct: 40 (2025-11-20)Olmo 3 32B Think: 69.5 (2025-11-21)Claude Opus 4.5: 88 (2025-11-24)Apriel-v1.6-15B-Thinker: 80.3 (2025-11-25)Nova 2.0 Omni: 78.2 (2025-11-26)INTELLECT-3: 81 (2025-11-27)Nova 2.0 Pro: 80.9 (2025-11-27)DeepSeek V3.2 Speciale: 89.9 (2025-12-01)DeepSeek-V3.2: 87.1 (2025-12-01)Nova 2 Lite: 82.1 (2025-12-02)Mistral Large 3: 58.3 (2025-12-02)Ministral 3 14B: 47.9 (2025-12-02)Ministral 3 8B: 43.3 (2025-12-02)Ministral 3 3B: 33.7 (2025-12-02)Motif-2-12.7B-Reasoning: 73.6 (2025-12-04)K2-V2: 73.6 (2025-12-05)GLM 4.6V: 69.6 (2025-12-08)Devstral 2: 54.3 (2025-12-09)Devstral Small 2: 47.5 (2025-12-09)GPT-5.2: 86.2 (2025-12-11)Mi:dm K 2.5 Pro: 74.4 (2025-12-11)Molmo2-8B: 42.5 (2025-12-11)Olmo 3.1 32B Think: 70.6 (2025-12-12)MiMo-V2-Flash: 88 (2025-12-14)NVIDIA Nemotron 3 Nano 30B A3B: 80.1 (2025-12-15)K2 Think V2: 71.3 (2025-12-15)Gemini 3 Flash: 90.2 (2025-12-17)Solar Open 100B: 65.7 (2025-12-17)GLM 4.7: 89 (2025-12-22)MiniMax M2.1: 83.6 (2025-12-23)HyperCLOVA X SEED Think: 65.5 (2025-12-26)K-EXAONE: 82.3 (2025-12-31)Falcon-H1R-7B: 72.8 (2026-01-04)LFM2.5-1.2B-Instruct: 32.6 (2026-01-05)LFM2.5-VL-1.6B: 28.9 (2026-01-05)Olmo 3.1 32B Instruct: 53.9 (2026-01-13)GPT-5.2-Codex: 89.9 (2026-01-14)GLM 4.7 Flash: 58.1 (2026-01-19)Step3 VL 10B: 69 (2026-01-20)LFM2.5-1.2B-Thinking: 33.9 (2026-01-20)Kimi K2.5: 87.9 (2026-01-27)Solar Pro 3: 72.4 (2026-01-27)LongCat Flash Lite: 63.6 (2026-01-28)Step 3.5 Flash: 83.1 (2026-01-29)Qwen3 Coder Next: 73.7 (2026-02-04)Claude Opus 4.6: 79.4 (2026-02-05)Qwen3 Max Thinking: 76.1 (2026-02-09)Tri-21B-Think: 60.1 (2026-02-10)Nanbeige4.1-3B: 84.9 (2026-02-11)GLM-5: 81.9 (2026-02-11)MiniMax M2.5: 84.8 (2026-02-12)Qwen3.5 397B A17B: 89.3 (2026-02-16)Claude Sonnet 4.6: 76.3 (2026-02-17)Tiny Aya Global: 30.5 (2026-02-17)Gemini 3.1 Pro: 83.2 (2026-02-19)GPT-5.3-Codex: 91.5 (2026-02-24)Qwen3.5-27B: 85.8 (2026-02-25)Qwen3.5-122B-A10B: 85.7 (2026-02-25)Qwen3.5-35B-A3B: 84.5 (2026-02-25)LFM2-24B-A2B: 47.4 (2026-02-25)Qwen3.5 4B: 77.1 (2026-03-02)Qwen3.5 2B: 45.6 (2026-03-02)Qwen3.5 0.8B: 23.6 (2026-03-02)Mercury 2: 77 (2026-03-04)GPT-5.4: 74.9 (2026-03-05)Sarvam 105B: 73.8 (2026-03-06)Sarvam 30B: 63.3 (2026-03-06)Grok 4.20 0309: 88.5 (2026-03-10)Qwen3.5-9B: 80.6 (2026-03-10)NVIDIA Nemotron 3 Super 120B A12B: 80 (2026-03-11)GLM 5 Turbo: 84.7 (2026-03-15)Mistral Small 4: 76.9 (2026-03-16)NVIDIA Nemotron 3 Nano 4B: 51.3 (2026-03-16)GPT-5.4 mini: 87.5 (2026-03-17)GPT-5.4 nano: 81.7 (2026-03-17)MiniMax M2.7: 87.4 (2026-03-18)MiMo-V2-Pro: 87 (2026-03-18)MiMo-V2-Omni: 82.8 (2026-03-18)Nemotron Cascade 2 30B A3B: 75.8 (2026-03-19)MiMo-V2-Omni-0327: 85.5 (2026-03-27)KAT-Coder-Pro V2: 85.5 (2026-03-27)Qwen3.5 Omni Plus: 82.6 (2026-03-30)Qwen3.5 Omni Flash: 74.2 (2026-03-30)GLM 5V Turbo: 80.9 (2026-04-01)Trinity Large Thinking: 75.2 (2026-04-01)Qwen3.6 Plus: 88.2 (2026-04-02)Gemma 4 31B: 85.7 (2026-04-02)Step 3.5 Flash 2603: 82.6 (2026-04-02)Gemma 4 E2B: 43.3 (2026-04-02)Gemma 4 26B A4B: 79.2 (2026-04-03)Gemma 4 E4B: 57.6 (2026-04-03)Grok 4.20 0309 v2: 91.1 (2026-04-07)GLM 5.1: 86.8 (2026-04-07)Muse Spark: 88.4 (2026-04-08)EXAONE 4.5 33B: 79.4 (2026-04-09)JT-MINI: 67.6 (2026-04-15)Claude Opus 4.7: 90.9 (2026-04-16)Kimi K2.6: 74.9 (2026-04-20)Ling-2.6-flash: 59.3 (2026-04-21)Hy3: 86.7 (2026-04-22)MiMo-V2.5-Pro: 86.6 (2026-04-22)MiMo-V2.5: 84.9 (2026-04-22)GPT-5.5: 76.1 (2026-04-23)Ling-2.6-1T: 75.2 (2026-04-23)DeepSeek-V4-Flash: 89.4 (2026-04-24)DeepSeek-V4-Pro: 88.2 (2026-04-24)Qwen3.6 Max: 88.8 (2026-04-27)Qwen3.6 27B: 84.2 (2026-04-27)Qwen3.6 35B A3B: 84.1 (2026-04-27)Granite 4.1 30B: 48.1 (2026-04-29)Nemotron 3 Nano Omni 30B A3B Reasoning: 46.9 (2026-04-29)Granite 4.1 3B: 31.4 (2026-04-29)Mistral Medium 3.5: 74.8 (2026-04-30)Granite 4.1 8B: 43.3 (2026-04-30)Grok 4.3: 90.1 (2026-05-06)Gemini 3.1 Flash Lite: 82.2 (2026-05-07)Ring-2.6-1T: 85.7 (2026-05-08)MiniCPM-V 4.6 1.3B: 30.5 (2026-05-11)JT-35B-Flash: 82.9 (2026-05-14)Gemini 3.5 Flash: 92.2 (2026-05-19)Qwen3.7 Max: 92.3 (2026-05-21)MiniCPM5-1B: 26.9 (2026-05-25)GPT-3.5 Turbo: 35.2 (2023-03-01)GPT-3.5 TurboGPT-4: 58.3 (2023-03-14)GPT-4GPT-4 Turbo: 59.8 (2023-11-06)GPT-4 TurboGemini 1.5 Pro: 67.3 (2024-02-15)Gemini 1.5 ProGrok-1.5V: 71.3 (2024-04-12)Grok-1.5VDeepSeek-Coder-V2: 74.3 (2024-06-17)DeepSeek-Coder-V2DeepSeek VL2: 74.9 (2024-12-13)DeepSeek VL2Kimi-k1.5: 82.2 (2025-01-20)Kimi-k1.5Grok-3 Mini: 85.9 (2025-02-17)Grok-3 MiniR1 1776: 95.4 (2025-02-18)R1 1776Sonar Reasoning Pro: 95.7 (2025-03-07)Sonar Reasoning Pro
#ModelIndexReasonCodingMathAgentsMultiGeneralLong ctxContextSpeedIn $/M
1Sonar Reasoning Pro95.795.7128K$2.00
2R1 177695.495.4$0.00
3Qwen3.7 Max92.392.31M203$2.50
4Gemini 3.5 Flash92.292.21M221$1.50
5GPT-5.3-Codex91.591.5400K73$1.75
6Grok 4.20 0309 v291.191.1105$2.00
7Claude Opus 4.790.994.287.61M49$5.00
8Gemini 3 Flash90.290.484.497891M191$0.50
9Grok 4.390.190.11M88$1.25
10DeepSeek V3.2 Speciale89.987.189.696.786.3164K$0.29
11GPT-5.2-Codex89.989.9400K106$1.75
12DeepSeek-V4-Flash89.489.41M109$0.10
13Grok-4 Heavy89.388.479.4100
14Qwen3.5 397B A17B89.389.3262K53$0.39
15GLM 4.78985.989.49585.6203K98$0.40
16GPT-5.18988.186.89487400K115$1.25
17Qwen3.6 Max88.888.8262K36$1.04
18Grok 4.20 030988.588.597$2.00
19Muse Spark88.488.4$0.00
20GPT-5 Pro88.488.4400K$15.00
21GPT-5.1-Codex88.28684.995.786400K188$1.25
22Qwen3.6 Plus88.288.21M52$0.33
23DeepSeek-V4-Pro88.290.187.187.51M30$0.44
24MiMo-V2-Flash8884.686.896.384.3262K145$0.10
25Claude Opus 4.588878491.389.5200K58$5.00
26Kimi K2.587.987.9262K35$0.40
27GPT-5.4 mini87.587.5400K162$0.75
28MiniMax M2.787.487.4205K50$0.28
29GPT-5 Codex87.183.779.398.786.5400K180$1.25
30DeepSeek-V3.287.18486.29286.2131K$0.25
31MiMo-V2-Pro87871M60$1.00
32GLM 5.186.886.8203K53$0.98
33Hy386.786.7262K100$0.07
34MiMo-V2.5-Pro86.686.61M58$1.00
35GPT-5.286.272.784.710087.4400K73$1.75
36Grok-3 Mini85.98480.493.3128K100$0.30
37Qwen3.5-27B85.885.8262K91$0.20
38Qwen3.5-122B-A10B85.785.7262K129$0.26
39Gemma 4 31B85.785.7262K36$0.12
40Ring-2.6-1T85.785.7262K120$0.08
41Grok 4.1 Fast85.685.382.289.385.4$0.00
42Kimi K2 Thinking85.684.578.394.784.8262K100$0.60
43MiMo-V2-Omni-032785.585.5110$0.40
44KAT-Coder-Pro V285.585.5256K108$0.30
45Nanbeige4.1-3B84.984.9$0.00
46MiMo-V2.584.984.91M92$0.40
47MiniMax M2.584.884.8205K87$0.15
48GPT-5.1-Codex-Mini84.781.383.691.782400K175$0.25
49GLM 5 Turbo84.784.7203K$1.20
50o3 Pro84.584.5200K25$20.00
51Qwen3.5-35B-A3B84.584.5262K121$0.14
52Qwen3 235B A22B 250784.27978.894.784.359$0.40
53Qwen3.6 27B84.284.2262K64$0.30
54Qwen3.6 35B A3B84.184.1262K169$0.15
55MiniMax M2.183.6838182.787.5205K92$0.29
56DeepSeek V3.1 Terminus83.579.279.889.785.1164K$0.27
57Gemini 3.1 Pro83.285.780.61M142$2.00
58Step 3.5 Flash83.183.1262K194$0.09
59JT-35B-Flash82.982.9$0.00
60MiMo-V2-Omni82.882.8262K108$0.40
61Gemini 3 Pro82.861.58495.789.81M141$2.00
62Qwen3.5 Omni Plus82.682.654$0.40
63Step 3.5 Flash 260382.682.6197$0.00
64Grok-382.684.679.491.27880128K100$3.00
65o1-pro82.57986200K$150.00
66K-EXAONE82.378.376.890.383.8$0.00
67Kimi-k1.582.286.972.587.2
68Gemini 3.1 Flash Lite82.282.21M342$0.25
69Nova 2 Lite82.181.171.194.381.81M229$0.30
70GLM-581.98677.8203K67$0.60
71KAT-Coder-Pro V181.876.474.794.781.3108$0.30
72ERNIE 5.0 Thinking81.777.781.28583$0.00
73GPT-5.4 nano81.781.7400K157$0.20
74INTELLECT-38176.177.78882.2131K$0.20
75Nova 2.0 Pro80.978.5738983149$1.30
76Grok 3 mini Reasoning80.979.169.69282.833$0.30
77GLM 5V Turbo80.980.9203K$1.20
78Qwen3.5-9B80.680.6262K51$0.04
79GPT-580.587.382.578.466.281.387.1400K100$1.25
80Claude Sonnet 4.580.483.466.28778.187.51M42$3.00
81Apriel-v1.6-15B-Thinker80.373.380.78879$0.00
82Qwen3-Next-80B-A3B80.375.978.484.382.4262K147$0.50
83NVIDIA Nemotron 3 Nano 30B A3B80.175.774.19179.4148$0.10
84NVIDIA Nemotron 3 Super 120B A12B8080211$0.30
85EXAONE 4.0 32B79.873.974.788.981.8$0.00
86Qwen3-235B-A22B-Thinking-250779.681.192.360.984.3256K$0.30
87gpt-oss-120b79.680.975.193.467.880.8131K500$0.04
88Qwen3 Max79.576.476.780.784.1262K45$0.78
89Doubao Seed Code79.476.476.679.385.4$0.00
90EXAONE 4.5 33B79.479.4$0.00
91Llama Nemotron Super 49B v1.579.474.873.787.581.451$0.10
92Claude Opus 4.679.480.180.877.31M48$5.00
93Gemma 4 26B A4B79.279.2262K66$0.06
94GPT-5 mini79.282.383.86783.7400K200$0.25
95Qwen2.5 VL 72B Instruct79.179.1131K$0.25
96Seed-OSS-36B-Instruct78.872.676.584.781.537$0.20
97Grok 4 Fast78.785.78092.744.9902M90$0.20
98Qwen3 VL 235B A22B78.477.264.688.383.634$0.80
99Qwen3 VL 32B78.473.373.884.781.893$0.70
100o4-mini78.481.470.39557.582.983.2200K115$1.10
101Gemini 2.5 Flash78.379.371.378.384.2$0.00
102Llama 3.1 Nemotron Ultra 253B v178.37666.384.88642$0.60
103Nova 2.0 Omni78.2766689.780.9$0.30
104Grok 478.251.77995.486.6256K100$3.00
105Magistral Medium 1.278.173.9758281.542$2.00
106Ring-1T77.977.464.389.380.6$0.00
107Nemotron Nano 9B V277.66471.184.990.3131K$0.04
108Qwen3 Next 80B A3B Thinking77.577.287.861.783.1262K$0.10
109Apriel-v1.5-15B-Thinker77.271.372.887.577.3$0.00
110Sonar Reasoning77.262.392.1$0.00
111Qwen3.5 4B77.177.1164$0.00
112Mercury 27777128K790$0.25
113Mistral Small 476.976.9262K145$0.15
114Gemini 2.5 Pro Preview 06-0576.686.472.88882541M85$1.25
115Claude Sonnet 4.676.372.979.61M75$3.00
116Qwen3 VL 30B A3B76.27269.782.380.7122$0.20
117Qwen3 Max Thinking76.186.153.582.382.4262K45$0.78
118GPT-5.576.193.558.61.1M67$5.00
119MiniMax-M27677.766.178.382205K91$0.26
120Cogito v2.175.876.868.872.784.956$1.30
121Nemotron Cascade 2 30B A3B75.875.8$0.00
122Qwen375.865.881.580128K
123MiniMax M1 80k75.569.771.179.581.6$0.60
124Claude Opus 4.175.480.961.17869.288200K120$15.00
125Claude Haiku 4.575.37353.896.373.480200K100$1.00
126Trinity Large Thinking75.275.2262K129$0.22
127Ling-2.6-1T75.275.2262K$0.08
128DeepSeek-R17571.561.782.384.4128K189$0.55
129DeepSeek VL274.974.9129K22$9.50
130Qwen3 235B A22B74.968.268.386.770.880.3131K68$0.46
131Kimi K2.674.991.158.6262K57$0.73
132GPT-5.474.99257.71.1M84$2.50
133Mistral Medium 3.574.874.8262K140$1.50
134Qwen3 30B A3B 250774.770.770.776.980.5151$0.30
135Claude 3.7 Sonnet74.784.850.979.169.87588.5200K101$3.00
136Ring-flash-2.074.672.562.883.779.3$0.10
137Claude Sonnet 474.575.457.984.870.374.484.21M101$3.00
138Mi:dm K 2.5 Pro74.472.265.678.781.3$0.00
139DeepSeek-Coder-V274.374.3$0.00
140Qwen3.5 Omni Flash74.274.2235$0.10
141Magistral Small 1.273.966.372.380.376.8106$0.50
142Sarvam 105B73.873.8128$0.00
143Qwen3 32B73.866.865.783.570.382.8131K328$0.08
144Qwen3 Coder Next73.773.7262K92$0.11
145K2-V273.668.169.478.378.6$0.00
146Motif-2-12.7B-Reasoning73.669.565.180.379.6$0.00
147gpt-oss-20b73.671.577.789.354.874.8131K1000$0.03
148Kimi K273.676.660.774.682.4131K26$0.57
149Hermes 4 - Llama-3.1 405B73.572.768.669.782.934$1.00
150Qwen3 Omni 30B A3B73.472.667.97479.2102$0.30
151Ling-1T73.371.967.771.382.2$0.00
152Phi 4 Mini Reasoning73.35294.6
153DeepSeek VL2 Small73.173.1
154Gemini 2.5 Flash73.182.862.18679.755.11M85$0.30
155GLM-4.57379.158.287.655.584.6131K85$0.60
156Falcon-H1R-7B72.866.172.48072.5$0.00
157Solar Pro 272.568.761.67980.5$0.00
158Solar Pro 372.472.4128K$0.15
159GLM-4.672.48159.393.945.182.9203K85$0.43
160Gemini 2.5 Flash-Lite72.370.968.868.780.8$0.10
161Qwen3-235B-A22B-Instruct-250772.277.565.984.257.775.9131K63$0.15
162DeepSeek V3.2 Exp72.279.963.586.440.191.1164K100$0.27
163Qwen3 4B 250771.966.764.182.774.3$0.00
164Qwen3 30B A3B71.765.862.682.469.178.8131K122$0.09
165Gemini 2.5 Pro71.644.573.392.279.668.41M85$1.25
166o371.647.177.173.364.98285.3200K50$2.00
167DeepSeek R1 Zero71.573.35091.3
168K2 Think V271.371.3$0.00
169Hermes 4 - Llama-3.1 70B71.369.965.368.781.160$0.10
170Grok-1.5V71.371.3
171GPT-5 nano71.271.278.956.878400K500$0.05
172Kimi K2 09057175.86164.782.5262K16$0.60
173QvQ-72B-Preview70.970.9
174Ministral 8B Instruct70.970.9128K0$0.10
175Qwen3 VL 235B A22B Instruct70.971.259.470.782.3262K51$0.20
176Olmo 3.1 32B Think70.659.169.577.376.3$0.00
177o1-mini70.56057.69074.2128K115$3.00
178Phi 4 Reasoning Plus70.468.953.179.780
179GLM 4.5 Air70.47552.889.453.381.4131K63$0.13
180Claude 3.5 Sonnet70.382.543.677.157.683.377.6200K101$3.00
181DeepSeek R1 Distill Llama 70B70.165.257.578.379.5128K37$0.10
182GLM 4.5V70.168.460.47378.866K85$0.60
183Qwen2.5 VL 7B Instruct7070
184GLM 4.6V69.671.941.185.379.9131K44$0.30
185Olmo 3 32B Think69.56167.273.775.966K$0.15
186Gemini 3 Deep Think69.569.51M$0.00
187NVIDIA Nemotron Nano 12B v2 VL69.457.269.47575.9244$0.20
188Claude Opus 469.444.158.486.970.587.3200K120$15.00
189Qwen3 30B A3B 2507 Instruct69.365.951.581.977.7122$0.20
190Gemini 2.0 Flash Thinking69.174.232.183.975.479.8$0.00
191Step3 VL 10B6969$0.00
192Qwen3 Next 80B A3B Instruct68.972.968.769.551.981.3262K161$0.09
193Granite 3.3 8B Instruct68.564.375.166.2
194DeepSeek R1 Distill Qwen 32B68.462.157.280.273.9128K37$0.12
195NVIDIA Nemotron Nano 9B V268.35772.469.774.2129$0.00
196Llama 3.1 Nemotron Nano 8B V168.254.171.379.3
197ERNIE 4.5 300B A47B68.281.146.767.277.6131K24$0.28
198QwQ-32B67.865.263.466.466.477.831$0.70
199MiniMax M1 40k67.668.265.755.580.8$0.00
200JT-MINI67.667.6$0.00
201Gemini 2.0 Pro67.462.234.792.380.5$0.00
202Qwen2-VL-72B-Instruct67.367.3
203Gemini 1.5 Pro67.374.431.687.66775.82M85$1.25
204DeepSeek VL2 Tiny67.267.2
205Ling-flash-2.066.965.758.965.377.791$0.10
206Qwen3 14B66.860.452.377.177.4132K62$0.10
207Qwen2.5 32B Instruct66.76750.180.569$0.00
208Qwen3 Coder 480B A35B Instruct66.561.858.566.878.869$0.30
209Kimi K2 Instruct66.575.160.463.863.669.6131K45$0.57
210Qwen3 VL 30B A3B Instruct66.569.547.672.376.4262K123$0.13
211Qwen3 VL 32B Instruct66.567.151.468.379.1262K76$0.10
212Phi 4 Reasoning66.465.853.869.177
213DeepSeek R1 0528 Qwen3 8B66.261.251.378.573.9$0.00
214Qwen2.5 14B Instruct66.161.972.863.7
215Pixtral-12B66.170.861.3128K0$0.15
216Kimi K2-Instruct-09056675.15863.863.669.6
217Grok-2 mini65.95174.872
218DeepSeek-V3 032465.968.449.264.881.2164K$0.28
219Solar Open 100B65.765.7$0.00
220DeepSeek R1 Distill Qwen 14B65.759.153.176.574$0.00
221Magistral Medium 165.567.952.76675.3$0.00
222HyperCLOVA X SEED Think65.561.562.95978.5$0.00
223o165.47854.558.960.474.766200K66$15.00
224Grok Code Fast 165.372.765.743.379.3$0.00
225Magistral Small 164.764.151.468.874.6$0.00
226Granite 3.3 8B Base64.652.675.166.2
227o3-mini64.177.262.5654570.6200K115$1.10
228Llama-3.3 Nemotron Super 49B v163.966.72877.583.4$0.00
229Llama 4 Maverick63.969.836.754.178.280.51M639$0.15
230GPT-4.163.866.351.253.758.773.579.61M100$2.00
231Qwen2.5 Max63.658.735.983.576.250$1.60
232LongCat Flash Lite63.663.6110$0.00
233DeepSeek-V2.563.484.316.876.376.28K100$0.14
234Sarvam 30B63.363.3214$0.00
235DeepSeek-R1-052863.38148.889.28.988.7131K45$0.55
236Magistral Medium62.970.848.769.3
237QwQ-32B-Preview62.665.25070.364.833K99$0.15
238Olmo 3 7B Think62.451.661.770.765.5$0.00
239Grok-262.45626.777.876.275.5128K85$2.00
240Qwen2.5 VL 32B Instruct62.14671.468.8
241Mistral Small 3 24B Instruct62.145.378.932K134$0.10
242Magistral Small 250662.168.251.366.8
243Gemini 1.5 Flash61.968.327.382.764.167.31M150$0.15
244Phi-3.5-vision-instruct61.761.7
245Nova Pro61.673.123.342.868.481.580.6300K100$0.80
246MiniMax-M161.561.51M$0.40
247Llama 3.1 405B Instruct60.967.830.536.788.580.9128K100$0.89
248Gemini 2.0 Flash60.362.135.157.470.776.41M183$0.10
249Tri-21B-Think60.160.1$0.00
250Qwen2 72B Instruct59.962.442.670.164.4$0.00
251DeepSeek-V3.159.874.955.549.93088.6164K$0.21
252GPT-4 Turbo59.86729.173.769.4128K100$10.00
253GPT-4.559.471.441.536.759.273.873.8128K50$75.00
254Ling-2.6-flash59.359.3262K$0.01
255Qwen2.5 72B Instruct59.14965.349.972.2131K100$0.36
256Llama 4 Scout58.957.232.849.280.874.310M776$0.08
257Sonar Pro58.857.827.574.575.5200K$3.00
258Mistral Medium 358.657.84060.576131K32$0.40
259Claude 3 Opus58.573.427.964.168.5200K120$15.00
260Gemma 3 27B58.46529.78356131K33$0.08
261DeepSeek R1 Distill Qwen 7B58.349.137.688.1
262Mistral Large 358.36846.53880.7262K54$0.50
263GPT-458.358.38K104$30.00
264GPT-4.1 Mini58.26534.654.345.972.976.41M150$0.40
265GLM 4.7 Flash58.158.1203K113$0.06
266DeepSeek-V358.175.452.251.862.348.7131K100$0.23
267Qwen3 8B57.858.940.657.474.3131K69$0.05
268Nova Lite57.768.216.741.866.678.574.4300K100$0.06
269Gemma 4 E4B57.657.6$0.00
270Llama 3.1 Tulu3 405B57.551.629.177.871.6$0.00
271Qwen3 Omni 30B A3B Instruct57.36242.252.372.5103$0.30
272o1-preview57.373.341.367.247.3128K66$15.00
273Mistral Saba57.142.467.761.1$0.00
274Gemini 2.5 Flash Lite5764.630.773.472.943.31M6$0.10
275Gemma 3n E4B56.956.9
276Sonar56.847.129.581.768.9127K$1.00
277Qwen3 4B56.552.246.557.869.6103$0.10
278Sarvam M56.441.629.584.769.6136$0.00
279GPT-4o56.470.131.242.75377.763.7128K132$2.50
280Reka Flash 356.252.943.561.566.966K93$0.10
281Mistral Small 3.2 24B Instruct56.246.18141.4
282Llama 3.1 70B Instruct5660.723.234.584.877131K1204$0.40
283Command A55.976.128.747.571.2256K203$2.50
284Gemma 3 12B55.563.324.682.351.9131K33$0.04
285Qwen3 Coder 30B A3B Instruct55.451.640.359.270.6160K97$0.07
286Qwen3-Coder55.455.4262K
287Llama 3.1 Nemotron Nano 4B v1.154.540.849.372.455.6$0.00
288Claude 3.5 Haiku54.562.43672.136.965200K104$0.80
289Gemini 2.0 Flash Lite54.451.518.587.36846.71M85$0.08
290Devstral 254.359.444.836.776.2262K51$0.40
291Llama 3.2 90B Instruct5446.721.462.971.867.1128K100$0.35
292Ling-mini-2.053.956.242.949.367.1$0.00
293Olmo 3.1 32B Instruct53.953.9$0.00
294Grok53.847.124.173.770.3$0.00
295DeepSeek R1 Distill Llama 8B53.34939.670.154.3$0.00
296Exaone 4.0 1.2B53.151.551.650.358.8$0.00
297Nova Premier53.156.931.750.673.340$2.50
298Pixtral Large53.150.526.136.981.770.1131K0$2.00
299Reka Flash52.952.985$0.20
300Qwen3 4B 2507 Instruct52.251.737.752.367.2$0.00
301Qwen2.5-Omni-7B51.530.865.871.238.3
302Mistral Medium 3.151.558.840.638.368.3131K47$0.40
303NVIDIA Nemotron 3 Nano 4B51.351.3$0.00
304Mistral Small 3.25150.527.557.768.1100$0.10
305Mistral Small 3.1 24B Base50.937.559.356128K137$0.10
306Llama 3.3 70B Instruct50.650.528.842.580.5131K2220$0.10
307Qwen2.5 Turbo50.34116.380.563.367$0.10
308Grok-1.550.335.96451
309Kimi K2 Base50.248.152.3
310Qwen2.5 Coder 32B Instruct50.141.731.476.750.4128K110$0.66
311Phi-3.5-MoE-instruct49.85841.6
312Qwen3 VL 8B49.757.935.330.774.9120$0.20
313Gemma 3n E2B49.149.1
314GPT-4o-mini49.1601646.858.164.8128K92$0.15
315Nova Micro4966.31438.256.270.2128K100$0.03
316Gemini 1.5 Flash 8B48.438.421.768.954.258.71M150$0.07
317Granite 4.1 30B48.148.1$0.00
318Mistral Small 3.1 24B Instruct484659.338.6
319IBM Granite 4.0 Tiny Preview485144.9
320Ministral 3 14B47.957.235.13069.3262K67$0.20
321Mistral Large 247.948.629.343.869.7128K42$2.00
322Devstral Medium47.849.233.737.770.8131K72$0.40
323Phi 447.665.823.149.551.916K33$0.07
324Devstral Small 247.553.234.834.367.862$0.00
325LFM2-24B-A2B47.447.4128K208$0.03
326Qwen3 1.7B46.935.630.864.157138$0.10
327Nemotron 3 Nano Omni 30B A3B Reasoning46.946.9301$0.10
328Qwen2.5 7B Instruct46.636.449.653.9131K138$0.04
329Phi-4-multimodal-instruct46.231.513.169.368.848.5128K25$0.05
330Phi-3.5-mini-instruct4649.742.2128K23$0.10
331Claude 3 Sonnet45.867.417.541.456.8200K120$3.00
332Gemma 3 4B45.851.512.673.145.9131K33$0.04
333Qwen3.5 2B45.645.6328$0.00
334Devstral Small45.343.425.848.963.2190$0.10
335Phi 4 Mini45.347.842.8
336Llama 3.1 8B Instruct454511.628.176.164.4131K2047$0.02
337Gemma 3 27B Instruct44.542.813.754.566.9$0.10
338Mistral Small 3 24B Base44.434.454.4
339Qwen3 VL 4B44.349.43225.770$0.00
340DeepHermes 3 - Mistral 24B43.838.219.559.558$0.00
341Llama 3.1 Nemotron 70B Instruct43.746.516.942.269292$1.20
342Mistral Small 343.646.225.237.965.233K136$0.05
343Kimi Linear 48B A3B Instruct43.541.237.836.358.5$0.00
344Gemma 4 E2B43.343.3$0.00
345Ministral 3 8B43.347.130.331.764.2262K86$0.15
346Granite 4.1 8B43.343.3131K133$0.05
347Qwen3 VL 8B Instruct4342.733.227.368.6256K145$0.08
348Jamba 1.5 Large42.836.914.360.659.5256K100$2.00
349Jamba 1.6 Large42.638.717.25856.552$2.00
350Hermes 3 - Llama-3.1 70B42.540.118.853.857.132$0.30
351Molmo2-8B42.542.5$0.00
352Mistral Small 3.142.445.421.237.265.9134$0.10
353GPT-4.1 Nano42.150.316.246.118.355.865.81M200$0.10
354Qwen3 VL 4B Instruct41.637.1293763.4$0.00
355Llama 3 70B Instruct40.837.919.848.357.48K45$0.51
356Mistral Small40.338.114.156.352.9134$0.20
357Gemma 3 12B Instruct4034.913.751.859.5$0.10
358Olmo 3 7B Instruct404026.641.352.2$0.10
359Qwen2.5-Coder 7B Instruct39.633.918.26640.1$0.00
360Mistral Large39.335.117.852.751.5128K$2.00
361Mixtral 8x22B Instruct39.133.214.854.553.766K$2.00
362Claude 3 Haiku38.961.815.439.4200K104$0.25
363Qwen2 7B Instruct37.425.342.944.1
364Llama 3.2 11B Instruct36.732.81126.766.446.4128K168$0.05
365Jamba Large 1.736.53918.131.257.7256K48$2.00
366Granite 4.0 H Small35.741.625.113.762.4524$0.10
367GPT-3.5 Turbo35.250.544.1046.216K100$0.50
368Gemma 3n E4B Instruct34.729.614.645.748.856$0.00
369Claude 2.134.631.919.537.449.5$0.00
370Gemini 1.0 Pro3427.911.640.347.343.133K120$0.50
371LFM2.5-1.2B-Thinking33.933.9$0.00
372Ministral 3 3B33.735.824.72252.4131K154$0.10
373Mistral Medium33.634.99.940.549.145$2.80
374Claude 233.434.417.148.6100K$0.00
375LFM 40B33.232.79.64842.5$0.00
376Solar Mini33.133.163$0.20
377LFM2.5-1.2B-Instruct32.632.6$0.00
378DeepSeek R1 Distill Qwen 1.5B32.633.816.952.926.9$0.00
379Phi 4 Mini Instruct32.633.112.638.246.5131K$0.08
380Granite 3.3 8B32.533.812.736.646.8376$0.00
381Llama 3 8B Instruct32.429.69.649.940.58K81$0.04
382Gemma 3 4B Instruct31.729.111.244.741.7$0.00
383Granite 4.1 3B31.431.4$0.00
384LFM2 8B A1B31.334.415.125.350.5$0.00
385Llama 3.2 3B Instruct30.832.88.326.156.1131K172$0.05
386Jamba Reasoning 3B30.733.32110.757.7$0.00
387Tiny Aya Global30.530.5126$0.00
388MiniCPM-V 4.6 1.3B30.530.5$0.00
389Gemma 3n E4B Instructed LiteRT Preview30.345.813.211.650.6
390DeepSeek Coder V2 Lite Instruct30.231.915.842.9$0.00
391Gemini Diffusion30.240.426.923.3
392Jamba 1.5 Mini29.632.36.235.744.3256K100$0.20
393Qwen3 0.6B29.323.912.146.534.7225$0.10
394Qwen1.5 Chat 110B28.928.9$0.00
395Llama 2 Chat 13B28.932.19.832.940.6$0.00
396Llama 2 Chat 70B28.932.79.832.340.6$0.00
397LFM2.5-VL-1.6B28.928.9$0.00
398Command R+28.932.312.227.943.2128K100$0.15
399Claude Instant28.43310.926.443.4$0.00
400DBRX Instruct27.533.19.327.939.7$0.00
401Phi-3 Mini Instruct 3.8B27.531.911.62343.5$0.00
402Gemma 3n E2B Instruct27.522.99.539.737.8$0.00
403Apertus 70B Instruct27.227.2$0.80
404MiniCPM5-1B26.926.9$0.00
405Mixtral 8x7B Instruct26.129.26.629.938.7$0.50
406Apertus 8B Instruct25.625.6$0.10
407Granite 4.0 Micro25.633.618644.7131K$0.02
408Gemma 3n E2B Instructed LiteRT (Preview)25.44113.26.740.5
409Jamba 1.6 Mini24.9307.125.736.7183$0.20
410Gemma 3n E4B Instructed24.823.713.211.650.632K42$20.00
411OpenChat 3.524.12311.530.731$0.00
412Qwen3.5 0.8B23.623.6120$0.00
413OLMo 2 32B23.532.86.83.351.1$0.00
414DeepHermes 3 - Llama-3.1 8B23.5278.521.836.5$0.00
415Jamba 1.7 Mini22.532.26.113.138.8$0.00
416Gemma 3n E2B Instructed21.324.813.26.740.5
417Gemma 3 1B21.229.21.932.4
418LFM2 2.6B19.230.68.18.329.8$0.00
419Granite 4.0 H 1B1826.311.56.327.7$0.00
420Granite 4.0 1B17.928.14.76.332.5$0.00
421Molmo 7B-D16.3243.9037.1$0.00
422Gemma 3 1B Instruct16.223.71.725.913.5$0.00
423OLMo 2 7B15.528.84.10.728.2$0.00
424Mistral 7B Instruct14.717.74.612.124.590$0.20
425LFM2 1.2B13.522.823.325.7$0.00
426Llama 3.2 1B Instruct12.119.61.9720131K91$0.03
427Llama 2 Chat 7B11.322.70.25.916.4113$0.10
428Granite 4.0 H 350M10.425.71.91.312.7$0.00
429Granite 4.0 350M10.226.12.4012.4$0.00
430Gemma 3 270M7.622.40.32.35.5$0.00

430 models ranked. The intelligence index is a balanced mean of per-category scores; category columns average the benchmarks within each. Scores are curated approximations — see each model for sources. Click any column to sort.