Leaderboards
Model rankings
A balanced intelligence index averages each model's per-category scores. Drill into a category for individual benchmarks, or sort by speed, price, and context.
Updated May 25, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter
Price vs. intelligence
Intelligence index vs. input price — up and to the left is better value.
Speed vs. intelligence
Intelligence index vs. output speed — up and to the right is fast and smart.
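One way to read the two charts above is as Pareto fronts: a model is good value if no other model is at least as good on both axes and strictly better on one. The sketch below is illustrative only. The function name `pareto_front` and the dictionary keys are hypothetical, and the five sample rows are copied from the top of the table on this page.

```python
def pareto_front(models, x_key, lower_x_is_better):
    """Return the names of models that no other model beats on both axes."""
    def x_no_worse(a, b):
        # True when model a is at least as good as model b on the x axis.
        if lower_x_is_better:
            return a[x_key] <= b[x_key]
        return a[x_key] >= b[x_key]

    front = []
    for m in models:
        dominated = any(
            o is not m
            and x_no_worse(o, m)
            and o["index"] >= m["index"]
            # Require a strict improvement on at least one axis.
            and (o[x_key] != m[x_key] or o["index"] != m["index"])
            for o in models
        )
        if not dominated:
            front.append(m["name"])
    return front


# Sample rows taken from the table (General idx, input price, speed).
rows = [
    {"name": "DeepSeek V3.2 Exp", "index": 91.1, "price": 0.27, "speed": 100},
    {"name": "Grok 4 Fast", "index": 90.0, "price": 0.20, "speed": 90},
    {"name": "Gemini 3 Pro", "index": 89.8, "price": 2.00, "speed": 141},
    {"name": "Claude Opus 4.5", "index": 89.5, "price": 5.00, "speed": 58},
    {"name": "Gemini 3 Flash", "index": 89.0, "price": 0.50, "speed": 191},
]

# Price chart: lower price is better ("up and to the left").
print(pareto_front(rows, "price", lower_x_is_better=True))
# Speed chart: higher speed is better ("up and to the right").
print(pareto_front(rows, "speed", lower_x_is_better=False))
```

Among these five rows only, the price front is DeepSeek V3.2 Exp and Grok 4 Fast, and the speed front is DeepSeek V3.2 Exp, Gemini 3 Pro, and Gemini 3 Flash. Rows with a missing price or speed would need to be filtered out before calling the function.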
| # | Model | General index | Multi-IF | LiveBench | Arena Hard | Humanity’s Last Exam | IFEval | SimpleQA | MMLU-Pro | MMLU | Context | Speed | Input price ($/M tokens) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | DeepSeek V3.2 Exp | 91.1 | — | — | — | 19.8 | — | 97.1 | 85 | — | 164K | 100 | $0.27 |
| 2 | Nemotron Nano 9B V2 | 90.3 | — | — | — | — | 90.3 | — | — | — | 131K | — | $0.04 |
| 3 | Grok 4 Fast | 90 | — | — | — | 20 | — | 95 | 85 | — | 2M | 90 | $0.20 |
| 4 | Gemini 3 Pro | 89.8 | — | — | — | 37.5 | — | — | 89.8 | — | 1M | 141 | $2.00 |
| 5 | Claude Opus 4.5 | 89.5 | — | — | — | 28.4 | — | — | 89.5 | — | 200K | 58 | $5.00 |
| 6 | Gemini 3 Flash | 89 | — | — | — | 34.7 | — | — | 89 | — | 1M | 191 | $0.50 |
| 7 | DeepSeek-R1-0528 | 88.7 | — | — | — | 17.7 | — | 92.3 | 85 | — | 131K | 45 | $0.55 |
| 8 | DeepSeek-V3.1 | 88.6 | — | — | — | 15.9 | — | 93.4 | 83.7 | — | 164K | — | $0.21 |
| 9 | Claude 3.7 Sonnet | 88.5 | — | — | — | 10.3 | 93.2 | — | 83.7 | 86.1 | 200K | 101 | $3.00 |
| 10 | Claude Opus 4.1 | 88 | — | — | — | 11.9 | — | — | 88 | — | 200K | 120 | $15.00 |
| 11 | DeepSeek-V4-Pro | 87.5 | — | — | — | 35.9 | — | — | 87.5 | — | 1M | 30 | $0.44 |
| 12 | MiniMax M2.1 | 87.5 | — | — | — | 22.2 | — | — | 87.5 | — | 205K | 92 | $0.29 |
| 13 | Claude Sonnet 4.5 | 87.5 | — | — | — | 17.3 | — | — | 87.5 | — | 1M | 42 | $3.00 |
| 14 | GPT-5.2 | 87.4 | — | — | — | 35.4 | — | — | 87.4 | — | 400K | 73 | $1.75 |
| 15 | Claude Opus 4 | 87.3 | — | — | — | 11.7 | — | — | 87.3 | 88.8 | 200K | 120 | $15.00 |
| 16 | Kimi-k1.5 | 87.2 | — | — | — | — | 87.2 | — | — | 87.4 | — | — | — |
| 17 | GPT-5 | 87.1 | — | — | — | 24.8 | — | — | 87.1 | 92.5 | 400K | 100 | $1.25 |
| 18 | GPT-5.1 | 87 | — | — | — | 26.5 | — | — | 87 | — | 400K | 115 | $1.25 |
| 19 | Grok 4 | 86.6 | — | — | — | 40 | — | — | 86.6 | — | 256K | 100 | $3.00 |
| 20 | GPT-5 Codex | 86.5 | — | — | — | 25.6 | — | — | 86.5 | — | 400K | 180 | $1.25 |
| 21 | DeepSeek V3.2 Speciale | 86.3 | — | — | — | 26.1 | — | — | 86.3 | — | 164K | — | $0.29 |
| 22 | DeepSeek-V3.2 | 86.2 | — | — | — | 22.2 | — | — | 86.2 | — | 131K | — | $0.25 |
| 23 | GPT-5.1-Codex | 86 | — | — | — | 23.4 | — | — | 86 | — | 400K | 188 | $1.25 |
| 24 | Llama 3.1 Nemotron Ultra 253B v1 | 86 | — | — | — | 8.1 | 89.5 | — | 82.5 | — | — | 42 | $0.60 |
| 25 | GLM 4.7 | 85.6 | — | — | — | 25.1 | — | — | 85.6 | — | 203K | 98 | $0.40 |
| 26 | Grok 4.1 Fast | 85.4 | — | — | — | 17.6 | — | — | 85.4 | — | — | — | $0.00 |
| 27 | Doubao Seed Code | 85.4 | — | — | — | 13.3 | — | — | 85.4 | — | — | — | $0.00 |
| 28 | o3 | 85.3 | — | — | — | 24.3 | — | — | 85.3 | — | 200K | 50 | $2.00 |
| 29 | DeepSeek V3.1 Terminus | 85.1 | — | — | — | 15.2 | — | — | 85.1 | — | 164K | — | $0.27 |
| 30 | Cogito v2.1 | 84.9 | — | — | — | 11 | — | — | 84.9 | — | — | 56 | $1.30 |
| 31 | Kimi K2 Thinking | 84.8 | — | — | — | 22.3 | — | — | 84.8 | — | 262K | 100 | $0.60 |
| 32 | GLM-4.5 | 84.6 | — | — | — | 14.4 | — | — | 84.6 | — | 131K | 85 | $0.60 |
| 33 | DeepSeek-R1 | 84.4 | — | — | — | 9.3 | — | — | 84.4 | 90.8 | 128K | 189 | $0.55 |
| 34 | MiMo-V2-Flash | 84.3 | — | — | — | 21.1 | — | — | 84.3 | — | 262K | 145 | $0.10 |
| 35 | Qwen3 235B A22B 2507 | 84.3 | — | — | — | 15 | — | — | 84.3 | — | — | 59 | $0.40 |
| 36 | Qwen3-235B-A22B-Thinking-2507 | 84.3 | 80.6 | — | — | 18.2 | 87.8 | — | 84.4 | — | 256K | — | $0.30 |
| 37 | Gemini 2.5 Flash | 84.2 | — | — | — | 12.7 | — | — | 84.2 | — | — | — | $0.00 |
| 38 | Claude Sonnet 4 | 84.2 | — | — | — | 9.6 | — | — | 84.2 | 88 | 1M | 101 | $3.00 |
| 39 | Qwen3 Max | 84.1 | — | — | — | 11.1 | — | — | 84.1 | — | 262K | 45 | $0.78 |
| 40 | K-EXAONE | 83.8 | — | — | — | 13.1 | — | — | 83.8 | — | — | — | $0.00 |
| 41 | GPT-5 mini | 83.7 | — | — | — | 16.7 | — | — | 83.7 | — | 400K | 200 | $0.25 |
| 42 | Qwen3 VL 235B A22B | 83.6 | — | — | — | 10.1 | — | — | 83.6 | — | — | 34 | $0.80 |
| 43 | Llama-3.3 Nemotron Super 49B v1 | 83.4 | — | — | 88.3 | 6.5 | — | — | 78.5 | — | — | — | $0.00 |
| 44 | o4-mini | 83.2 | — | — | — | 14.7 | — | — | 83.2 | — | 200K | 115 | $1.10 |
| 45 | Qwen3 Next 80B A3B Thinking | 83.1 | 77.8 | — | — | — | 88.9 | — | 82.7 | — | 262K | — | $0.10 |
| 46 | ERNIE 5.0 Thinking | 83 | — | — | — | 12.7 | — | — | 83 | — | — | — | $0.00 |
| 47 | Nova 2.0 Pro | 83 | — | — | — | 8.9 | — | — | 83 | — | — | 149 | $1.30 |
| 48 | Hermes 4 - Llama-3.1 405B | 82.9 | — | — | — | 10.3 | — | — | 82.9 | — | — | 34 | $1.00 |
| 49 | GLM-4.6 | 82.9 | — | — | — | 17.2 | — | — | 82.9 | — | 203K | 85 | $0.43 |
| 50 | Grok 3 mini Reasoning | 82.8 | — | — | — | 11.1 | — | — | 82.8 | — | — | 33 | $0.30 |
| 51 | Qwen3 32B | 82.8 | — | 74.9 | 93.8 | 8.3 | — | — | 79.8 | — | 131K | 328 | $0.08 |
| 52 | Kimi K2 0905 | 82.5 | — | — | — | 6.3 | — | — | 82.5 | 90.2 | 262K | 16 | $0.60 |
| 53 | Qwen3-Next-80B-A3B | 82.4 | — | — | — | 11.7 | — | — | 82.4 | — | 262K | 147 | $0.50 |
| 54 | Qwen3 Max Thinking | 82.4 | — | — | — | 26.2 | — | — | 82.4 | — | 262K | 45 | $0.78 |
| 55 | Kimi K2 | 82.4 | — | — | — | 7 | — | — | 82.4 | 89.5 | 131K | 26 | $0.57 |
| 56 | Qwen3 VL 235B A22B Instruct | 82.3 | — | — | — | 6.3 | — | — | 82.3 | — | 262K | 51 | $0.20 |
| 57 | INTELLECT-3 | 82.2 | — | — | — | 12.1 | — | — | 82.2 | — | 131K | — | $0.20 |
| 58 | Ling-1T | 82.2 | — | — | — | 7.2 | — | — | 82.2 | — | — | — | $0.00 |
| 59 | GPT-5.1-Codex-Mini | 82 | — | — | — | 16.9 | — | — | 82 | — | 400K | 175 | $0.25 |
| 60 | MiniMax-M2 | 82 | — | — | — | 12.5 | — | — | 82 | — | 205K | 91 | $0.26 |
| 61 | Nova 2 Lite | 81.8 | — | — | — | 10.9 | — | — | 81.8 | — | 1M | 229 | $0.30 |
| 62 | EXAONE 4.0 32B | 81.8 | — | — | — | 10.5 | — | — | 81.8 | — | — | — | $0.00 |
| 63 | Qwen3 VL 32B | 81.8 | — | — | — | 9.6 | — | — | 81.8 | — | — | 93 | $0.70 |
| 64 | MiniMax M1 80k | 81.6 | — | — | — | 8.2 | — | — | 81.6 | — | — | — | $0.60 |
| 65 | Seed-OSS-36B-Instruct | 81.5 | — | — | — | 9.1 | — | — | 81.5 | — | — | 37 | $0.20 |
| 66 | Magistral Medium 1.2 | 81.5 | — | — | — | 9.6 | — | — | 81.5 | — | — | 42 | $2.00 |
| 67 | Llama Nemotron Super 49B v1.5 | 81.4 | — | — | — | 6.8 | — | — | 81.4 | — | — | 51 | $0.10 |
| 68 | GLM 4.5 Air | 81.4 | — | — | — | 10.6 | — | — | 81.4 | — | 131K | 63 | $0.13 |
| 69 | KAT-Coder-Pro V1 | 81.3 | — | — | — | 33.4 | — | — | 81.3 | — | — | 108 | $0.30 |
| 70 | Mi:dm K 2.5 Pro | 81.3 | — | — | — | 8.8 | — | — | 81.3 | — | — | — | $0.00 |
| 71 | Qwen3 Next 80B A3B Instruct | 81.3 | 75.8 | — | — | 7.3 | 87.6 | — | 80.6 | — | 262K | 161 | $0.09 |
| 72 | DeepSeek-V3 0324 | 81.2 | — | — | — | 5.2 | — | — | 81.2 | — | 164K | — | $0.28 |
| 73 | Hermes 4 - Llama-3.1 70B | 81.1 | — | — | — | 7.9 | — | — | 81.1 | — | — | 60 | $0.10 |
| 74 | Nova 2.0 Omni | 80.9 | — | — | — | 6.8 | — | — | 80.9 | — | — | — | $0.30 |
| 75 | Llama 3.1 405B Instruct | 80.9 | — | — | — | 4.2 | 88.6 | — | 73.3 | 87.3 | 128K | 100 | $0.89 |
| 76 | gpt-oss-120b | 80.8 | — | — | — | 19 | — | — | 80.8 | 90 | 131K | 500 | $0.04 |
| 77 | Gemini 2.5 Flash-Lite | 80.8 | — | — | — | 6.6 | — | — | 80.8 | — | — | — | $0.10 |
| 78 | MiniMax M1 40k | 80.8 | — | — | — | 7.5 | — | — | 80.8 | — | — | — | $0.00 |
| 79 | Qwen3 VL 30B A3B | 80.7 | — | — | — | 8.7 | — | — | 80.7 | — | — | 122 | $0.20 |
| 80 | Mistral Large 3 | 80.7 | — | — | — | 4.1 | — | — | 80.7 | — | 262K | 54 | $0.50 |
| 81 | Ring-1T | 80.6 | — | — | — | 10.2 | — | — | 80.6 | — | — | — | $0.00 |
| 82 | Nova Pro | 80.6 | — | — | — | 3.4 | 92.1 | — | 69.1 | 85.9 | 300K | 100 | $0.80 |
| 83 | Qwen3 30B A3B 2507 | 80.5 | — | — | — | 9.8 | — | — | 80.5 | — | — | 151 | $0.30 |
| 84 | Solar Pro 2 | 80.5 | — | — | — | 7 | — | — | 80.5 | — | — | — | $0.00 |
| 85 | Gemini 2.0 Pro | 80.5 | — | — | — | 6.8 | — | — | 80.5 | — | — | — | $0.00 |
| 86 | Llama 4 Maverick | 80.5 | — | — | — | 4.8 | — | — | 80.5 | 85.5 | 1M | 639 | $0.15 |
| 87 | Llama 3.3 70B Instruct | 80.5 | — | — | — | 4 | 92.1 | — | 68.9 | 86 | 131K | 2220 | $0.10 |
| 88 | Qwen3 235B A22B | 80.3 | — | 77.1 | 95.6 | 11.7 | — | — | 68.2 | 87.8 | 131K | 68 | $0.46 |
| 89 | Grok-3 | 80 | — | — | — | 5.1 | — | — | 80 | — | 128K | 100 | $3.00 |
| 90 | Qwen3 | 80 | — | — | — | — | — | — | 80 | — | 128K | — | — |
| 91 | Claude Haiku 4.5 | 80 | — | — | — | 9.7 | — | — | 80 | — | 200K | 100 | $1.00 |
| 92 | Phi 4 Reasoning Plus | 80 | — | — | 79 | — | 84.9 | — | 76 | — | — | — | — |
| 93 | GLM 4.6V | 79.9 | — | — | — | 8.9 | — | — | 79.9 | — | 131K | 44 | $0.30 |
| 94 | Gemini 2.0 Flash Thinking | 79.8 | — | — | — | 7.1 | — | — | 79.8 | — | — | — | $0.00 |
| 95 | Motif-2-12.7B-Reasoning | 79.6 | — | — | — | 8.2 | — | — | 79.6 | — | — | — | $0.00 |
| 96 | GPT-4.1 | 79.6 | 70.8 | — | — | 5.4 | 87.4 | — | 80.6 | 90.2 | 1M | 100 | $2.00 |
| 97 | DeepSeek R1 Distill Llama 70B | 79.5 | — | — | — | 6.1 | — | — | 79.5 | — | 128K | 37 | $0.10 |
| 98 | NVIDIA Nemotron 3 Nano 30B A3B | 79.4 | — | — | — | 10.2 | — | — | 79.4 | — | — | 148 | $0.10 |
| 99 | Ring-flash-2.0 | 79.3 | — | — | — | 8.9 | — | — | 79.3 | — | — | — | $0.10 |
| 100 | Llama 3.1 Nemotron Nano 8B V1 | 79.3 | — | — | — | — | 79.3 | — | — | — | — | — | — |
| 101 | Grok Code Fast 1 | 79.3 | — | — | — | 7.5 | — | — | 79.3 | — | — | — | $0.00 |
| 102 | Qwen3 Omni 30B A3B | 79.2 | — | — | — | 7.3 | — | — | 79.2 | — | — | 102 | $0.30 |
| 103 | Qwen3 VL 32B Instruct | 79.1 | — | — | — | 6.3 | — | — | 79.1 | — | 262K | 76 | $0.10 |
| 104 | Apriel-v1.6-15B-Thinker | 79 | — | — | — | 9.8 | — | — | 79 | — | — | — | $0.00 |
| 105 | Mistral Small 3 24B Instruct | 78.9 | — | — | 87.6 | — | 82.9 | — | 66.3 | — | 32K | 134 | $0.10 |
| 106 | Qwen3 30B A3B | 78.8 | 72.2 | 74.3 | 91 | 6.6 | — | — | 77.7 | — | 131K | 122 | $0.09 |
| 107 | GLM 4.5V | 78.8 | — | — | — | 5.9 | — | — | 78.8 | — | 66K | 85 | $0.60 |
| 108 | Qwen3 Coder 480B A35B Instruct | 78.8 | — | — | — | 4.4 | — | — | 78.8 | — | — | 69 | $0.30 |
| 109 | K2-V2 | 78.6 | — | — | — | 9.8 | — | — | 78.6 | — | — | — | $0.00 |
| 110 | HyperCLOVA X SEED Think | 78.5 | — | — | — | 5.5 | — | — | 78.5 | — | — | — | $0.00 |
| 111 | GPT-5 nano | 78 | — | — | — | 8.7 | — | — | 78 | — | 400K | 500 | $0.05 |
| 112 | QwQ-32B | 77.8 | — | 73.1 | — | 8.2 | 83.9 | — | 76.4 | — | — | 31 | $0.70 |
| 113 | Qwen3 30B A3B 2507 Instruct | 77.7 | — | — | — | 6.8 | — | — | 77.7 | — | — | 122 | $0.20 |
| 114 | Ling-flash-2.0 | 77.7 | — | — | — | 6.3 | — | — | 77.7 | — | — | 91 | $0.10 |
| 115 | Claude 3.5 Sonnet | 77.6 | — | — | — | 3.9 | — | — | 77.6 | 90.4 | 200K | 101 | $3.00 |
| 116 | ERNIE 4.5 300B A47B | 77.6 | — | — | — | 3.5 | — | — | 77.6 | — | 131K | 24 | $0.28 |
| 117 | Qwen3 14B | 77.4 | — | — | — | 4.3 | — | — | 77.4 | — | 132K | 62 | $0.10 |
| 118 | Apriel-v1.5-15B-Thinker | 77.3 | — | — | — | 12 | — | — | 77.3 | — | — | — | $0.00 |
| 119 | Phi 4 Reasoning | 77 | — | — | 73.3 | — | 83.4 | — | 74.3 | — | — | — | — |
| 120 | Llama 3.1 70B Instruct | 77 | — | — | — | 4.6 | 87.5 | — | 66.4 | 83.6 | 131K | 1204 | $0.40 |
| 121 | Magistral Small 1.2 | 76.8 | — | — | — | 6.1 | — | — | 76.8 | — | — | 106 | $0.50 |
| 122 | Qwen3 VL 30B A3B Instruct | 76.4 | — | — | — | 6.4 | — | — | 76.4 | — | 262K | 123 | $0.13 |
| 123 | Gemini 2.0 Flash | 76.4 | — | — | — | 5.3 | — | — | 76.4 | 87 | 1M | 183 | $0.10 |
| 124 | GPT-4.1 Mini | 76.4 | 67 | — | — | 3.7 | 84.1 | — | 78.1 | 87.5 | 1M | 150 | $0.40 |
| 125 | Olmo 3.1 32B Think | 76.3 | — | — | — | 6 | — | — | 76.3 | — | — | — | $0.00 |
| 126 | Qwen2.5 Max | 76.2 | — | — | — | 4.5 | — | — | 76.2 | — | — | 50 | $1.60 |
| 127 | DeepSeek-V2.5 | 76.2 | — | — | 76.2 | — | — | — | — | 80.4 | 8K | 100 | $0.14 |
| 128 | Devstral 2 | 76.2 | — | — | — | 3.6 | — | — | 76.2 | — | 262K | 51 | $0.40 |
| 129 | Mistral Medium 3 | 76 | — | — | — | 4.3 | — | — | 76 | — | 131K | 32 | $0.40 |
| 130 | Qwen3-235B-A22B-Instruct-2507 | 75.9 | 77.5 | — | — | 10.6 | 88.7 | 54.3 | 83 | — | 131K | 63 | $0.15 |
| 131 | Olmo 3 32B Think | 75.9 | — | — | — | 5.9 | — | — | 75.9 | — | 66K | — | $0.15 |
| 132 | NVIDIA Nemotron Nano 12B v2 VL | 75.9 | — | — | — | 5.3 | — | — | 75.9 | — | — | 244 | $0.20 |
| 133 | Gemini 1.5 Pro | 75.8 | — | — | — | 4.9 | — | — | 75.8 | 85.9 | 2M | 85 | $1.25 |
| 134 | Grok-2 | 75.5 | — | — | — | 3.8 | — | — | 75.5 | 87.5 | 128K | 85 | $2.00 |
| 135 | Sonar Pro | 75.5 | — | — | — | 7.9 | — | — | 75.5 | — | 200K | — | $3.00 |
| 136 | Magistral Medium 1 | 75.3 | — | — | — | 9.5 | — | — | 75.3 | — | — | — | $0.00 |
| 137 | Qwen3 VL 8B | 74.9 | — | — | — | 3.3 | — | — | 74.9 | — | — | 120 | $0.20 |
| 138 | gpt-oss-20b | 74.8 | — | — | — | 17.3 | — | — | 74.8 | 85.3 | 131K | 1000 | $0.03 |
| 139 | Magistral Small 1 | 74.6 | — | — | — | 7.2 | — | — | 74.6 | — | — | — | $0.00 |
| 140 | Nova Lite | 74.4 | — | — | — | 4.6 | 89.7 | — | 59 | 80.5 | 300K | 100 | $0.06 |
| 141 | Qwen3 4B 2507 | 74.3 | — | — | — | 5.9 | — | — | 74.3 | — | — | — | $0.00 |
| 142 | Llama 4 Scout | 74.3 | — | — | — | 4.3 | — | — | 74.3 | 79.6 | 10M | 776 | $0.08 |
| 143 | Qwen3 8B | 74.3 | — | — | — | 4.2 | — | — | 74.3 | — | 131K | 69 | $0.05 |
| 144 | o1-mini | 74.2 | — | — | — | 4.9 | — | — | 74.2 | 85.2 | 128K | 115 | $3.00 |
| 145 | NVIDIA Nemotron Nano 9B V2 | 74.2 | — | — | — | 4.6 | — | — | 74.2 | — | — | 129 | $0.00 |
| 146 | DeepSeek R1 Distill Qwen 14B | 74 | — | — | — | 4.4 | — | — | 74 | — | — | — | $0.00 |
| 147 | DeepSeek R1 Distill Qwen 32B | 73.9 | — | — | — | 5.5 | — | — | 73.9 | — | 128K | 37 | $0.12 |
| 148 | DeepSeek R1 0528 Qwen3 8B | 73.9 | — | — | — | 5.6 | — | — | 73.9 | — | — | — | $0.00 |
| 149 | GPT-4.5 | 73.8 | 70.8 | — | — | — | 88.2 | 62.5 | — | 90.8 | 128K | 50 | $75.00 |
| 150 | Nova Premier | 73.3 | — | — | — | 4.7 | — | — | 73.3 | — | — | 40 | $2.50 |
| 151 | Falcon-H1R-7B | 72.5 | — | — | — | 10.8 | — | — | 72.5 | — | — | — | $0.00 |
| 152 | Qwen3 Omni 30B A3B Instruct | 72.5 | — | — | — | 5.1 | — | — | 72.5 | — | — | 103 | $0.30 |
| 153 | Qwen2.5 72B Instruct | 72.2 | — | 52.3 | 81.2 | 4.2 | 84.1 | — | 71.1 | — | 131K | 100 | $0.36 |
| 154 | Grok-2 mini | 72 | — | — | — | — | — | — | 72 | 86.2 | — | — | — |
| 155 | Llama 3.1 Tulu3 405B | 71.6 | — | — | — | 3.5 | — | — | 71.6 | — | — | — | $0.00 |
| 156 | Command A | 71.2 | — | — | — | 11.4 | — | — | 71.2 | — | 256K | 203 | $2.50 |
| 157 | Ministral 8B Instruct | 70.9 | — | — | 70.9 | — | — | — | — | 65 | 128K | 0 | $0.10 |
| 158 | Devstral Medium | 70.8 | — | — | — | 3.8 | — | — | 70.8 | — | 131K | 72 | $0.40 |
| 159 | o3-mini | 70.6 | 79.5 | 84.6 | — | 12.3 | 93.9 | 15 | 80.2 | 86.9 | 200K | 115 | $1.10 |
| 160 | Qwen3 Coder 30B A3B Instruct | 70.6 | — | — | — | 4 | — | — | 70.6 | — | 160K | 97 | $0.07 |
| 161 | Grok | 70.3 | — | — | — | 4.7 | — | — | 70.3 | — | — | — | $0.00 |
| 162 | Nova Micro | 70.2 | — | — | — | 4.7 | 87.2 | — | 53.1 | 77.6 | 128K | 100 | $0.03 |
| 163 | Pixtral Large | 70.1 | — | — | — | 3.6 | — | — | 70.1 | — | 131K | 0 | $2.00 |
| 164 | Qwen3 VL 4B | 70 | — | — | — | 4.4 | — | — | 70 | — | — | — | $0.00 |
| 165 | Mistral Large 2 | 69.7 | — | — | — | 4 | — | — | 69.7 | 84 | 128K | 42 | $2.00 |
| 166 | Kimi K2 Instruct | 69.6 | — | 76.4 | — | 4.7 | 89.8 | 31 | 81.1 | 89.5 | 131K | 45 | $0.57 |
| 167 | Kimi K2-Instruct-0905 | 69.6 | — | 76.4 | — | 4.7 | 89.8 | 31 | 81.1 | 89.5 | — | — | — |
| 168 | Qwen3 4B | 69.6 | — | — | — | 5.1 | — | — | 69.6 | — | — | 103 | $0.10 |
| 169 | Sarvam M | 69.6 | — | — | — | 3.3 | — | — | 69.6 | — | — | 136 | $0.00 |
| 170 | GPT-4 Turbo | 69.4 | — | — | — | 3.3 | — | — | 69.4 | 86.5 | 128K | 100 | $10.00 |
| 171 | Ministral 3 14B | 69.3 | — | — | — | 4.6 | — | — | 69.3 | — | 262K | 67 | $0.20 |
| 172 | Qwen2.5 32B Instruct | 69 | — | — | — | 3.8 | — | — | 69 | 83.3 | — | — | $0.00 |
| 173 | Llama 3.1 Nemotron 70B Instruct | 69 | — | — | — | 4.6 | — | — | 69 | 80.2 | — | 292 | $1.20 |
| 174 | Sonar | 68.9 | — | — | — | 7.3 | — | — | 68.9 | — | 127K | — | $1.00 |
| 175 | Qwen2.5 VL 32B Instruct | 68.8 | — | — | — | — | — | — | 68.8 | 78.4 | — | — | — |
| 176 | Qwen3 VL 8B Instruct | 68.6 | — | — | — | 2.9 | — | — | 68.6 | — | 256K | 145 | $0.08 |
| 177 | Claude 3 Opus | 68.5 | — | — | — | 3.1 | — | — | 68.5 | 86.8 | 200K | 120 | $15.00 |
| 178 | Gemini 2.5 Pro | 68.4 | — | — | — | 17.8 | — | 50.8 | 86 | — | 1M | 85 | $1.25 |
| 179 | Mistral Medium 3.1 | 68.3 | — | — | — | 4.4 | — | — | 68.3 | — | 131K | 47 | $0.40 |
| 180 | Mistral Small 3.2 | 68.1 | — | — | — | 4.3 | — | — | 68.1 | — | — | 100 | $0.10 |
| 181 | Devstral Small 2 | 67.8 | — | — | — | 3.4 | — | — | 67.8 | — | — | 62 | $0.00 |
| 182 | Gemini 1.5 Flash | 67.3 | — | — | — | 4.2 | — | — | 67.3 | 78.9 | 1M | 150 | $0.15 |
| 183 | Qwen3 4B 2507 Instruct | 67.2 | — | — | — | 4.7 | — | — | 67.2 | — | — | — | $0.00 |
| 184 | Llama 3.2 90B Instruct | 67.1 | — | — | — | 4.9 | — | — | 67.1 | 86 | 128K | 100 | $0.35 |
| 185 | Ling-mini-2.0 | 67.1 | — | — | — | 5 | — | — | 67.1 | — | — | — | $0.00 |
| 186 | Reka Flash 3 | 66.9 | — | — | — | 5.1 | — | — | 66.9 | — | 66K | 93 | $0.10 |
| 187 | Gemma 3 27B Instruct | 66.9 | — | — | — | 4.7 | — | — | 66.9 | — | — | — | $0.10 |
| 188 | Granite 3.3 8B Instruct | 66.2 | — | — | 57.6 | — | 74.8 | — | — | 65.5 | — | — | — |
| 189 | Granite 3.3 8B Base | 66.2 | — | — | 57.6 | — | 74.8 | — | — | 63.9 | — | — | — |
| 190 | o1 | 66 | — | 67 | — | 7.7 | — | 47 | 84.1 | 92 | 200K | 66 | $15.00 |
| 191 | Mistral Small 3.1 | 65.9 | — | — | — | 4.8 | — | — | 65.9 | — | — | 134 | $0.10 |
| 192 | GPT-4.1 Nano | 65.8 | 57.2 | — | — | 3.9 | 74.5 | — | 65.7 | 80.1 | 1M | 200 | $0.10 |
| 193 | Olmo 3 7B Think | 65.5 | — | — | — | 5.7 | — | — | 65.5 | — | — | — | $0.00 |
| 194 | Mistral Small 3 | 65.2 | — | — | — | 4.1 | — | — | 65.2 | — | 33K | 136 | $0.05 |
| 195 | Claude 3.5 Haiku | 65 | — | — | — | 3.5 | — | — | 65 | 80.9 | 200K | 104 | $0.80 |
| 196 | QwQ-32B-Preview | 64.8 | — | — | — | 4.8 | — | — | 64.8 | — | 33K | 99 | $0.15 |
| 197 | GPT-4o-mini | 64.8 | — | — | — | 4 | — | — | 64.8 | 82 | 128K | 92 | $0.15 |
| 198 | Qwen2 72B Instruct | 64.4 | — | — | — | 3.7 | — | — | 64.4 | 82.3 | — | — | $0.00 |
| 199 | Llama 3.1 8B Instruct | 64.4 | — | — | — | 5.1 | 80.4 | — | 48.3 | 69.4 | 131K | 2047 | $0.02 |
| 200 | Ministral 3 8B | 64.2 | — | — | — | 4.3 | — | — | 64.2 | — | 262K | 86 | $0.15 |
| 201 | Qwen2.5 14B Instruct | 63.7 | — | — | — | — | — | — | 63.7 | 79.7 | — | — | — |
| 202 | GPT-4o | 63.7 | 60.9 | — | — | 5.3 | 81 | 38.2 | 74.7 | 88.7 | 128K | 132 | $2.50 |
| 203 | Qwen3 VL 4B Instruct | 63.4 | — | — | — | 3.7 | — | — | 63.4 | — | — | — | $0.00 |
| 204 | Qwen2.5 Turbo | 63.3 | — | — | — | 4.2 | — | — | 63.3 | — | — | 67 | $0.10 |
| 205 | Devstral Small | 63.2 | — | — | — | 4 | — | — | 63.2 | — | — | 190 | $0.10 |
| 206 | Granite 4.0 H Small | 62.4 | — | — | — | 3.7 | — | — | 62.4 | — | — | 524 | $0.10 |
| 207 | DeepSeek-V3 | 62.3 | — | — | — | 3.6 | 86.1 | 24.9 | 75.9 | 88.5 | 131K | 100 | $0.23 |
| 208 | Pixtral-12B | 61.3 | — | — | — | — | 61.3 | — | — | 69.2 | 128K | 0 | $0.15 |
| 209 | Mistral Saba | 61.1 | — | — | — | 4.1 | — | — | 61.1 | — | — | — | $0.00 |
| 210 | Jamba 1.5 Large | 59.5 | — | — | 65.4 | 4 | — | — | 53.5 | 81.2 | 256K | 100 | $2.00 |
| 211 | Gemma 3 12B Instruct | 59.5 | — | — | — | 4.8 | — | — | 59.5 | — | — | — | $0.10 |
| 212 | Exaone 4.0 1.2B | 58.8 | — | — | — | 5.8 | — | — | 58.8 | — | — | — | $0.00 |
| 213 | Gemini 1.5 Flash 8B | 58.7 | — | — | — | 4.5 | — | — | 58.7 | — | 1M | 150 | $0.07 |
| 214 | Kimi Linear 48B A3B Instruct | 58.5 | — | — | — | 2.7 | — | — | 58.5 | — | — | — | $0.00 |
| 215 | DeepHermes 3 - Mistral 24B | 58 | — | — | — | 3.9 | — | — | 58 | — | — | — | $0.00 |
| 216 | Jamba Large 1.7 | 57.7 | — | — | — | 3.8 | — | — | 57.7 | — | 256K | 48 | $2.00 |
| 217 | Jamba Reasoning 3B | 57.7 | — | — | — | 4.6 | — | — | 57.7 | — | — | — | $0.00 |
| 218 | Llama 3 70B Instruct | 57.4 | — | — | — | 4.4 | — | — | 57.4 | — | 8K | 45 | $0.51 |
| 219 | Hermes 3 - Llama-3.1 70B | 57.1 | — | — | — | 4.1 | — | — | 57.1 | — | — | 32 | $0.30 |
| 220 | Qwen3 1.7B | 57 | — | — | — | 5.2 | — | — | 57 | — | — | 138 | $0.10 |
| 221 | Claude 3 Sonnet | 56.8 | — | — | — | 3.8 | — | — | 56.8 | 79 | 200K | 120 | $3.00 |
| 222 | Jamba 1.6 Large | 56.5 | — | — | — | 4 | — | — | 56.5 | — | — | 52 | $2.00 |
| 223 | Llama 3.2 3B Instruct | 56.1 | — | — | — | 5.2 | 77.4 | — | 34.7 | 63.4 | 131K | 172 | $0.05 |
| 224 | Gemma 3 27B | 56 | — | — | — | — | 90.4 | 10 | 67.5 | — | 131K | 33 | $0.08 |
| 225 | Mistral Small 3.1 24B Base | 56 | — | — | — | — | — | — | 56 | 81 | 128K | 137 | $0.10 |
| 226 | Llama 3.1 Nemotron Nano 4B v1.1 | 55.6 | — | — | — | 5.1 | — | — | 55.6 | — | — | — | $0.00 |
| 227 | Gemini 2.5 Flash | 55.1 | — | — | — | 11 | — | 26.9 | 83.2 | — | 1M | 85 | $0.30 |
| 228 | Mistral Small 3 24B Base | 54.4 | — | — | — | — | — | — | 54.4 | 80.7 | — | — | — |
| 229 | DeepSeek R1 Distill Llama 8B | 54.3 | — | — | — | 4.2 | — | — | 54.3 | — | — | — | $0.00 |
| 230 | Gemini 2.5 Pro Preview 06-05 | 54 | — | — | — | 21.6 | — | 54 | — | — | 1M | 85 | $1.25 |
| 231 | Qwen2.5 7B Instruct | 53.9 | — | 35.9 | 52 | — | 71.2 | — | 56.3 | — | 131K | 138 | $0.04 |
| 232 | Mixtral 8x22B Instruct | 53.7 | — | — | — | 4.1 | — | — | 53.7 | — | 66K | — | $2.00 |
| 233 | Mistral Small | 52.9 | — | — | — | 4.4 | — | — | 52.9 | — | — | 134 | $0.20 |
| 234 | Ministral 3 3B | 52.4 | — | — | — | 5.3 | — | — | 52.4 | — | 131K | 154 | $0.10 |
| 235 | Kimi K2 Base | 52.3 | — | — | — | — | — | 35.3 | 69.2 | 87.8 | — | — | — |
| 236 | Olmo 3 7B Instruct | 52.2 | — | — | — | 5.8 | — | — | 52.2 | — | — | — | $0.10 |
| 237 | Gemma 3 12B | 51.9 | — | — | — | — | 88.9 | 6.3 | 60.6 | — | 131K | 33 | $0.04 |
| 238 | Phi 4 | 51.9 | — | 47.6 | 75.4 | 4.1 | 63 | 3 | 70.4 | 84.8 | 16K | 33 | $0.07 |
| 239 | Mistral Large | 51.5 | — | — | — | 3.4 | — | — | 51.5 | — | 128K | — | $2.00 |
| 240 | OLMo 2 32B | 51.1 | — | — | — | 3.7 | — | — | 51.1 | — | — | — | $0.00 |
| 241 | Grok-1.5 | 51 | — | — | — | — | — | — | 51 | 81.3 | — | — | — |
| 242 | Gemma 3n E4B Instructed LiteRT Preview | 50.6 | — | — | — | — | — | — | 50.6 | 64.9 | — | — | — |
| 243 | Gemma 3n E4B Instructed | 50.6 | — | — | — | — | — | — | 50.6 | 64.9 | 32K | 42 | $20.00 |
| 244 | LFM2 8B A1B | 50.5 | — | — | — | 4.9 | — | — | 50.5 | — | — | — | $0.00 |
| 245 | Qwen2.5 Coder 32B Instruct | 50.4 | — | — | — | 3.8 | — | — | 50.4 | 75.1 | 128K | 110 | $0.66 |
| 246 | Claude 2.1 | 49.5 | — | — | — | 4.2 | — | — | 49.5 | — | — | — | $0.00 |
| 247 | Mistral Medium | 49.1 | — | — | — | 3.4 | — | — | 49.1 | — | — | 45 | $2.80 |
| 248 | Gemma 3n E4B Instruct | 48.8 | — | — | — | 4.9 | — | — | 48.8 | — | — | 56 | $0.00 |
| 249 | Claude 2 | 48.6 | — | — | — | — | — | — | 48.6 | 78.5 | 100K | — | $0.00 |
| 250 | Phi-4-multimodal-instruct | 48.5 | — | — | — | 4.4 | — | — | 48.5 | — | 128K | 25 | $0.05 |
| 251 | o1-preview | 47.3 | — | 52.3 | — | — | — | 42.4 | — | 90.8 | 128K | 66 | $15.00 |
| 252 | Granite 3.3 8B | 46.8 | — | — | — | 4.2 | — | — | 46.8 | — | — | 376 | $0.00 |
| 253 | Gemini 2.0 Flash Lite | 46.7 | — | — | — | 4.4 | — | 21.7 | 71.6 | — | 1M | 85 | $0.08 |
| 254 | Phi 4 Mini Instruct | 46.5 | — | — | — | 4.2 | — | — | 46.5 | — | 131K | — | $0.08 |
| 255 | Llama 3.2 11B Instruct | 46.4 | — | — | — | 5.2 | — | — | 46.4 | 73 | 128K | 168 | $0.05 |
| 256 | GPT-3.5 Turbo | 46.2 | — | — | — | — | — | — | 46.2 | 70 | 16K | 100 | $0.50 |
| 257 | Gemma 3 4B | 45.9 | — | — | — | — | 90.2 | 4 | 43.6 | — | 131K | 33 | $0.04 |
| 258 | IBM Granite 4.0 Tiny Preview | 44.9 | — | — | 26.7 | — | 63 | — | — | 60.4 | — | — | — |
| 259 | Granite 4.0 Micro | 44.7 | — | — | — | 5.1 | — | — | 44.7 | — | 131K | — | $0.02 |
| 260 | Jamba 1.5 Mini | 44.3 | — | — | 46.1 | 5.1 | — | — | 42.5 | 69.7 | 256K | 100 | $0.20 |
| 261 | Qwen2 7B Instruct | 44.1 | — | — | — | — | — | — | 44.1 | 70.5 | — | — | — |
| 262 | Phi-3 Mini Instruct 3.8B | 43.5 | — | — | — | 4.4 | — | — | 43.5 | — | — | — | $0.00 |
| 263 | Claude Instant | 43.4 | — | — | — | 3.8 | — | — | 43.4 | — | — | — | $0.00 |
| 264 | Gemini 2.5 Flash Lite | 43.3 | — | — | — | 5.1 | — | 10.7 | 75.9 | — | 1M | 6 | $0.10 |
| 265 | Command R+ | 43.2 | — | — | — | 4.8 | — | — | 43.2 | 75.7 | 128K | 100 | $0.15 |
| 266 | Gemini 1.0 Pro | 43.1 | — | — | — | 4.6 | — | — | 43.1 | 71.8 | 33K | 120 | $0.50 |
| 267 | DeepSeek Coder V2 Lite Instruct | 42.9 | — | — | — | 5.3 | — | — | 42.9 | — | — | — | $0.00 |
| 268 | Phi 4 Mini | 42.8 | — | — | 32.8 | — | — | — | 52.8 | 67.3 | — | — | — |
| 269 | LFM 40B | 42.5 | — | — | — | 4.9 | — | — | 42.5 | — | — | — | $0.00 |
| 270 | Phi-3.5-mini-instruct | 42.2 | — | — | 37 | — | — | — | 47.4 | 69 | 128K | 23 | $0.10 |
| 271 | Gemma 3 4B Instruct | 41.7 | — | — | — | 5.2 | — | — | 41.7 | — | — | — | $0.00 |
| 272 | Phi-3.5-MoE-instruct | 41.6 | — | — | 37.9 | — | — | — | 45.3 | 78.9 | — | — | — |
| 273 | Mistral Small 3.2 24B Instruct | 41.4 | — | — | 43.1 | — | — | 12.1 | 69.1 | 80.5 | — | — | — |
| 274 | Llama 2 Chat 13B | 40.6 | — | — | — | 4.7 | — | — | 40.6 | — | — | — | $0.00 |
| 275 | Llama 2 Chat 70B | 40.6 | — | — | — | 5 | — | — | 40.6 | — | — | — | $0.00 |
| 276 | Llama 3 8B Instruct | 40.5 | — | — | — | 5.1 | — | — | 40.5 | — | 8K | 81 | $0.04 |
| 277 | Gemma 3n E2B Instructed LiteRT (Preview) | 40.5 | — | — | — | — | — | — | 40.5 | 60.1 | — | — | — |
| 278 | Gemma 3n E2B Instructed | 40.5 | — | — | — | — | — | — | 40.5 | 60.1 | — | — | — |
| 279 | Qwen2.5-Coder 7B Instruct | 40.1 | — | — | — | 4.8 | — | — | 40.1 | 67.6 | — | — | $0.00 |
| 280 | DBRX Instruct | 39.7 | — | — | — | 6.6 | — | — | 39.7 | — | — | — | $0.00 |
| 281 | Jamba 1.7 Mini | 38.8 | — | — | — | 4.5 | — | — | 38.8 | — | — | — | $0.00 |
| 282 | Mixtral 8x7B Instruct | 38.7 | — | — | — | 4.5 | — | — | 38.7 | — | — | — | $0.50 |
| 283 | Mistral Small 3.1 24B Instruct | 38.6 | — | — | — | — | — | 10.4 | 66.8 | 80.6 | — | — | — |
| 284 | Qwen2.5-Omni-7B | 38.3 | — | 29.6 | — | — | — | — | 47 | — | — | — | — |
| 285 | Gemma 3n E2B Instruct | 37.8 | — | — | — | 4 | — | — | 37.8 | — | — | — | $0.00 |
| 286 | Molmo 7B-D | 37.1 | — | — | — | 5.1 | — | — | 37.1 | — | — | — | $0.00 |
| 287 | Jamba 1.6 Mini | 36.7 | — | — | — | 4.6 | — | — | 36.7 | — | — | 183 | $0.20 |
| 288 | DeepHermes 3 - Llama-3.1 8B | 36.5 | — | — | — | 4.3 | — | — | 36.5 | — | — | — | $0.00 |
| 289 | Qwen3 0.6B | 34.7 | — | — | — | 5.7 | — | — | 34.7 | — | — | 225 | $0.10 |
| 290 | Granite 4.0 1B | 32.5 | — | — | — | 5.1 | — | — | 32.5 | — | — | — | $0.00 |
| 291 | Gemma 3 1B | 32.4 | — | — | — | — | 80.2 | 2.2 | 14.7 | — | — | — | — |
| 292 | OpenChat 3.5 | 31 | — | — | — | 4.8 | — | — | 31 | — | — | — | $0.00 |
| 293 | LFM2 2.6B | 29.8 | — | — | — | 5.2 | — | — | 29.8 | — | — | — | $0.00 |
| 294 | OLMo 2 7B | 28.2 | — | — | — | 5.5 | — | — | 28.2 | — | — | — | $0.00 |
| 295 | Granite 4.0 H 1B | 27.7 | — | — | — | 5 | — | — | 27.7 | — | — | — | $0.00 |
| 296 | DeepSeek R1 Distill Qwen 1.5B | 26.9 | — | — | — | 3.3 | — | — | 26.9 | — | — | — | $0.00 |
| 297 | LFM2 1.2B | 25.7 | — | — | — | 5.7 | — | — | 25.7 | — | — | — | $0.00 |
| 298 | Mistral 7B Instruct | 24.5 | — | — | — | 4.3 | — | — | 24.5 | — | — | 90 | $0.20 |
| 299 | Llama 3.2 1B Instruct | 20 | — | — | — | 5.3 | — | — | 20 | — | 131K | 91 | $0.03 |
| 300 | Llama 2 Chat 7B | 16.4 | — | — | — | 5.8 | — | — | 16.4 | — | — | 113 | $0.10 |
| 301 | Gemma 3 1B Instruct | 13.5 | — | — | — | 5.2 | — | — | 13.5 | — | — | — | $0.00 |
| 302 | Granite 4.0 H 350M | 12.7 | — | — | — | 6.4 | — | — | 12.7 | — | — | — | $0.00 |
| 303 | Granite 4.0 350M | 12.4 | — | — | — | 5.7 | — | — | 12.4 | — | — | — | $0.00 |
| 304 | Gemma 3 270M | 5.5 | — | — | — | 4.2 | — | — | 5.5 | — | — | — | $0.00 |
304 models ranked on General. The intelligence index is a balanced mean of per-category scores; category columns average the benchmarks within each. Scores are curated approximations; see each model for sources.
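The averaging described above can be sketched as a mean over whichever benchmark scores a model has, with missing cells skipped. This is a minimal sketch, not the site's documented method. The names `category_score`, `balanced_index`, and `ASSUMED_GENERAL_BENCHMARKS` are hypothetical. The set of benchmarks counted toward General is an assumption, chosen because it reproduces the published General score for the two sample rows below (o3-mini and Qwen3 32B); under that assumption Humanity’s Last Exam and MMLU are shown in the table but not averaged in.

```python
from statistics import mean

# Assumption: benchmarks that feed the General score. Not documented by the
# source; picked because it reproduces the sample rows used below.
ASSUMED_GENERAL_BENCHMARKS = (
    "Multi-IF", "LiveBench", "Arena Hard", "IFEval", "SimpleQA", "MMLU-Pro",
)


def category_score(scores, benchmarks=ASSUMED_GENERAL_BENCHMARKS):
    """Average the available benchmark scores in one category.

    Missing benchmarks (None or absent) are skipped rather than counted as 0.
    Returns None when the model has no score in the category.
    """
    available = [scores[b] for b in benchmarks if scores.get(b) is not None]
    if not available:
        return None
    return round(mean(available), 1)


def balanced_index(category_scores):
    """Average the per-category scores, giving each category equal weight."""
    available = [s for s in category_scores.values() if s is not None]
    if not available:
        return None
    return round(mean(available), 1)


# Values copied from the o3-mini and Qwen3 32B rows of the table.
o3_mini = {
    "Multi-IF": 79.5, "LiveBench": 84.6, "Arena Hard": None,
    "Humanity's Last Exam": 12.3, "IFEval": 93.9, "SimpleQA": 15,
    "MMLU-Pro": 80.2, "MMLU": 86.9,
}
qwen3_32b = {
    "LiveBench": 74.9, "Arena Hard": 93.8,
    "Humanity's Last Exam": 8.3, "MMLU-Pro": 79.8,
}

print(category_score(o3_mini))    # 70.6, the General score listed for o3-mini
print(category_score(qwen3_32b))  # 82.8, the General score listed for Qwen3 32B
```

Skipping missing cells means a model's General score can rest on a single benchmark, which is worth keeping in mind when comparing rows with very different coverage.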