AI Hub
Leaderboards

Model rankings

A balanced intelligence index averages each model's per-category scores. Drill into a category for individual benchmarks, or sort by speed, price, and context. See what changed → How this is calculated → Embed this leaderboard →

Updated May 25, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Report an error

Top intelligence
Sonar Reasoning Pro
95.7 index
Top reasoning
Claude Opus 4.7
94.2
Top math
Grok-4 Heavy
100
Fastest
Llama 3.3 70B Instruct
2220 tok/s
Cheapest
Ling-2.6-flash
$0.01/M
Longest context
Llama 4 Scout
10M
Best open-weights
DeepSeek V3.2 Speciale
89.9 index

Price vs. intelligence

Intelligence index vs. input price — up and to the left is better value.

1008468523620$0$4$8$12$16$20Input price ($/M tokens)Intelligence indexSonar Reasoning Pro: 95.7 @ $2Sonar Reasoning ProQwen3.7 Max: 92.3 @ $2.5Gemini 3.5 Flash: 92.2 @ $1.5GPT-5.3-Codex: 91.5 @ $1.75Grok 4.20 0309 v2: 91.1 @ $2Claude Opus 4.7: 90.9 @ $5Gemini 3 Flash: 90.2 @ $0.5Gemini 3 FlashGrok 4.3: 90.1 @ $1.25DeepSeek V3.2 Speciale: 89.9 @ $0.287DeepSeek V3.2 SpecialeGPT-5.2-Codex: 89.9 @ $1.75DeepSeek-V4-Flash: 89.4 @ $0.1DeepSeek-V4-FlashQwen3.5 397B A17B: 89.3 @ $0.39Qwen3.5 397B A17BGLM 4.7: 89 @ $0.4GPT-5.1: 89 @ $1.25Qwen3.6 Max: 88.8 @ $1.04Grok 4.20 0309: 88.5 @ $2GPT-5 Pro: 88.4 @ $15GPT-5.1-Codex: 88.2 @ $1.25Qwen3.6 Plus: 88.2 @ $0.325DeepSeek-V4-Pro: 88.2 @ $0.435MiMo-V2-Flash: 88 @ $0.1MiMo-V2-FlashClaude Opus 4.5: 88 @ $5Kimi K2.5: 87.9 @ $0.4GPT-5.4 mini: 87.5 @ $0.75MiniMax M2.7: 87.4 @ $0.279GPT-5 Codex: 87.1 @ $1.25DeepSeek-V3.2: 87.1 @ $0.252MiMo-V2-Pro: 87 @ $1GLM 5.1: 86.8 @ $0.98Hy3: 86.7 @ $0.066MiMo-V2.5-Pro: 86.6 @ $1GPT-5.2: 86.2 @ $1.75Grok-3 Mini: 85.9 @ $0.3Qwen3.5-27B: 85.8 @ $0.195Qwen3.5-122B-A10B: 85.7 @ $0.26Gemma 4 31B: 85.7 @ $0.12Ring-2.6-1T: 85.7 @ $0.075Kimi K2 Thinking: 85.6 @ $0.6MiMo-V2-Omni-0327: 85.5 @ $0.4KAT-Coder-Pro V2: 85.5 @ $0.3MiMo-V2.5: 84.9 @ $0.4MiniMax M2.5: 84.8 @ $0.15GPT-5.1-Codex-Mini: 84.7 @ $0.25GLM 5 Turbo: 84.7 @ $1.2o3 Pro: 84.5 @ $20Qwen3.5-35B-A3B: 84.5 @ $0.139Qwen3 235B A22B 2507: 84.2 @ $0.4Qwen3.6 27B: 84.2 @ $0.3Qwen3.6 35B A3B: 84.1 @ $0.15MiniMax M2.1: 83.6 @ $0.29DeepSeek V3.1 Terminus: 83.5 @ $0.27Gemini 3.1 Pro: 83.2 @ $2Step 3.5 Flash: 83.1 @ $0.09MiMo-V2-Omni: 82.8 @ $0.4Gemini 3 Pro: 82.8 @ $2Qwen3.5 Omni Plus: 82.6 @ $0.4Grok-3: 82.6 @ $3o1-pro: 82.5 @ $150Gemini 3.1 Flash Lite: 82.2 @ $0.25Nova 2 Lite: 82.1 @ $0.3GLM-5: 81.9 @ $0.6KAT-Coder-Pro V1: 81.8 @ $0.3GPT-5.4 nano: 81.7 @ $0.2INTELLECT-3: 81 @ $0.2Nova 2.0 Pro: 80.9 @ $1.3Grok 3 mini Reasoning: 80.9 @ $0.3GLM 5V Turbo: 80.9 @ $1.2Qwen3.5-9B: 80.6 @ $0.04GPT-5: 80.5 @ $1.25Claude Sonnet 4.5: 80.4 @ $3Qwen3-Next-80B-A3B: 80.3 @ $0.5NVIDIA Nemotron 3 Nano 30B A3B: 80.1 @ $0.1NVIDIA Nemotron 3 Super 120B A12B: 80 @ $0.3Qwen3-235B-A22B-Thinking-2507: 79.6 @ $0.3gpt-oss-120b: 79.6 @ $0.039Qwen3 Max: 79.5 @ $0.78Llama Nemotron Super 49B v1.5: 79.4 @ $0.1Claude Opus 4.6: 79.4 @ $5Gemma 4 26B A4B: 79.2 @ $0.06GPT-5 mini: 79.2 @ $0.25Qwen2.5 VL 72B Instruct: 79.1 @ $0.25Seed-OSS-36B-Instruct: 78.8 @ $0.2Grok 4 Fast: 78.7 @ $0.2Qwen3 VL 235B A22B: 78.4 @ $0.8Qwen3 VL 32B: 78.4 @ $0.7o4-mini: 78.4 @ $1.1Llama 3.1 Nemotron Ultra 253B v1: 78.3 @ $0.6Nova 2.0 Omni: 78.2 @ $0.3Grok 4: 78.2 @ $3Magistral Medium 1.2: 78.1 @ $2Nemotron Nano 9B V2: 77.6 @ $0.04Qwen3 Next 80B A3B Thinking: 77.5 @ $0.098Mercury 2: 77 @ $0.25Mistral Small 4: 76.9 @ $0.15Gemini 2.5 Pro Preview 06-05: 76.6 @ $1.25Claude Sonnet 4.6: 76.3 @ $3Qwen3 VL 30B A3B: 76.2 @ $0.2Qwen3 Max Thinking: 76.1 @ $0.78GPT-5.5: 76.1 @ $5MiniMax-M2: 76 @ $0.255Cogito v2.1: 75.8 @ $1.3MiniMax M1 80k: 75.5 @ $0.6Claude Opus 4.1: 75.4 @ $15Claude Haiku 4.5: 75.3 @ $1Trinity Large Thinking: 75.2 @ $0.22Ling-2.6-1T: 75.2 @ $0.075DeepSeek-R1: 75 @ $0.55DeepSeek VL2: 74.9 @ $9.5Qwen3 235B A22B: 74.9 @ $0.455Kimi K2.6: 74.9 @ $0.73GPT-5.4: 74.9 @ $2.5Mistral Medium 3.5: 74.8 @ $1.5Qwen3 30B A3B 2507: 74.7 @ $0.3Claude 3.7 Sonnet: 74.7 @ $3Ring-flash-2.0: 74.6 @ $0.1Claude Sonnet 4: 74.5 @ $3Qwen3.5 Omni Flash: 74.2 @ $0.1Magistral Small 1.2: 73.9 @ $0.5Qwen3 32B: 73.8 @ $0.08Qwen3 Coder Next: 73.7 @ $0.11gpt-oss-20b: 73.6 @ $0.03Kimi K2: 73.6 @ $0.57Hermes 4 - Llama-3.1 405B: 73.5 @ $1Qwen3 Omni 30B A3B: 73.4 @ $0.3Gemini 2.5 Flash: 73.1 @ $0.3GLM-4.5: 73 @ $0.6Solar Pro 3: 72.4 @ $0.15GLM-4.6: 72.4 @ $0.43Gemini 2.5 Flash-Lite: 72.3 @ $0.1Qwen3-235B-A22B-Instruct-2507: 72.2 @ $0.15DeepSeek V3.2 Exp: 72.2 @ $0.27Qwen3 30B A3B: 71.7 @ $0.09Gemini 2.5 Pro: 71.6 @ $1.25o3: 71.6 @ $2Hermes 4 - Llama-3.1 70B: 71.3 @ $0.1GPT-5 nano: 71.2 @ $0.05Kimi K2 0905: 71 @ $0.6Ministral 8B Instruct: 70.9 @ $0.1Qwen3 VL 235B A22B Instruct: 70.9 @ $0.2o1-mini: 70.5 @ $3GLM 4.5 Air: 70.4 @ $0.13Claude 3.5 Sonnet: 70.3 @ $3DeepSeek R1 Distill Llama 70B: 70.1 @ $0.1GLM 4.5V: 70.1 @ $0.6GLM 4.6V: 69.6 @ $0.3Olmo 3 32B Think: 69.5 @ $0.15NVIDIA Nemotron Nano 12B v2 VL: 69.4 @ $0.2Claude Opus 4: 69.4 @ $15Qwen3 30B A3B 2507 Instruct: 69.3 @ $0.2Qwen3 Next 80B A3B Instruct: 68.9 @ $0.09DeepSeek R1 Distill Qwen 32B: 68.4 @ $0.12ERNIE 4.5 300B A47B: 68.2 @ $0.28QwQ-32B: 67.8 @ $0.7Gemini 1.5 Pro: 67.3 @ $1.25Ling-flash-2.0: 66.9 @ $0.1Qwen3 14B: 66.8 @ $0.1Qwen3 Coder 480B A35B Instruct: 66.5 @ $0.3Kimi K2 Instruct: 66.5 @ $0.57Qwen3 VL 30B A3B Instruct: 66.5 @ $0.13Qwen3 VL 32B Instruct: 66.5 @ $0.104Pixtral-12B: 66.1 @ $0.15DeepSeek-V3 0324: 65.9 @ $0.28o1: 65.4 @ $15o3-mini: 64.1 @ $1.1Llama 4 Maverick: 63.9 @ $0.15GPT-4.1: 63.8 @ $2Qwen2.5 Max: 63.6 @ $1.6DeepSeek-V2.5: 63.4 @ $0.14DeepSeek-R1-0528: 63.3 @ $0.55QwQ-32B-Preview: 62.6 @ $0.15Grok-2: 62.4 @ $2Mistral Small 3 24B Instruct: 62.1 @ $0.1Gemini 1.5 Flash: 61.9 @ $0.15Nova Pro: 61.6 @ $0.8MiniMax-M1: 61.5 @ $0.4Llama 3.1 405B Instruct: 60.9 @ $0.89Gemini 2.0 Flash: 60.3 @ $0.1DeepSeek-V3.1: 59.8 @ $0.21GPT-4 Turbo: 59.8 @ $10GPT-4.5: 59.4 @ $75Ling-2.6-flash: 59.3 @ $0.01Qwen2.5 72B Instruct: 59.1 @ $0.36Llama 4 Scout: 58.9 @ $0.08Sonar Pro: 58.8 @ $3Mistral Medium 3: 58.6 @ $0.4Claude 3 Opus: 58.5 @ $15Gemma 3 27B: 58.4 @ $0.08Mistral Large 3: 58.3 @ $0.5GPT-4: 58.3 @ $30GPT-4.1 Mini: 58.2 @ $0.4GLM 4.7 Flash: 58.1 @ $0.06DeepSeek-V3: 58.1 @ $0.229Qwen3 8B: 57.8 @ $0.05Nova Lite: 57.7 @ $0.06Qwen3 Omni 30B A3B Instruct: 57.3 @ $0.3o1-preview: 57.3 @ $15Gemini 2.5 Flash Lite: 57 @ $0.1Sonar: 56.8 @ $1Qwen3 4B: 56.5 @ $0.1GPT-4o: 56.4 @ $2.5Reka Flash 3: 56.2 @ $0.1Llama 3.1 70B Instruct: 56 @ $0.4Command A: 55.9 @ $2.5Gemma 3 12B: 55.5 @ $0.04Qwen3 Coder 30B A3B Instruct: 55.4 @ $0.07Claude 3.5 Haiku: 54.5 @ $0.8Gemini 2.0 Flash Lite: 54.4 @ $0.075Devstral 2: 54.3 @ $0.4Llama 3.2 90B Instruct: 54 @ $0.35Nova Premier: 53.1 @ $2.5Pixtral Large: 53.1 @ $2Reka Flash: 52.9 @ $0.2Mistral Medium 3.1: 51.5 @ $0.4Mistral Small 3.2: 51 @ $0.1Mistral Small 3.1 24B Base: 50.9 @ $0.1Llama 3.3 70B Instruct: 50.6 @ $0.1Qwen2.5 Turbo: 50.3 @ $0.1Qwen2.5 Coder 32B Instruct: 50.1 @ $0.66Qwen3 VL 8B: 49.7 @ $0.2GPT-4o-mini: 49.1 @ $0.15Nova Micro: 49 @ $0.03Gemini 1.5 Flash 8B: 48.4 @ $0.07Ministral 3 14B: 47.9 @ $0.2Mistral Large 2: 47.9 @ $2Devstral Medium: 47.8 @ $0.4Phi 4: 47.6 @ $0.065LFM2-24B-A2B: 47.4 @ $0.03Qwen3 1.7B: 46.9 @ $0.1Nemotron 3 Nano Omni 30B A3B Reasoning: 46.9 @ $0.1Qwen2.5 7B Instruct: 46.6 @ $0.04Phi-4-multimodal-instruct: 46.2 @ $0.05Phi-3.5-mini-instruct: 46 @ $0.1Claude 3 Sonnet: 45.8 @ $3Gemma 3 4B: 45.8 @ $0.04Devstral Small: 45.3 @ $0.1Llama 3.1 8B Instruct: 45 @ $0.02Gemma 3 27B Instruct: 44.5 @ $0.1Llama 3.1 Nemotron 70B Instruct: 43.7 @ $1.2Mistral Small 3: 43.6 @ $0.05Ministral 3 8B: 43.3 @ $0.15Granite 4.1 8B: 43.3 @ $0.05Qwen3 VL 8B Instruct: 43 @ $0.08Jamba 1.5 Large: 42.8 @ $2Jamba 1.6 Large: 42.6 @ $2Hermes 3 - Llama-3.1 70B: 42.5 @ $0.3Mistral Small 3.1: 42.4 @ $0.1GPT-4.1 Nano: 42.1 @ $0.1Llama 3 70B Instruct: 40.8 @ $0.51Mistral Small: 40.3 @ $0.2Gemma 3 12B Instruct: 40 @ $0.1Olmo 3 7B Instruct: 40 @ $0.1Mistral Large: 39.3 @ $2Mixtral 8x22B Instruct: 39.1 @ $2Claude 3 Haiku: 38.9 @ $0.25Llama 3.2 11B Instruct: 36.7 @ $0.05Jamba Large 1.7: 36.5 @ $2Granite 4.0 H Small: 35.7 @ $0.1GPT-3.5 Turbo: 35.2 @ $0.5Gemini 1.0 Pro: 34 @ $0.5Ministral 3 3B: 33.7 @ $0.1Mistral Medium: 33.6 @ $2.8Solar Mini: 33.1 @ $0.2Phi 4 Mini Instruct: 32.6 @ $0.08Llama 3 8B Instruct: 32.4 @ $0.04Llama 3.2 3B Instruct: 30.8 @ $0.051Jamba 1.5 Mini: 29.6 @ $0.2Qwen3 0.6B: 29.3 @ $0.1Command R+: 28.9 @ $0.15Apertus 70B Instruct: 27.2 @ $0.8Mixtral 8x7B Instruct: 26.1 @ $0.5Apertus 8B Instruct: 25.6 @ $0.1Granite 4.0 Micro: 25.6 @ $0.017Jamba 1.6 Mini: 24.9 @ $0.2Gemma 3n E4B Instructed: 24.8 @ $20Mistral 7B Instruct: 14.7 @ $0.2Llama 3.2 1B Instruct: 12.1 @ $0.027Llama 2 Chat 7B: 11.3 @ $0.1

Speed vs. intelligence

Intelligence index vs. output speed — up and to the right is fast and smart.

1008468523620080160240320400Output speed (tokens/s)Intelligence indexQwen3.7 Max: 92.3 @ 203Gemini 3.5 Flash: 92.2 @ 221GPT-5.3-Codex: 91.5 @ 73Grok 4.20 0309 v2: 91.1 @ 105Claude Opus 4.7: 90.9 @ 49Gemini 3 Flash: 90.2 @ 191Grok 4.3: 90.1 @ 88GPT-5.2-Codex: 89.9 @ 106DeepSeek-V4-Flash: 89.4 @ 109Qwen3.5 397B A17B: 89.3 @ 53GLM 4.7: 89 @ 98GPT-5.1: 89 @ 115Qwen3.6 Max: 88.8 @ 36Grok 4.20 0309: 88.5 @ 97GPT-5.1-Codex: 88.2 @ 188Qwen3.6 Plus: 88.2 @ 52DeepSeek-V4-Pro: 88.2 @ 30MiMo-V2-Flash: 88 @ 145Claude Opus 4.5: 88 @ 58Kimi K2.5: 87.9 @ 35GPT-5.4 mini: 87.5 @ 162MiniMax M2.7: 87.4 @ 50GPT-5 Codex: 87.1 @ 180MiMo-V2-Pro: 87 @ 60GLM 5.1: 86.8 @ 53Hy3: 86.7 @ 100MiMo-V2.5-Pro: 86.6 @ 58GPT-5.2: 86.2 @ 73Grok-3 Mini: 85.9 @ 100Qwen3.5-27B: 85.8 @ 91Qwen3.5-122B-A10B: 85.7 @ 129Gemma 4 31B: 85.7 @ 36Ring-2.6-1T: 85.7 @ 120Kimi K2 Thinking: 85.6 @ 100MiMo-V2-Omni-0327: 85.5 @ 110KAT-Coder-Pro V2: 85.5 @ 108MiMo-V2.5: 84.9 @ 92MiniMax M2.5: 84.8 @ 87GPT-5.1-Codex-Mini: 84.7 @ 175o3 Pro: 84.5 @ 25Qwen3.5-35B-A3B: 84.5 @ 121Qwen3 235B A22B 2507: 84.2 @ 59Qwen3.6 27B: 84.2 @ 64Qwen3.6 35B A3B: 84.1 @ 169MiniMax M2.1: 83.6 @ 92Gemini 3.1 Pro: 83.2 @ 142Step 3.5 Flash: 83.1 @ 194MiMo-V2-Omni: 82.8 @ 108Gemini 3 Pro: 82.8 @ 141Qwen3.5 Omni Plus: 82.6 @ 54Step 3.5 Flash 2603: 82.6 @ 197Grok-3: 82.6 @ 100Gemini 3.1 Flash Lite: 82.2 @ 342Nova 2 Lite: 82.1 @ 229GLM-5: 81.9 @ 67KAT-Coder-Pro V1: 81.8 @ 108GPT-5.4 nano: 81.7 @ 157Nova 2.0 Pro: 80.9 @ 149Grok 3 mini Reasoning: 80.9 @ 33Qwen3.5-9B: 80.6 @ 51GPT-5: 80.5 @ 100Claude Sonnet 4.5: 80.4 @ 42Qwen3-Next-80B-A3B: 80.3 @ 147NVIDIA Nemotron 3 Nano 30B A3B: 80.1 @ 148NVIDIA Nemotron 3 Super 120B A12B: 80 @ 211gpt-oss-120b: 79.6 @ 500Qwen3 Max: 79.5 @ 45Llama Nemotron Super 49B v1.5: 79.4 @ 51Claude Opus 4.6: 79.4 @ 48Gemma 4 26B A4B: 79.2 @ 66GPT-5 mini: 79.2 @ 200Seed-OSS-36B-Instruct: 78.8 @ 37Grok 4 Fast: 78.7 @ 90Qwen3 VL 235B A22B: 78.4 @ 34Qwen3 VL 32B: 78.4 @ 93o4-mini: 78.4 @ 115Llama 3.1 Nemotron Ultra 253B v1: 78.3 @ 42Grok 4: 78.2 @ 100Magistral Medium 1.2: 78.1 @ 42Qwen3.5 4B: 77.1 @ 164Mercury 2: 77 @ 790Mercury 2Mistral Small 4: 76.9 @ 145Gemini 2.5 Pro Preview 06-05: 76.6 @ 85Claude Sonnet 4.6: 76.3 @ 75Qwen3 VL 30B A3B: 76.2 @ 122Qwen3 Max Thinking: 76.1 @ 45GPT-5.5: 76.1 @ 67MiniMax-M2: 76 @ 91Cogito v2.1: 75.8 @ 56Claude Opus 4.1: 75.4 @ 120Claude Haiku 4.5: 75.3 @ 100Trinity Large Thinking: 75.2 @ 129DeepSeek-R1: 75 @ 189DeepSeek VL2: 74.9 @ 22Qwen3 235B A22B: 74.9 @ 68Kimi K2.6: 74.9 @ 57GPT-5.4: 74.9 @ 84Mistral Medium 3.5: 74.8 @ 140Qwen3 30B A3B 2507: 74.7 @ 151Claude 3.7 Sonnet: 74.7 @ 101Claude Sonnet 4: 74.5 @ 101Qwen3.5 Omni Flash: 74.2 @ 235Magistral Small 1.2: 73.9 @ 106Sarvam 105B: 73.8 @ 128Qwen3 32B: 73.8 @ 328Qwen3 Coder Next: 73.7 @ 92gpt-oss-20b: 73.6 @ 1000gpt-oss-20bKimi K2: 73.6 @ 26Hermes 4 - Llama-3.1 405B: 73.5 @ 34Qwen3 Omni 30B A3B: 73.4 @ 102Gemini 2.5 Flash: 73.1 @ 85GLM-4.5: 73 @ 85GLM-4.6: 72.4 @ 85Qwen3-235B-A22B-Instruct-2507: 72.2 @ 63DeepSeek V3.2 Exp: 72.2 @ 100Qwen3 30B A3B: 71.7 @ 122Gemini 2.5 Pro: 71.6 @ 85o3: 71.6 @ 50Hermes 4 - Llama-3.1 70B: 71.3 @ 60GPT-5 nano: 71.2 @ 500Kimi K2 0905: 71 @ 16Ministral 8B Instruct: 70.9 @ 0Qwen3 VL 235B A22B Instruct: 70.9 @ 51o1-mini: 70.5 @ 115GLM 4.5 Air: 70.4 @ 63Claude 3.5 Sonnet: 70.3 @ 101DeepSeek R1 Distill Llama 70B: 70.1 @ 37GLM 4.5V: 70.1 @ 85GLM 4.6V: 69.6 @ 44NVIDIA Nemotron Nano 12B v2 VL: 69.4 @ 244Claude Opus 4: 69.4 @ 120Qwen3 30B A3B 2507 Instruct: 69.3 @ 122Qwen3 Next 80B A3B Instruct: 68.9 @ 161DeepSeek R1 Distill Qwen 32B: 68.4 @ 37NVIDIA Nemotron Nano 9B V2: 68.3 @ 129ERNIE 4.5 300B A47B: 68.2 @ 24QwQ-32B: 67.8 @ 31Gemini 1.5 Pro: 67.3 @ 85Ling-flash-2.0: 66.9 @ 91Qwen3 14B: 66.8 @ 62Qwen3 Coder 480B A35B Instruct: 66.5 @ 69Kimi K2 Instruct: 66.5 @ 45Qwen3 VL 30B A3B Instruct: 66.5 @ 123Qwen3 VL 32B Instruct: 66.5 @ 76Pixtral-12B: 66.1 @ 0o1: 65.4 @ 66o3-mini: 64.1 @ 115Llama 4 Maverick: 63.9 @ 639GPT-4.1: 63.8 @ 100Qwen2.5 Max: 63.6 @ 50LongCat Flash Lite: 63.6 @ 110DeepSeek-V2.5: 63.4 @ 100Sarvam 30B: 63.3 @ 214DeepSeek-R1-0528: 63.3 @ 45QwQ-32B-Preview: 62.6 @ 99Grok-2: 62.4 @ 85Mistral Small 3 24B Instruct: 62.1 @ 134Gemini 1.5 Flash: 61.9 @ 150Nova Pro: 61.6 @ 100Llama 3.1 405B Instruct: 60.9 @ 100Gemini 2.0 Flash: 60.3 @ 183GPT-4 Turbo: 59.8 @ 100GPT-4.5: 59.4 @ 50Qwen2.5 72B Instruct: 59.1 @ 100Llama 4 Scout: 58.9 @ 776Llama 4 ScoutMistral Medium 3: 58.6 @ 32Claude 3 Opus: 58.5 @ 120Gemma 3 27B: 58.4 @ 33Mistral Large 3: 58.3 @ 54GPT-4: 58.3 @ 104GPT-4.1 Mini: 58.2 @ 150GLM 4.7 Flash: 58.1 @ 113DeepSeek-V3: 58.1 @ 100Qwen3 8B: 57.8 @ 69Nova Lite: 57.7 @ 100Qwen3 Omni 30B A3B Instruct: 57.3 @ 103o1-preview: 57.3 @ 66Gemini 2.5 Flash Lite: 57 @ 6Qwen3 4B: 56.5 @ 103Sarvam M: 56.4 @ 136GPT-4o: 56.4 @ 132Reka Flash 3: 56.2 @ 93Llama 3.1 70B Instruct: 56 @ 1204Llama 3.1 70B InstructCommand A: 55.9 @ 203Gemma 3 12B: 55.5 @ 33Qwen3 Coder 30B A3B Instruct: 55.4 @ 97Claude 3.5 Haiku: 54.5 @ 104Gemini 2.0 Flash Lite: 54.4 @ 85Devstral 2: 54.3 @ 51Llama 3.2 90B Instruct: 54 @ 100Nova Premier: 53.1 @ 40Pixtral Large: 53.1 @ 0Reka Flash: 52.9 @ 85Mistral Medium 3.1: 51.5 @ 47Mistral Small 3.2: 51 @ 100Mistral Small 3.1 24B Base: 50.9 @ 137Llama 3.3 70B Instruct: 50.6 @ 2220Llama 3.3 70B InstructQwen2.5 Turbo: 50.3 @ 67Qwen2.5 Coder 32B Instruct: 50.1 @ 110Qwen3 VL 8B: 49.7 @ 120GPT-4o-mini: 49.1 @ 92Nova Micro: 49 @ 100Gemini 1.5 Flash 8B: 48.4 @ 150Ministral 3 14B: 47.9 @ 67Mistral Large 2: 47.9 @ 42Devstral Medium: 47.8 @ 72Phi 4: 47.6 @ 33Devstral Small 2: 47.5 @ 62LFM2-24B-A2B: 47.4 @ 208Qwen3 1.7B: 46.9 @ 138Nemotron 3 Nano Omni 30B A3B Reasoning: 46.9 @ 301Qwen2.5 7B Instruct: 46.6 @ 138Phi-4-multimodal-instruct: 46.2 @ 25Phi-3.5-mini-instruct: 46 @ 23Claude 3 Sonnet: 45.8 @ 120Gemma 3 4B: 45.8 @ 33Qwen3.5 2B: 45.6 @ 328Devstral Small: 45.3 @ 190Llama 3.1 8B Instruct: 45 @ 2047Llama 3.1 8B InstructLlama 3.1 Nemotron 70B Instruct: 43.7 @ 292Mistral Small 3: 43.6 @ 136Ministral 3 8B: 43.3 @ 86Granite 4.1 8B: 43.3 @ 133Qwen3 VL 8B Instruct: 43 @ 145Jamba 1.5 Large: 42.8 @ 100Jamba 1.6 Large: 42.6 @ 52Hermes 3 - Llama-3.1 70B: 42.5 @ 32Mistral Small 3.1: 42.4 @ 134GPT-4.1 Nano: 42.1 @ 200Llama 3 70B Instruct: 40.8 @ 45Mistral Small: 40.3 @ 134Claude 3 Haiku: 38.9 @ 104Llama 3.2 11B Instruct: 36.7 @ 168Jamba Large 1.7: 36.5 @ 48Granite 4.0 H Small: 35.7 @ 524GPT-3.5 Turbo: 35.2 @ 100Gemma 3n E4B Instruct: 34.7 @ 56Gemini 1.0 Pro: 34 @ 120Ministral 3 3B: 33.7 @ 154Mistral Medium: 33.6 @ 45Solar Mini: 33.1 @ 63Granite 3.3 8B: 32.5 @ 376Llama 3 8B Instruct: 32.4 @ 81Llama 3.2 3B Instruct: 30.8 @ 172Tiny Aya Global: 30.5 @ 126Jamba 1.5 Mini: 29.6 @ 100Qwen3 0.6B: 29.3 @ 225Command R+: 28.9 @ 100Jamba 1.6 Mini: 24.9 @ 183Gemma 3n E4B Instructed: 24.8 @ 42Qwen3.5 0.8B: 23.6 @ 120Mistral 7B Instruct: 14.7 @ 90Llama 3.2 1B Instruct: 12.1 @ 91Llama 2 Chat 7B: 11.3 @ 113
#ModelReason idxBIG-Bench HardARC-AGI-2DROPGPQA DiamondContextSpeedIn $/M
1Claude Opus 4.794.294.21M49$5.00
2GPT-5.593.593.51.1M67$5.00
3Qwen3.7 Max92.392.31M203$2.50
4Gemini 3.5 Flash92.292.21M221$1.50
5GPT-5.492921.1M84$2.50
6GPT-5.3-Codex91.591.5400K73$1.75
7Grok 4.20 0309 v291.191.1105$2.00
8Kimi K2.691.191.1262K57$0.73
9Gemini 3 Flash90.490.41M191$0.50
10Grok 4.390.190.11M88$1.25
11DeepSeek-V4-Pro90.190.11M30$0.44
12GPT-5.2-Codex89.989.9400K106$1.75
13DeepSeek-V4-Flash89.489.41M109$0.10
14Qwen3.5 397B A17B89.389.3262K53$0.39
15Qwen3.6 Max88.888.8262K36$1.04
16Grok 4.20 030988.588.597$2.00
17Grok-4 Heavy88.488.4
18Muse Spark88.488.4$0.00
19GPT-5 Pro88.488.4400K$15.00
20Qwen3.6 Plus88.288.21M52$0.33
21GPT-5.188.188.1400K115$1.25
22Kimi K2.587.987.9262K35$0.40
23GPT-5.4 mini87.587.5400K162$0.75
24MiniMax M2.787.487.4205K50$0.28
25GPT-587.387.3400K100$1.25
26DeepSeek V3.2 Speciale87.187.1164K$0.29
27Claude Opus 4.58787200K58$5.00
28MiMo-V2-Pro87871M60$1.00
29GLM 5.186.886.8203K53$0.98
30Hy386.786.7262K100$0.07
31MiMo-V2.5-Pro86.686.61M58$1.00
32Gemini 2.5 Pro Preview 06-0586.486.41M85$1.25
33Qwen3 Max Thinking86.186.1262K45$0.78
34GPT-5.1-Codex8686400K188$1.25
35GLM-58686203K67$0.60
36GLM 4.785.985.9203K98$0.40
37Qwen3.5-27B85.885.8262K91$0.20
38Qwen3.5-122B-A10B85.785.7262K129$0.26
39Gemma 4 31B85.785.7262K36$0.12
40Ring-2.6-1T85.785.7262K120$0.08
41Gemini 3.1 Pro85.777.194.31M142$2.00
42Grok 4 Fast85.785.72M90$0.20
43MiMo-V2-Omni-032785.585.5110$0.40
44KAT-Coder-Pro V285.585.5256K108$0.30
45Grok 4.1 Fast85.385.3$0.00
46Nanbeige4.1-3B84.984.9$0.00
47MiMo-V2.584.984.91M92$0.40
48MiniMax M2.584.884.8205K87$0.15
49Claude 3.7 Sonnet84.884.8200K101$3.00
50GLM 5 Turbo84.784.7203K$1.20
51MiMo-V2-Flash84.684.6262K145$0.10
52Grok-384.684.6128K100$3.00
53Kimi K2 Thinking84.584.5262K100$0.60
54o3 Pro84.584.5200K25$20.00
55Qwen3.5-35B-A3B84.584.5262K121$0.14
56DeepSeek-V2.584.384.38K100$0.14
57Qwen3.6 27B84.284.2262K64$0.30
58Qwen3.6 35B A3B84.184.1262K169$0.15
59DeepSeek-V3.28484131K$0.25
60Grok-3 Mini8484128K100$0.30
61GPT-5 Codex83.783.7400K180$1.25
62Claude Sonnet 4.583.483.41M42$3.00
63Step 3.5 Flash83.183.1262K194$0.09
64MiniMax M2.18383205K92$0.29
65JT-35B-Flash82.982.9$0.00
66MiMo-V2-Omni82.882.8262K108$0.40
67Gemini 2.5 Flash82.882.81M85$0.30
68Qwen3.5 Omni Plus82.682.654$0.40
69Step 3.5 Flash 260382.682.6197$0.00
70Claude 3.5 Sonnet82.593.187.167.2200K101$3.00
71GPT-5 mini82.382.3400K200$0.25
72Gemini 3.1 Flash Lite82.282.21M342$0.25
73GPT-5.4 nano81.781.7400K157$0.20
74o4-mini81.481.4200K115$1.10
75GPT-5.1-Codex-Mini81.381.3400K175$0.25
76Nova 2 Lite81.181.11M229$0.30
77Qwen3-235B-A22B-Thinking-250781.181.1256K$0.30
78ERNIE 4.5 300B A47B81.181.1131K24$0.28
79GLM-4.68181203K85$0.43
80DeepSeek-R1-05288181131K45$0.55
81GLM 5V Turbo80.980.9203K$1.20
82gpt-oss-120b80.980.9131K500$0.04
83Claude Opus 4.180.980.9200K120$15.00
84Qwen3.5-9B80.680.6262K51$0.04
85Claude Opus 4.680.168.891.31M48$5.00
86NVIDIA Nemotron 3 Super 120B A12B8080211$0.30
87DeepSeek V3.2 Exp79.979.9164K100$0.27
88EXAONE 4.5 33B79.479.4$0.00
89Gemini 2.5 Flash79.379.3$0.00
90DeepSeek V3.1 Terminus79.279.2164K$0.27
91Gemma 4 26B A4B79.279.2262K66$0.06
92Grok 3 mini Reasoning79.179.133$0.30
93GLM-4.579.179.1131K85$0.60
94Qwen3 235B A22B 2507797959$0.40
95o1-pro7979200K$150.00
96Nova 2.0 Pro78.578.5149$1.30
97K-EXAONE78.378.3$0.00
98o17878200K66$15.00
99ERNIE 5.0 Thinking77.777.7$0.00
100MiniMax-M277.777.7205K91$0.26
101Qwen3-235B-A22B-Instruct-250777.577.5131K63$0.15
102Ring-1T77.477.4$0.00
103Qwen3 VL 235B A22B77.277.234$0.80
104Qwen3 Next 80B A3B Thinking77.277.2262K$0.10
105o3-mini77.277.2200K115$1.10
106Qwen3.5 4B77.177.1164$0.00
107Mercury 27777128K790$0.25
108Mistral Small 476.976.9262K145$0.15
109Cogito v2.176.876.856$1.30
110Kimi K276.676.6131K26$0.57
111KAT-Coder-Pro V176.476.4108$0.30
112Qwen3 Max76.476.4262K45$0.78
113Doubao Seed Code76.476.4$0.00
114INTELLECT-376.176.1131K$0.20
115Command A76.176.1256K203$2.50
116Llama 3.1 Nemotron Ultra 253B v1767642$0.60
117Nova 2.0 Omni7676$0.30
118Qwen3-Next-80B-A3B75.975.9262K147$0.50
119Nemotron Cascade 2 30B A3B75.875.8$0.00
120Kimi K2 090575.875.8262K16$0.60
121NVIDIA Nemotron 3 Nano 30B A3B75.775.7148$0.10
122Claude Sonnet 475.475.41M101$3.00
123DeepSeek-V375.491.659.1131K100$0.23
124Trinity Large Thinking75.275.2262K129$0.22
125Ling-2.6-1T75.275.2262K$0.08
126Kimi K2 Instruct75.175.1131K45$0.57
127Kimi K2-Instruct-090575.175.1
128GLM 4.5 Air7575131K63$0.13
129DeepSeek-V3.174.974.9164K$0.21
130Llama Nemotron Super 49B v1.574.874.851$0.10
131Mistral Medium 3.574.874.8262K140$1.50
132Gemini 1.5 Pro74.489.274.959.12M85$1.25
133Qwen3.5 Omni Flash74.274.2235$0.10
134Gemini 2.0 Flash Thinking74.274.2$0.00
135EXAONE 4.0 32B73.973.9$0.00
136Magistral Medium 1.273.973.942$2.00
137Sarvam 105B73.873.8128$0.00
138Qwen3 Coder Next73.773.7262K92$0.11
139Claude 3 Opus73.486.883.150.4200K120$15.00
140Apriel-v1.6-15B-Thinker73.373.3$0.00
141Qwen3 VL 32B73.373.393$0.70
142DeepSeek R1 Zero73.373.3
143o1-preview73.373.3128K66$15.00
144Nova Pro73.186.985.446.9300K100$0.80
145Claude Haiku 4.57373200K100$1.00
146Claude Sonnet 4.672.958.387.51M75$3.00
147Qwen3 Next 80B A3B Instruct72.972.9262K161$0.09
148GPT-5.272.752.992.4400K73$1.75
149Hermes 4 - Llama-3.1 405B72.772.734$1.00
150Grok Code Fast 172.772.7$0.00
151Seed-OSS-36B-Instruct72.672.637$0.20
152Qwen3 Omni 30B A3B72.672.6102$0.30
153Ring-flash-2.072.572.5$0.10
154Solar Pro 372.472.4128K$0.15
155Mi:dm K 2.5 Pro72.272.2$0.00
156Qwen3 VL 30B A3B7272122$0.20
157Ling-1T71.971.9$0.00
158GLM 4.6V71.971.9131K44$0.30
159DeepSeek-R171.571.5128K189$0.55
160gpt-oss-20b71.571.5131K1000$0.03
161GPT-4.571.471.4128K50$75.00
162Apriel-v1.5-15B-Thinker71.371.3$0.00
163K2 Think V271.371.3$0.00
164GPT-5 nano71.271.2400K500$0.05
165Qwen3 VL 235B A22B Instruct71.271.2262K51$0.20
166Gemini 2.5 Flash-Lite70.970.9$0.10
167Magistral Medium70.870.8
168Qwen3 30B A3B 250770.770.7151$0.30
169GPT-4o70.170.1128K132$2.50
170Hermes 4 - Llama-3.1 70B69.969.960$0.10
171Llama 4 Maverick69.869.81M639$0.15
172MiniMax M1 80k69.769.7$0.60
173Motif-2-12.7B-Reasoning69.569.5$0.00
174Gemini 3 Deep Think69.545.193.81M$0.00
175Qwen3 VL 30B A3B Instruct69.569.5262K123$0.13
176Step3 VL 10B6969$0.00
177Phi 4 Reasoning Plus68.968.9
178Solar Pro 268.768.7$0.00
179GLM 4.5V68.468.466K85$0.60
180DeepSeek-V3 032468.468.4164K$0.28
181Gemini 1.5 Flash68.385.5511M150$0.15
182Qwen3 235B A22B68.288.947.5131K68$0.46
183MiniMax M1 40k68.268.2$0.00
184Magistral Small 250668.268.2
185Nova Lite68.282.480.242300K100$0.06
186K2-V268.168.1$0.00
187Mistral Large 36868262K54$0.50
188Magistral Medium 167.967.9$0.00
189Llama 3.1 405B Instruct67.884.850.7128K100$0.89
190JT-MINI67.667.6$0.00
191Claude 3 Sonnet67.482.978.940.4200K120$3.00
192Qwen3 VL 32B Instruct67.167.1262K76$0.10
193Qwen2.5 32B Instruct6784.549.5$0.00
194GPT-4 Turbo678648128K100$10.00
195Qwen3 32B66.866.8131K328$0.08
196Qwen3 4B 250766.766.7$0.00
197Llama-3.3 Nemotron Super 49B v166.766.7$0.00
198Magistral Small 1.266.366.3106$0.50
199GPT-4.166.366.31M100$2.00
200Nova Micro66.379.579.340128K100$0.03
201Falcon-H1R-7B66.166.1$0.00
202Qwen3 30B A3B 2507 Instruct65.965.9122$0.20
203Qwen365.865.8128K
204Qwen3 30B A3B65.865.8131K122$0.09
205Phi 4 Reasoning65.865.8
206Phi 465.875.556.116K33$0.07
207Ling-flash-2.065.765.791$0.10
208Solar Open 100B65.765.7$0.00
209DeepSeek R1 Distill Llama 70B65.265.2128K37$0.10
210QwQ-32B65.265.231$0.70
211QwQ-32B-Preview65.265.233K99$0.15
212Gemma 3 27B6587.642.4131K33$0.08
213GPT-4.1 Mini65651M150$0.40
214Gemini 2.5 Flash Lite64.664.61M6$0.10
215Granite 3.3 8B Instruct64.369.159.4
216Magistral Small 164.164.1$0.00
217Nemotron Nano 9B V26464131K$0.04
218LongCat Flash Lite63.663.6110$0.00
219Sarvam 30B63.363.3214$0.00
220Gemma 3 12B63.385.740.9131K33$0.04
221Qwen2 72B Instruct62.482.442.4$0.00
222Claude 3.5 Haiku62.483.141.6200K104$0.80
223Sonar Reasoning62.362.3$0.00
224Gemini 2.0 Pro62.262.2$0.00
225DeepSeek R1 Distill Qwen 32B62.162.1128K37$0.12
226Gemini 2.0 Flash62.162.11M183$0.10
227Qwen3 Omni 30B A3B Instruct6262103$0.30
228Qwen2.5 14B Instruct61.978.245.5
229Qwen3 Coder 480B A35B Instruct61.861.869$0.30
230Claude 3 Haiku61.873.778.433.3200K104$0.25
231Gemini 3 Pro61.531.191.91M141$2.00
232HyperCLOVA X SEED Think61.561.5$0.00
233DeepSeek R1 0528 Qwen3 8B61.261.2$0.00
234Olmo 3 32B Think616166K$0.15
235Llama 3.1 70B Instruct60.779.641.7131K1204$0.40
236Qwen3 14B60.460.4132K62$0.10
237Tri-21B-Think60.160.1$0.00
238o1-mini6060128K115$3.00
239GPT-4o-mini6079.740.2128K92$0.15
240Devstral 259.459.4262K51$0.40
241Ling-2.6-flash59.359.3262K$0.01
242Olmo 3.1 32B Think59.159.1$0.00
243DeepSeek R1 Distill Qwen 14B59.159.1$0.00
244Qwen3 8B58.958.9131K69$0.05
245Mistral Medium 3.158.858.8131K47$0.40
246Qwen2.5 Max58.758.750$1.60
247GPT-458.380.935.78K104$30.00
248GLM 4.7 Flash58.158.1203K113$0.06
249Phi-3.5-MoE-instruct5879.136.8
250Qwen3 VL 8B57.957.9120$0.20
251Sonar Pro57.857.8200K$3.00
252Mistral Medium 357.857.8131K32$0.40
253Gemma 4 E4B57.657.6$0.00
254NVIDIA Nemotron Nano 12B v2 VL57.257.2244$0.20
255Llama 4 Scout57.257.210M776$0.08
256Ministral 3 14B57.257.2262K67$0.20
257NVIDIA Nemotron Nano 9B V25757129$0.00
258Gemma 3n E4B56.952.960.8
259Nova Premier56.956.940$2.50
260Ling-mini-2.056.256.2$0.00
261Grok-25656128K85$2.00
262Llama 3.1 Nemotron Nano 8B V154.154.1
263Olmo 3.1 32B Instruct53.953.9$0.00
264Devstral Small 253.253.262$0.00
265Reka Flash 352.952.966K93$0.10
266Granite 3.3 8B Base52.669.136.1
267Qwen3 4B52.252.2103$0.10
268Phi 4 Mini Reasoning5252
269Grok 451.715.987.5256K100$3.00
270Qwen3 4B 2507 Instruct51.751.7$0.00
271Olmo 3 7B Think51.651.6$0.00
272Llama 3.1 Tulu3 405B51.651.6$0.00
273Qwen3 Coder 30B A3B Instruct51.651.6160K97$0.07
274Gemini 2.0 Flash Lite51.551.51M85$0.08
275Exaone 4.0 1.2B51.551.5$0.00
276Gemma 3 4B51.572.230.8131K33$0.04
277NVIDIA Nemotron 3 Nano 4B51.351.3$0.00
278Grok-2 mini5151
279IBM Granite 4.0 Tiny Preview5155.746.2
280Pixtral Large50.550.5131K0$2.00
281Mistral Small 3.250.550.5100$0.10
282Llama 3.3 70B Instruct50.550.5131K2220$0.10
283GPT-3.5 Turbo50.570.230.816K100$0.50
284GPT-4.1 Nano50.350.31M200$0.10
285Phi-3.5-mini-instruct49.76930.4128K23$0.10
286Qwen3 VL 4B49.449.4$0.00
287Devstral Medium49.249.2131K72$0.40
288DeepSeek R1 Distill Qwen 7B49.149.1
289Gemma 3n E2B49.144.353.9
290Qwen2.5 72B Instruct4949131K100$0.36
291DeepSeek R1 Distill Llama 8B4949$0.00
292Mistral Large 248.648.6128K42$2.00
293Kimi K2 Base48.148.1
294Granite 4.1 30B48.148.1$0.00
295Phi 4 Mini47.870.425.2
296LFM2-24B-A2B47.447.4128K208$0.03
297o347.16.587.7200K50$2.00
298Sonar47.147.1127K$1.00
299Grok47.147.1$0.00
300Ministral 3 8B47.147.1262K86$0.15
301Nemotron 3 Nano Omni 30B A3B Reasoning46.946.9301$0.10
302Llama 3.2 90B Instruct46.746.7128K100$0.35
303Llama 3.1 Nemotron 70B Instruct46.546.5292$1.20
304Mistral Small 346.246.233K136$0.05
305Mistral Small 3.2 24B Instruct46.146.1
306Qwen2.5 VL 32B Instruct4646
307Mistral Small 3.1 24B Instruct4646
308Gemma 3n E4B Instructed LiteRT Preview45.852.960.823.7
309Qwen3.5 2B45.645.6328$0.00
310Mistral Small 3.145.445.4134$0.10
311Mistral Small 3 24B Instruct45.345.332K134$0.10
312Llama 3.1 8B Instruct4559.530.4131K2047$0.02
313Gemini 2.5 Pro44.54.9841M85$1.25
314Claude Opus 444.18.679.6200K120$15.00
315Devstral Small43.443.4190$0.10
316Gemma 4 E2B43.343.3$0.00
317Granite 4.1 8B43.343.3131K133$0.05
318Gemma 3 27B Instruct42.842.8$0.10
319Qwen3 VL 8B Instruct42.742.7256K145$0.08
320Molmo2-8B42.542.5$0.00
321Mistral Saba42.442.4$0.00
322Qwen2.5 Coder 32B Instruct41.741.7128K110$0.66
323Sarvam M41.641.6136$0.00
324Granite 4.0 H Small41.641.6524$0.10
325Kimi Linear 48B A3B Instruct41.241.2$0.00
326Qwen2.5 Turbo414167$0.10
327Gemma 3n E2B Instructed LiteRT (Preview)4144.353.924.8
328Llama 3.1 Nemotron Nano 4B v1.140.840.8$0.00
329Gemini Diffusion40.440.4
330Hermes 3 - Llama-3.1 70B40.140.132$0.30
331Olmo 3 7B Instruct4040$0.10
332Jamba Large 1.73939256K48$2.00
333Jamba 1.6 Large38.738.752$2.00
334Gemini 1.5 Flash 8B38.438.41M150$0.07
335DeepHermes 3 - Mistral 24B38.238.2$0.00
336Mistral Small38.138.1134$0.20
337Llama 3 70B Instruct37.937.98K45$0.51
338Mistral Small 3.1 24B Base37.537.5128K137$0.10
339Qwen3 VL 4B Instruct37.137.1$0.00
340Jamba 1.5 Large36.936.9256K100$2.00
341Qwen2.5 7B Instruct36.436.4131K138$0.04
342Grok-1.535.935.9
343Ministral 3 3B35.835.8131K154$0.10
344Qwen3 1.7B35.635.6138$0.10
345Mistral Large35.135.1128K$2.00
346Gemma 3 12B Instruct34.934.9$0.10
347Mistral Medium34.934.945$2.80
348Mistral Small 3 24B Base34.434.4
349Claude 234.434.4100K$0.00
350LFM2 8B A1B34.434.4$0.00
351Qwen2.5-Coder 7B Instruct33.933.9$0.00
352LFM2.5-1.2B-Thinking33.933.9$0.00
353DeepSeek R1 Distill Qwen 1.5B33.833.8$0.00
354Granite 3.3 8B33.833.8376$0.00
355Granite 4.0 Micro33.633.6131K$0.02
356Jamba Reasoning 3B33.333.3$0.00
357Mixtral 8x22B Instruct33.233.266K$2.00
358Phi 4 Mini Instruct33.133.1131K$0.08
359DBRX Instruct33.133.1$0.00
360Claude Instant3333$0.00
361Llama 3.2 11B Instruct32.832.8128K168$0.05
362Llama 3.2 3B Instruct32.832.8131K172$0.05
363OLMo 2 32B32.832.8$0.00
364LFM 40B32.732.7$0.00
365Llama 2 Chat 70B32.732.7$0.00
366LFM2.5-1.2B-Instruct32.632.6$0.00
367Jamba 1.5 Mini32.332.3256K100$0.20
368Command R+32.332.3128K100$0.15
369Jamba 1.7 Mini32.232.2$0.00
370Llama 2 Chat 13B32.132.1$0.00
371Claude 2.131.931.9$0.00
372DeepSeek Coder V2 Lite Instruct31.931.9$0.00
373Phi-3 Mini Instruct 3.8B31.931.9$0.00
374Phi-4-multimodal-instruct31.531.5128K25$0.05
375Granite 4.1 3B31.431.4$0.00
376Qwen2.5-Omni-7B30.830.8
377LFM2 2.6B30.630.6$0.00
378Tiny Aya Global30.530.5126$0.00
379MiniCPM-V 4.6 1.3B30.530.5$0.00
380Jamba 1.6 Mini3030183$0.20
381Gemma 3n E4B Instruct29.629.656$0.00
382Llama 3 8B Instruct29.629.68K81$0.04
383Mixtral 8x7B Instruct29.229.2$0.50
384Gemma 3 1B29.239.119.2
385Gemma 3 4B Instruct29.129.1$0.00
386Qwen1.5 Chat 110B28.928.9$0.00
387LFM2.5-VL-1.6B28.928.9$0.00
388OLMo 2 7B28.828.8$0.00
389Granite 4.0 1B28.128.1$0.00
390Gemini 1.0 Pro27.927.933K120$0.50
391Apertus 70B Instruct27.227.2$0.80
392DeepHermes 3 - Llama-3.1 8B2727$0.00
393MiniCPM5-1B26.926.9$0.00
394Granite 4.0 H 1B26.326.3$0.00
395Granite 4.0 350M26.126.1$0.00
396Granite 4.0 H 350M25.725.7$0.00
397Apertus 8B Instruct25.625.6$0.10
398Qwen2 7B Instruct25.325.3
399Gemma 3n E2B Instructed24.824.8
400Molmo 7B-D2424$0.00
401Qwen3 0.6B23.923.9225$0.10
402Gemma 3n E4B Instructed23.723.732K42$20.00
403Gemma 3 1B Instruct23.723.7$0.00
404Qwen3.5 0.8B23.623.6120$0.00
405OpenChat 3.52323$0.00
406Gemma 3n E2B Instruct22.922.9$0.00
407LFM2 1.2B22.822.8$0.00
408Llama 2 Chat 7B22.722.7113$0.10
409Gemma 3 270M22.422.4$0.00
410Llama 3.2 1B Instruct19.619.6131K91$0.03
411Mistral 7B Instruct17.717.790$0.20

411 models ranked on Reasoning. The intelligence index is a balanced mean of per-category scores; category columns average the benchmarks within each. Scores are curated approximations — see each model for sources. Click any column to sort.