Leaderboards

Model rankings

A balanced intelligence index averages each model's per-category scores. Drill into a category for individual benchmarks, or sort by speed, price, and context. See what changed → How this is calculated → Embed this leaderboard →

Updated May 25, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Report an error

Top intelligence

Sonar Reasoning Pro

95.7 index

Top reasoning

Claude Opus 4.7

94.2

Top coding

DeepSeek V3.2 Speciale

89.6

Top math

Grok-4 Heavy

100

Fastest

Llama 3.3 70B Instruct

2220 tok/s

Cheapest

Ling-2.6-flash

$0.01/M

Longest context

Llama 4 Scout

10M

Best open-weights

DeepSeek V3.2 Speciale

89.9 index

Price vs. intelligence

Intelligence index vs. input price — up and to the left is better value.

Speed vs. intelligence

Intelligence index vs. output speed — up and to the right is fast and smart.

Overview Reasoning Coding Math Agents Multimodal General Long Context

#	Model	Math idx ↓	MATH-500	FrontierMath	HMMT 2025	GSM8K	MGSM	AIME 2024	AIME 2025	MATH	Context	Speed	In $/M
1	Grok-4 HeavyxAI	100	—	—	—	—	—	—	100	—	—	—	—
2	GPT-5.2OpenAI	100	—	—	—	—	—	—	100	—	400K	73	$1.75
3	GPT-5 CodexOpenAI	98.7	—	—	—	—	—	—	98.7	—	400K	180	$1.25
4	Gemini 3 FlashGoogle	97	—	—	—	—	—	—	97	—	1M	191	$0.50
5	DeepSeek V3.2 SpecialeDeepSeek	96.7	—	—	—	—	—	—	96.7	—	164K	—	$0.29
6	MiMo-V2-FlashXiaomi	96.3	—	—	—	—	—	—	96.3	—	262K	145	$0.10
7	Claude Haiku 4.5Anthropic	96.3	—	—	—	—	—	—	96.3	—	200K	100	$1.00
8	Sonar Reasoning ProPerplexity	95.7	95.7	—	—	—	—	—	—	—	128K	—	$2.00
9	GPT-5.1-CodexOpenAI	95.7	—	—	—	—	—	—	95.7	—	400K	188	$1.25
10	Gemini 3 ProGoogle	95.7	—	—	—	—	—	—	95.7	—	1M	141	$2.00
11	R1 1776Perplexity	95.4	95.4	—	—	—	—	—	—	—	—	—	$0.00
12	Grok 4xAI	95.4	99	—	—	—	—	—	91.7	—	256K	100	$3.00
13	GLM 4.7Zhipu AI	95	—	—	—	—	—	—	95	—	203K	98	$0.40
14	o4-miniOpenAI	95	98.9	—	—	—	—	93.4	92.7	—	200K	115	$1.10
15	Kimi K2 ThinkingMoonshot AI	94.7	—	—	—	—	—	—	94.7	—	262K	100	$0.60
16	Qwen3 235B A22B 2507Alibaba	94.7	98.4	—	—	—	—	—	91	—	—	59	$0.40
17	KAT-Coder-Pro V1Kuaishou	94.7	—	—	—	—	—	—	94.7	—	—	108	$0.30
18	Phi 4 Mini ReasoningMicrosoft	94.6	94.6	—	—	—	—	—	—	—	—	—	—
19	Nova 2 LiteAmazon	94.3	—	—	—	—	—	—	94.3	—	1M	229	$0.30
20	GPT-5.1OpenAI	94	—	—	—	—	—	—	94	—	400K	115	$1.25
21	GLM-4.6Zhipu AI	93.9	—	—	—	—	—	—	93.9	—	203K	85	$0.43
22	gpt-oss-120bOpenAI	93.4	—	—	—	—	—	—	93.4	—	131K	500	$0.04
23	Grok-3 MinixAI	93.3	—	—	—	—	—	95.8	90.8	—	128K	100	$0.30
24	Grok 4 FastxAI	92.7	—	—	93.3	—	—	—	92	—	2M	90	$0.20
25	Qwen3-235B-A22B-Thinking-2507Alibaba	92.3	—	—	—	—	—	—	92.3	—	256K	—	$0.30
26	Gemini 2.0 ProGoogle	92.3	92.3	—	—	—	—	—	—	—	—	—	$0.00
27	Gemini 2.5 ProGoogle	92.2	96.7	—	—	—	—	92	88	92	1M	85	$1.25
28	Sonar ReasoningPerplexity	92.1	92.1	—	—	—	—	—	—	—	—	—	$0.00
29	DeepSeek-V3.2DeepSeek	92	—	—	—	—	—	—	92	—	131K	—	$0.25
30	Grok 3 mini ReasoningxAI	92	99.2	—	—	—	—	—	84.7	—	—	33	$0.30
31	GPT-5.1-Codex-MiniOpenAI	91.7	—	—	—	—	—	—	91.7	—	400K	175	$0.25
32	Claude Opus 4.5Anthropic	91.3	—	—	—	—	—	—	91.3	—	200K	58	$5.00
33	DeepSeek R1 ZeroDeepSeek	91.3	95.9	—	—	—	—	86.7	—	—	—	—	—
34	Grok-3xAI	91.2	87	—	—	—	—	93.3	93.3	—	128K	100	$3.00
35	NVIDIA Nemotron 3 Nano 30B A3BNVIDIA	91	—	—	—	—	—	—	91	—	—	148	$0.10
36	K-EXAONELG AI Research	90.3	—	—	—	—	—	—	90.3	—	—	—	$0.00
37	o1-miniOpenAI	90	90	—	—	—	—	—	—	—	128K	115	$3.00
38	DeepSeek V3.1 TerminusDeepSeek	89.7	—	—	—	—	—	—	89.7	—	164K	—	$0.27
39	Nova 2.0 OmniAmazon	89.7	—	—	—	—	—	—	89.7	—	—	—	$0.30
40	GLM 4.5 AirZhipu AI	89.4	98.1	—	—	—	—	89.4	80.7	—	131K	63	$0.13
41	Grok 4.1 FastxAI	89.3	—	—	—	—	—	—	89.3	—	—	—	$0.00
42	Ring-1TInclusionAI	89.3	—	—	—	—	—	—	89.3	—	—	—	$0.00
43	gpt-oss-20bOpenAI	89.3	—	—	—	—	—	—	89.3	—	131K	1000	$0.03
44	DeepSeek-R1-0528DeepSeek	89.2	98.3	—	79.4	—	—	91.4	87.5	—	131K	45	$0.55
45	Nova 2.0 ProAmazon	89	—	—	—	—	—	—	89	—	—	149	$1.30
46	EXAONE 4.0 32BLG AI Research	88.9	97.7	—	—	—	—	—	80	—	—	—	$0.00
47	Qwen3 VL 235B A22BAlibaba	88.3	—	—	—	—	—	—	88.3	—	—	34	$0.80
48	DeepSeek R1 Distill Qwen 7BDeepSeek	88.1	92.8	—	—	—	—	83.3	—	—	—	—	—
49	INTELLECT-3Prime Intellect	88	—	—	—	—	—	—	88	—	131K	—	$0.20
50	Apriel-v1.6-15B-ThinkerServiceNow	88	—	—	—	—	—	—	88	—	—	—	$0.00
51	Gemini 2.5 Pro Preview 06-05Google	88	—	—	—	—	—	—	88	—	1M	85	$1.25
52	Qwen3 Next 80B A3B ThinkingAlibaba	87.8	—	—	—	—	—	—	87.8	—	262K	—	$0.10
53	GLM-4.5Zhipu AI	87.6	98.2	—	—	—	—	91	73.7	—	131K	85	$0.60
54	Gemini 1.5 ProGoogle	87.6	87.6	—	—	90.8	87.5	—	—	86.5	2M	85	$1.25
55	Llama Nemotron Super 49B v1.5NVIDIA	87.5	98.3	—	—	—	—	—	76.7	—	—	51	$0.10
56	Apriel-v1.5-15B-ThinkerServiceNow	87.5	—	—	—	—	—	—	87.5	—	—	—	$0.00
57	Gemini 2.0 Flash LiteGoogle	87.3	87.3	—	—	—	—	—	—	86.8	1M	85	$0.08
58	Claude Sonnet 4.5Anthropic	87	—	—	—	—	—	—	87	—	1M	42	$3.00
59	Kimi-k1.5Moonshot AI	86.9	96.2	—	—	—	—	77.5	—	—	—	—	—
60	Claude Opus 4Anthropic	86.9	98.2	—	—	—	—	—	75.5	—	200K	120	$15.00
61	Qwen3 235B A22BAlibaba	86.7	93	—	—	94.4	83.5	85.7	81.5	71.8	131K	68	$0.46
62	DeepSeek V3.2 ExpDeepSeek	86.4	—	—	83.6	—	—	—	89.3	—	164K	100	$0.27
63	o1-proOpenAI	86	—	—	—	—	—	86	—	—	200K	—	$150.00
64	Gemini 2.5 FlashGoogle	86	98.1	—	—	—	—	88	72	—	1M	85	$0.30
65	GLM 4.6VZhipu AI	85.3	—	—	—	—	—	—	85.3	—	131K	44	$0.30
66	ERNIE 5.0 ThinkingBaidu	85	—	—	—	—	—	—	85	—	—	—	$0.00
67	Nemotron Nano 9B V2NVIDIA	84.9	97.8	—	—	—	—	—	72.1	—	131K	—	$0.04
68	Llama 3.1 Nemotron Ultra 253B v1NVIDIA	84.8	97	—	—	—	—	—	72.5	—	—	42	$0.60
69	Claude Sonnet 4Anthropic	84.8	99.1	—	—	—	—	—	70.5	—	1M	101	$3.00
70	Seed-OSS-36B-InstructByteDance	84.7	—	—	—	—	—	—	84.7	—	—	37	$0.20
71	Qwen3 VL 32BAlibaba	84.7	—	—	—	—	—	—	84.7	—	—	93	$0.70
72	Sarvam MSarvam	84.7	84.7	—	—	—	—	—	—	—	—	136	$0.00
73	Qwen3-Next-80B-A3BAlibaba	84.3	—	—	—	—	—	—	84.3	—	262K	147	$0.50
74	Qwen3-235B-A22B-Instruct-2507Alibaba	84.2	98	—	—	—	—	—	70.3	—	131K	63	$0.15
75	Gemini 2.0 Flash ThinkingGoogle	83.9	94.4	—	—	—	—	73.3	—	—	—	—	$0.00
76	Ring-flash-2.0InclusionAI	83.7	—	—	—	—	—	—	83.7	—	—	—	$0.10
77	Qwen3 32BAlibaba	83.5	96.1	—	—	—	—	81.4	72.9	—	131K	328	$0.08
78	Qwen2.5 MaxAlibaba	83.5	83.5	—	—	—	—	—	—	—	—	50	$1.60
79	MiniMax M2.1MiniMax	82.7	—	—	—	—	—	—	82.7	—	205K	92	$0.29
80	Qwen3 4B 2507Alibaba	82.7	—	—	—	—	—	—	82.7	—	—	—	$0.00
81	Gemini 1.5 FlashGoogle	82.7	82.7	—	—	86.2	82.6	—	—	77.9	1M	150	$0.15
82	Qwen3 30B A3BAlibaba	82.4	95.9	—	—	—	—	80.4	70.9	—	131K	122	$0.09
83	Qwen3 VL 30B A3BAlibaba	82.3	—	—	—	—	—	—	82.3	—	—	122	$0.20
84	Qwen3 Max ThinkingAlibaba	82.3	—	—	—	—	—	—	82.3	—	262K	45	$0.78
85	DeepSeek-R1DeepSeek	82.3	96.6	—	—	—	—	—	68	—	128K	189	$0.55
86	Magistral Medium 1.2Mistral AI	82	—	—	—	—	—	—	82	—	—	42	$2.00
87	Qwen3 30B A3B 2507 InstructAlibaba	81.9	97.5	—	—	—	—	—	66.3	—	—	122	$0.20
88	SonarPerplexity	81.7	81.7	—	—	—	—	—	—	—	127K	—	$1.00
89	Qwen3Alibaba	81.5	—	—	—	—	—	—	81.5	—	128K	—	—
90	Qwen3 MaxAlibaba	80.7	—	—	—	—	—	—	80.7	—	262K	45	$0.78
91	Qwen2.5 32B InstructAlibaba	80.5	80.5	—	—	95.9	—	—	—	83.1	—	—	$0.00
92	Qwen2.5 TurboAlibaba	80.5	80.5	—	—	—	—	—	—	—	—	67	$0.10
93	Magistral Small 1.2Mistral AI	80.3	—	—	—	—	—	—	80.3	—	—	106	$0.50
94	Motif-2-12.7B-ReasoningMotif Technologies	80.3	—	—	—	—	—	—	80.3	—	—	—	$0.00
95	DeepSeek R1 Distill Qwen 32BDeepSeek	80.2	94.3	—	—	—	—	83.3	63	—	128K	37	$0.12
96	Falcon-H1R-7BTII UAE	80	—	—	—	—	—	—	80	—	—	—	$0.00
97	Phi 4 Reasoning PlusMicrosoft	79.7	—	—	—	—	—	81.3	78	—	—	—	—
98	MiniMax M1 80kMiniMax	79.5	98	—	—	—	—	—	61	—	—	—	$0.60
99	Doubao Seed CodeByteDance	79.3	—	—	—	—	—	—	79.3	—	—	—	$0.00
100	Claude 3.7 SonnetAnthropic	79.1	96.2	—	—	—	—	80	61	82	200K	101	$3.00
101	Solar Pro 2Upstage	79	96.7	—	—	—	—	—	61.3	—	—	—	$0.00
102	Mi:dm K 2.5 ProKorea Telecom	78.7	—	—	—	—	—	—	78.7	—	—	—	$0.00
103	DeepSeek R1 0528 Qwen3 8BDeepSeek	78.5	93.2	—	—	—	—	—	63.7	—	—	—	$0.00
104	GPT-5OpenAI	78.4	99.4	26.3	93.3	—	—	—	94.6	84.7	400K	100	$1.25
105	Gemini 2.5 FlashGoogle	78.3	—	—	—	—	—	—	78.3	—	—	—	$0.00
106	MiniMax-M2MiniMax	78.3	—	—	—	—	—	—	78.3	—	205K	91	$0.26
107	K2-V2MBZUAI Institute of Foundation Models	78.3	—	—	—	—	—	—	78.3	—	—	—	$0.00
108	DeepSeek R1 Distill Llama 70BDeepSeek	78.3	94.5	—	—	—	—	86.7	53.7	—	128K	37	$0.10
109	Claude Opus 4.1Anthropic	78	—	—	—	—	—	—	78	—	200K	120	$15.00
110	Grok-2xAI	77.8	77.8	—	—	—	—	—	—	76.1	128K	85	$2.00
111	Llama 3.1 Tulu3 405BAllen Institute for AI	77.8	77.8	—	—	—	—	—	—	—	—	—	$0.00
112	Llama-3.3 Nemotron Super 49B v1NVIDIA	77.5	96.6	—	—	—	—	—	58.4	—	—	—	$0.00
113	Olmo 3.1 32B ThinkAllen Institute for AI	77.3	—	—	—	—	—	—	77.3	—	—	—	$0.00
114	Claude 3.5 SonnetAnthropic	77.1	77.1	—	—	96.4	91.6	—	—	78.3	200K	101	$3.00
115	Qwen3 14BAlibaba	77.1	96.1	—	—	—	—	—	58	—	132K	62	$0.10
116	Qwen3 30B A3B 2507Alibaba	76.9	97.6	—	—	—	—	—	56.3	—	—	151	$0.30
117	Qwen2.5 Coder 32B InstructAlibaba	76.7	76.7	—	—	91.1	—	—	—	57.2	128K	110	$0.66
118	DeepSeek R1 Distill Qwen 14BDeepSeek	76.5	93.9	—	—	—	—	80	55.7	—	—	—	$0.00
119	DeepSeek-V2.5DeepSeek	76.3	76.3	—	—	95.1	—	—	—	74.7	8K	100	$0.14
120	Granite 3.3 8B InstructIBM	75.1	69	—	—	80.9	—	81.2	—	—	—	—	—
121	Granite 3.3 8B BaseIBM	75.1	69	—	—	59	—	81.2	—	—	—	—	—
122	NVIDIA Nemotron Nano 12B v2 VLNVIDIA	75	—	—	—	—	—	—	75	—	—	244	$0.20
123	Kimi K2Moonshot AI	74.6	97.1	—	—	—	—	69.6	57	—	131K	26	$0.57
124	Sonar ProPerplexity	74.5	74.5	—	—	—	—	—	—	—	200K	—	$3.00
125	DeepSeek-Coder-V2DeepSeek	74.3	74.3	—	—	—	—	—	—	—	—	—	$0.00
126	Qwen3 Omni 30B A3BAlibaba	74	—	—	—	—	—	—	74	—	—	102	$0.30
127	Olmo 3 32B ThinkAllen Institute for AI	73.7	—	—	—	—	—	—	73.7	—	66K	—	$0.15
128	GPT-4 TurboOpenAI	73.7	73.7	—	—	—	88.5	—	—	72.6	128K	100	$10.00
129	GrokxAI	73.7	73.7	—	—	—	—	—	—	—	—	—	$0.00
130	Gemini 2.5 Flash LiteGoogle	73.4	96.9	—	—	—	—	—	49.8	—	1M	6	$0.10
131	o3OpenAI	73.3	99.2	15.8	—	—	—	91.6	86.4	—	200K	50	$2.00
132	GLM 4.5VZhipu AI	73	—	—	—	—	—	—	73	—	66K	85	$0.60
133	Cogito v2.1Deep Cogito	72.7	—	—	—	—	—	—	72.7	—	—	56	$1.30
134	Llama 3.1 Nemotron Nano 4B v1.1NVIDIA	72.4	94.7	—	—	—	—	—	50	—	—	—	$0.00
135	Qwen3 VL 30B A3B InstructAlibaba	72.3	—	—	—	—	—	—	72.3	—	262K	123	$0.13
136	Claude 3.5 HaikuAnthropic	72.1	72.1	—	—	—	85.6	—	—	69.4	200K	104	$0.80
137	Ling-1TInclusionAI	71.3	—	—	—	—	—	—	71.3	—	—	—	$0.00
138	Llama 3.1 Nemotron Nano 8B V1NVIDIA	71.3	95.4	—	—	—	—	—	47.1	—	—	—	—
139	Qwen3 VL 235B A22B InstructAlibaba	70.7	—	—	—	—	—	—	70.7	—	262K	51	$0.20
140	Olmo 3 7B ThinkAllen Institute for AI	70.7	—	—	—	—	—	—	70.7	—	—	—	$0.00
141	QwQ-32B-PreviewAlibaba	70.3	90.6	—	—	—	—	50	—	—	33K	99	$0.15
142	Qwen2 72B InstructAlibaba	70.1	70.1	—	—	91.1	—	—	—	59.7	—	—	$0.00
143	DeepSeek R1 Distill Llama 8BDeepSeek	70.1	89.1	—	—	—	—	80	41.3	—	—	—	$0.00
144	Hermes 4 - Llama-3.1 405BNous Research	69.7	—	—	—	—	—	—	69.7	—	—	34	$1.00
145	NVIDIA Nemotron Nano 9B V2NVIDIA	69.7	—	—	—	—	—	—	69.7	—	—	129	$0.00
146	Qwen3 Next 80B A3B InstructAlibaba	69.5	—	—	—	—	—	—	69.5	—	262K	161	$0.09
147	Magistral MediumMistral AI	69.3	—	—	—	—	—	73.6	64.9	—	—	—	—
148	Phi-4-multimodal-instructMicrosoft	69.3	69.3	—	—	—	—	—	—	—	128K	25	$0.05
149	Phi 4 ReasoningMicrosoft	69.1	—	—	—	—	—	75.3	62.9	—	—	—	—
150	Gemini 1.5 Flash 8BGoogle	68.9	68.9	—	—	—	—	—	—	58.7	1M	150	$0.07
151	Magistral Small 1Mistral AI	68.8	96.3	—	—	—	—	—	41.3	—	—	—	$0.00
152	Gemini 2.5 Flash-LiteGoogle	68.7	—	—	—	—	—	—	68.7	—	—	—	$0.10
153	Hermes 4 - Llama-3.1 70BNous Research	68.7	—	—	—	—	—	—	68.7	—	—	60	$0.10
154	Qwen3 VL 32B InstructAlibaba	68.3	—	—	—	—	—	—	68.3	—	262K	76	$0.10
155	Mistral SabaMistral AI	67.7	67.7	—	—	—	—	—	—	—	—	—	$0.00
156	ERNIE 4.5 300B A47BBaidu	67.2	93.1	—	—	—	—	—	41.3	—	131K	24	$0.28
157	o1-previewOpenAI	67.2	92.4	—	—	—	90.8	42	—	85.5	128K	66	$15.00
158	GPT-5 miniOpenAI	67	—	22.1	87.8	—	—	—	91.1	—	400K	200	$0.25
159	Qwen3 Coder 480B A35B InstructAlibaba	66.8	94.2	—	—	—	—	—	39.3	—	—	69	$0.30
160	Magistral Small 2506Mistral AI	66.8	—	—	—	—	—	70.7	62.8	—	—	—	—
161	QwQ-32BAlibaba	66.4	90.6	—	—	—	—	79.5	29	—	—	31	$0.70
162	Magistral Medium 1Mistral AI	66	91.7	—	—	—	—	—	40.3	—	—	—	$0.00
163	Qwen2.5-Coder 7B InstructAlibaba	66	66	—	—	83.9	—	—	—	46.6	—	—	$0.00
164	Ling-flash-2.0InclusionAI	65.3	—	—	—	—	—	—	65.3	—	—	91	$0.10
165	o3-miniOpenAI	65	98.5	9.2	—	—	92	87.3	—	97.9	200K	115	$1.10
166	DeepSeek-V3 0324DeepSeek	64.8	94	—	—	—	—	59.4	41	—	164K	—	$0.28
167	Kimi K2 0905Moonshot AI	64.7	—	—	—	—	—	72	57.3	89.1	262K	16	$0.60
168	Claude 3 OpusAnthropic	64.1	64.1	—	—	95	90.7	—	—	60.1	200K	120	$15.00
169	Qwen3 1.7BAlibaba	64.1	89.4	—	—	—	—	—	38.7	—	—	138	$0.10
170	Kimi K2 InstructMoonshot AI	63.8	97.4	—	38.8	97.3	—	69.6	49.5	—	131K	45	$0.57
171	Kimi K2-Instruct-0905Moonshot AI	63.8	97.4	—	38.8	—	—	69.6	49.5	—	—	—	—
172	Llama 3.2 90B InstructMeta	62.9	62.9	—	—	—	86.9	—	—	68	128K	100	$0.35
173	Reka Flash 3Reka AI	61.5	89.3	—	—	—	—	—	33.7	—	66K	93	$0.10
174	Jamba 1.5 LargeAI21 Labs	60.6	60.6	—	—	87	—	—	—	—	256K	100	$2.00
175	Mistral Medium 3Mistral AI	60.5	90.7	—	—	—	—	—	30.3	—	131K	32	$0.40
176	DeepHermes 3 - Mistral 24BNous Research	59.5	59.5	—	—	—	—	—	—	—	—	—	$0.00
177	Qwen3 Coder 30B A3B InstructAlibaba	59.2	89.3	—	—	—	—	—	29	—	160K	97	$0.07
178	HyperCLOVA X SEED ThinkNaver	59	—	—	—	—	—	—	59	—	—	—	$0.00
179	o1OpenAI	58.9	97	5.5	—	97.1	89.3	74.3	—	96.4	200K	66	$15.00
180	Jamba 1.6 LargeAI21 Labs	58	58	—	—	—	—	—	—	—	—	52	$2.00
181	Qwen3 4BAlibaba	57.8	93.3	—	—	—	—	—	22.3	—	—	103	$0.10
182	Mistral Small 3.2Mistral AI	57.7	88.3	—	—	—	—	—	27	—	—	100	$0.10
183	Gemini 2.0 FlashGoogle	57.4	93	—	—	—	—	—	21.7	89.7	1M	183	$0.10
184	Qwen3 8BAlibaba	57.4	90.4	—	—	—	—	—	24.3	—	131K	69	$0.05
185	GPT-5 nanoOpenAI	56.8	—	9.6	75.6	—	—	—	85.2	—	400K	500	$0.05
186	Mistral SmallMistral AI	56.3	56.3	—	—	—	—	—	—	—	—	134	$0.20
187	MiniMax M1 40kMiniMax	55.5	97.2	—	—	—	—	—	13.7	—	—	—	$0.00
188	Gemma 3 27B InstructGoogle	54.5	88.3	—	—	—	—	—	20.7	—	—	—	$0.10
189	Mixtral 8x22B InstructMistral AI	54.5	54.5	—	—	—	—	—	—	—	66K	—	$2.00
190	GPT-4.1 MiniOpenAI	54.3	92.5	—	35	—	—	49.6	40.2	—	1M	150	$0.40
191	Llama 4 MaverickMeta	54.1	88.9	—	—	—	92.3	—	19.3	61.2	1M	639	$0.15
192	Hermes 3 - Llama-3.1 70BNous Research	53.8	53.8	—	—	—	—	—	—	—	—	32	$0.30
193	GPT-4.1OpenAI	53.7	91.3	—	28.9	—	—	48.1	46.4	87	1M	100	$2.00
194	Reka FlashReka AI	52.9	52.9	—	—	—	—	—	—	—	—	85	$0.20
195	DeepSeek R1 Distill Qwen 1.5BDeepSeek	52.9	83.9	—	—	—	—	52.7	22	—	—	—	$0.00
196	Mistral LargeMistral AI	52.7	52.7	—	—	—	—	—	—	—	128K	—	$2.00
197	Qwen3 Omni 30B A3B InstructAlibaba	52.3	—	—	—	—	—	—	52.3	—	—	103	$0.30
198	Qwen3 4B 2507 InstructAlibaba	52.3	—	—	—	—	—	—	52.3	—	—	—	$0.00
199	DeepSeek-V3DeepSeek	51.8	90.2	—	—	—	—	39.2	26	—	131K	100	$0.23
200	Gemma 3 12B InstructGoogle	51.8	85.3	—	—	—	—	—	18.3	—	—	—	$0.10
201	Nova PremierAmazon	50.6	83.9	—	—	—	—	—	17.3	—	—	40	$2.50
202	Exaone 4.0 1.2BLG AI Research	50.3	—	—	—	—	—	—	50.3	—	—	—	$0.00
203	DeepSeek-V3.1DeepSeek	49.9	—	—	33.5	—	—	66.3	49.8	—	164K	—	$0.21
204	Qwen2.5 72B InstructAlibaba	49.9	85.8	—	—	95.8	—	—	14	83.1	131K	100	$0.36
205	Llama 3 8B InstructMeta	49.9	49.9	—	—	—	—	—	—	—	8K	81	$0.04
206	Phi 4Microsoft	49.5	81	—	—	—	80.6	—	18	80.4	16K	33	$0.07
207	Ling-mini-2.0InclusionAI	49.3	—	—	—	—	—	—	49.3	—	—	—	$0.00
208	Llama 4 ScoutMeta	49.2	84.4	—	—	—	90.6	—	14	50.3	10M	776	$0.08
209	Devstral SmallMistral AI	48.9	68.4	—	—	—	—	—	29.3	—	—	190	$0.10
210	Llama 3 70B InstructMeta	48.3	48.3	—	—	—	—	—	—	—	8K	45	$0.51
211	LFM 40BLiquid AI	48	48	—	—	—	—	—	—	—	—	—	$0.00
212	Command ACohere	47.5	81.9	—	—	—	—	—	13	—	256K	203	$2.50
213	GPT-4o-miniOpenAI	46.8	78.9	—	—	—	87	—	14.7	70.2	128K	92	$0.15
214	Qwen3 0.6BAlibaba	46.5	75	—	—	—	—	—	18	—	—	225	$0.10
215	GPT-4.1 NanoOpenAI	46.1	84.8	—	—	—	—	29.4	24	—	1M	200	$0.10
216	Gemma 3n E4B InstructGoogle	45.7	77.1	—	—	—	—	—	14.3	—	—	56	$0.00
217	Gemma 3 4B InstructGoogle	44.7	76.6	—	—	—	—	—	12.7	—	—	—	$0.00
218	GPT-3.5 TurboOpenAI	44.1	44.1	—	—	—	56.3	—	—	43.1	16K	100	$0.50
219	Mistral Large 2Mistral AI	43.8	73.6	—	—	93	—	—	14	—	128K	42	$2.00
220	Grok Code Fast 1xAI	43.3	—	—	—	—	—	—	43.3	—	—	—	$0.00
221	Nova ProAmazon	42.8	78.6	—	—	94.8	—	—	7	76.6	300K	100	$0.80
222	GPT-4oOpenAI	42.7	89.3	—	—	—	—	13.1	25.7	—	128K	132	$2.50
223	Llama 3.3 70B InstructMeta	42.5	77.3	—	—	—	91.1	—	7.7	77	131K	2220	$0.10
224	Llama 3.1 Nemotron 70B InstructNVIDIA	42.2	73.3	—	—	91.4	—	—	11	—	—	292	$1.20
225	Nova LiteAmazon	41.8	76.5	—	—	94.5	—	—	7	73.3	300K	100	$0.06
226	Claude 3 SonnetAnthropic	41.4	41.4	—	—	92.3	83.5	—	—	43.1	200K	120	$3.00
227	Olmo 3 7B InstructAllen Institute for AI	41.3	—	—	—	—	—	—	41.3	—	—	—	$0.10
228	Mistral MediumMistral AI	40.5	40.5	—	—	—	—	—	—	—	—	45	$2.80
229	Gemini 1.0 ProGoogle	40.3	40.3	—	—	—	—	—	—	32.6	33K	120	$0.50
230	Gemma 3n E2B InstructGoogle	39.7	69.1	—	—	—	—	—	10.3	—	—	—	$0.00
231	Claude 3 HaikuAnthropic	39.4	39.4	—	—	88.9	75.1	—	—	38.9	200K	104	$0.25
232	Mistral Medium 3.1Mistral AI	38.3	—	—	—	—	—	—	38.3	—	131K	47	$0.40
233	Nova MicroAmazon	38.2	70.3	—	—	92.3	—	—	6	69.3	128K	100	$0.03
234	Phi 4 Mini InstructMicrosoft	38.2	69.6	—	—	—	—	—	6.7	—	131K	—	$0.08
235	Mistral Large 3Mistral AI	38	—	—	—	—	—	—	38	—	262K	54	$0.50
236	Mistral Small 3Mistral AI	37.9	71.5	—	—	—	—	—	4.3	—	33K	136	$0.05
237	Devstral MediumMistral AI	37.7	70.7	—	—	—	—	—	4.7	—	131K	72	$0.40
238	Claude 2.1Anthropic	37.4	37.4	—	—	—	—	—	—	—	—	—	$0.00
239	Mistral Small 3.1Mistral AI	37.2	70.7	—	—	—	—	—	3.7	—	—	134	$0.10
240	Qwen3 VL 4B InstructAlibaba	37	—	—	—	—	—	—	37	—	—	—	$0.00
241	Pixtral LargeMistral AI	36.9	71.4	—	—	—	—	—	2.3	—	131K	0	$2.00
242	Llama 3.1 405B InstructMeta	36.7	70.3	—	—	96.8	—	—	3	73.8	128K	100	$0.89
243	GPT-4.5OpenAI	36.7	—	—	—	97	—	36.7	—	85	128K	50	$75.00
244	Devstral 2Mistral AI	36.7	—	—	—	—	—	—	36.7	—	262K	51	$0.40
245	Granite 3.3 8BIBM	36.6	66.5	—	—	—	—	—	6.7	—	—	376	$0.00
246	Kimi Linear 48B A3B InstructMoonshot AI	36.3	—	—	—	—	—	—	36.3	—	—	—	$0.00
247	Jamba 1.5 MiniAI21 Labs	35.7	35.7	—	—	75.8	—	—	—	—	256K	100	$0.20
248	Llama 3.1 70B InstructMeta	34.5	64.9	—	—	—	—	—	4	—	131K	1204	$0.40
249	Devstral Small 2Mistral AI	34.3	—	—	—	—	—	—	34.3	—	—	62	$0.00
250	Solar MiniUpstage	33.1	33.1	—	—	—	—	—	—	—	—	63	$0.20
251	Llama 2 Chat 13BMeta	32.9	32.9	—	—	—	—	—	—	—	—	—	$0.00
252	Llama 2 Chat 70BMeta	32.3	32.3	—	—	—	—	—	—	—	—	—	$0.00
253	Ministral 3 8BMistral AI	31.7	—	—	—	—	—	—	31.7	—	262K	86	$0.15
254	Jamba Large 1.7AI21 Labs	31.2	60	—	—	—	—	—	2.3	—	256K	48	$2.00
255	Qwen3 VL 8BAlibaba	30.7	—	—	—	—	—	—	30.7	—	—	120	$0.20
256	OpenChat 3.5OpenChat	30.7	30.7	—	—	—	—	—	—	—	—	—	$0.00
257	Ministral 3 14BMistral AI	30	—	—	—	—	—	—	30	—	262K	67	$0.20
258	Mixtral 8x7B InstructMistral AI	29.9	29.9	—	—	—	—	—	—	—	—	—	$0.50
259	Llama 3.1 8B InstructMeta	28.1	51.9	—	—	—	—	—	4.3	—	131K	2047	$0.02
260	Command R+Cohere	27.9	27.9	—	—	70.7	—	—	—	—	128K	100	$0.15
261	DBRX InstructDatabricks	27.9	27.9	—	—	—	—	—	—	—	—	—	$0.00
262	Qwen3 VL 8B InstructAlibaba	27.3	—	—	—	—	—	—	27.3	—	256K	145	$0.08
263	Llama 3.2 11B InstructMeta	26.7	51.6	—	—	—	68.9	—	1.7	51.9	128K	168	$0.05
264	Claude InstantAnthropic	26.4	26.4	—	—	—	—	—	—	—	—	—	$0.00
265	Llama 3.2 3B InstructMeta	26.1	48.9	—	—	77.7	58.2	—	3.3	48	131K	172	$0.05
266	Gemma 3 1B InstructGoogle	25.9	48.4	—	—	—	—	—	3.3	—	—	—	$0.00
267	Qwen3 VL 4BAlibaba	25.7	—	—	—	—	—	—	25.7	—	—	—	$0.00
268	Jamba 1.6 MiniAI21 Labs	25.7	25.7	—	—	—	—	—	—	—	—	183	$0.20
269	LFM2 8B A1BLiquid AI	25.3	—	—	—	—	—	—	25.3	—	—	—	$0.00
270	Gemini DiffusionGoogle	23.3	—	—	—	—	—	—	23.3	—	—	—	—
271	Phi-3 Mini Instruct 3.8BMicrosoft	23	45.7	—	—	—	—	—	0.3	—	—	—	$0.00
272	Ministral 3 3BMistral AI	22	—	—	—	—	—	—	22	—	131K	154	$0.10
273	DeepHermes 3 - Llama-3.1 8BNous Research	21.8	21.8	—	—	—	—	—	—	—	—	—	$0.00
274	Granite 4.0 H SmallIBM	13.7	—	—	—	—	—	—	13.7	—	—	524	$0.10
275	Jamba 1.7 MiniAI21 Labs	13.1	25.8	—	—	—	—	—	0.3	—	—	—	$0.00
276	Mistral 7B InstructMistral AI	12.1	12.1	—	—	—	—	—	—	—	—	90	$0.20
277	Gemma 3n E4B Instructed LiteRT PreviewGoogle	11.6	—	—	—	—	60.7	—	11.6	—	—	—	—
278	Gemma 3n E4B InstructedGoogle	11.6	—	—	—	—	67	—	11.6	—	32K	42	$20.00
279	Jamba Reasoning 3BAI21 Labs	10.7	—	—	—	—	—	—	10.7	—	—	—	$0.00
280	LFM2 2.6BLiquid AI	8.3	—	—	—	—	—	—	8.3	—	—	—	$0.00
281	Llama 3.2 1B InstructMeta	7	14	—	—	—	—	—	0	—	131K	91	$0.03
282	Gemma 3n E2B Instructed LiteRT (Preview)Google	6.7	—	—	—	—	53.1	—	6.7	—	—	—	—
283	Gemma 3n E2B InstructedGoogle	6.7	—	—	—	—	53.1	—	6.7	—	—	—	—
284	Granite 4.0 H 1BIBM	6.3	—	—	—	—	—	—	6.3	—	—	—	$0.00
285	Granite 4.0 1BIBM	6.3	—	—	—	—	—	—	6.3	—	—	—	$0.00
286	Granite 4.0 MicroIBM	6	—	—	—	—	—	—	6	—	131K	—	$0.02
287	Llama 2 Chat 7BMeta	5.9	5.9	—	—	—	—	—	—	—	—	113	$0.10
288	OLMo 2 32BAllen Institute for AI	3.3	—	—	—	—	—	—	3.3	—	—	—	$0.00
289	LFM2 1.2BLiquid AI	3.3	—	—	—	—	—	—	3.3	—	—	—	$0.00
290	Gemma 3 270MGoogle	2.3	—	—	—	—	—	—	2.3	—	—	—	$0.00
291	Granite 4.0 H 350MIBM	1.3	—	—	—	—	—	—	1.3	—	—	—	$0.00
292	OLMo 2 7BAllen Institute for AI	0.7	—	—	—	—	—	—	0.7	—	—	—	$0.00
293	Molmo 7B-DAllen Institute for AI	0	—	—	—	—	—	—	0	—	—	—	$0.00
294	Granite 4.0 350MIBM	0	—	—	—	—	—	—	0	—	—	—	$0.00

294 models ranked on Math. The intelligence index is a balanced mean of per-category scores; category columns average the benchmarks within each. Scores are curated approximations — see each model for sources. Click any column to sort.