298 models in catalog

AI models

Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.

Leaderboard →Labs →Benchmarks →

Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?

Overview Reasoning Coding Math Agents Multimodal General Long Context

Rank	Model	Reason idx ↓	BIG-Bench Hard	ARC-AGI-2	DROP	GPQA Diamond	Released	Country	Type	Access	Params	Cutoff	Context	Speed	Latency	In $/M	Out $/M
#1	Claude Opus 4.7 Anthropic	94.2	—	—	—	94.2	2026	—	llm	API only	—	—	1M	49	1.42	$5.00	$25.00
#2	GPT-5.5 OpenAI	93.5	—	—	—	93.5	2026	—	llm	API only	—	2025	1.1M	67	0.97	$5.00	$30.00
#3	Qwen3.7 MaxNew Alibaba	92.3	—	—	—	92.3	2026	—	llm	API only	—	—	1M	203	1.59	$1.25	$3.75
#4	Gemini 3.5 FlashNew Google	92.2	—	—	—	92.2	2026	—	multimodal	API only	—	2025	1M	221	9.75	$1.50	$9.00
#5	Claude Opus 4.8New Anthropic	92	—	—	—	92	2026	—	llm	API only	—	—	1M	66	6.54	$5.00	$25.00
#6	GPT-5.4 OpenAI	92	—	—	—	92	2026	—	llm	API only	—	—	1.1M	84	0.63	$2.50	$15.00
#7	GPT-5.3-Codex OpenAI	91.5	—	—	—	91.5	2026	—	multimodal	API only	—	—	400K	73	81.08	$1.75	$14.00
#8	Kimi K2.6 Moonshot AI	91.1	—	—	—	91.1	2026	—	llm	Open weights	1T (32B active)	—	262K	57	1.20	$0.73	$3.49
#9	Grok 4.20 0309 v2 xAI	91.1	—	—	—	91.1	2026	—	llm	—	—	—	—	105	0.70	$2.00	$6.00
#10	Gemini 3 Flash Google	90.4	—	—	—	90.4	2025	—	multimodal	API only	—	—	1M	191	1.05	$0.50	$3.00
#11	DeepSeek-V4-Pro DeepSeek	90.1	—	—	—	90.1	2026	—	llm	Open weights	1.6T (49B active)	—	1M	30	1.16	$0.44	$0.87
#12	Grok 4.3New xAI	90.1	—	—	—	90.1	2026	—	llm	API only	—	—	1M	88	0.52	$1.25	$2.50
#13	GPT-5.2-Codex OpenAI	89.9	—	—	—	89.9	2026	—	multimodal	API only	—	—	400K	106	2.08	$1.75	$14.00
#14	DeepSeek-V4-Flash DeepSeek	89.4	—	—	—	89.4	2026	—	llm	Open weights	284B (13B active)	—	1M	109	0.76	$0.10	$0.20
#15	Qwen3.5 397B A17B Alibaba	89.3	—	—	—	89.3	2026	—	multimodal	Open weights	—	—	262K	53	1.82	$0.39	$2.34
#16	Qwen3.6 Max Alibaba	88.8	—	—	—	88.8	2026	—	llm	API only	—	—	262K	36	2.79	$1.04	$6.24
#17	Grok 4.20 0309 xAI	88.5	—	—	—	88.5	2026	—	llm	—	—	—	—	97	0.62	$2.00	$6.00
#18	Muse Spark Meta	88.4	—	—	—	88.4	2026	—	multimodal	API only	—	—	—	—	—	$0.00	$0.00
#19	Qwen3.6 Plus Alibaba	88.2	—	—	—	88.2	2026	—	multimodal	API only	—	—	1M	52	1.73	$0.33	$1.95
#20	GPT-5.1 OpenAI	88.1	—	—	—	88.1	2025	—	llm	API only	—	—	400K	115	0.77	$1.25	$10.00
#21	Kimi K2.5 Moonshot AI	87.9	—	—	—	87.9	2026	—	multimodal	Open weights	1T (32B active)	—	262K	35	1.33	$0.40	$1.90
#22	GPT-5.4 mini OpenAI	87.5	—	—	—	87.5	2026	—	llm	API only	—	2025	400K	162	0.63	$0.75	$4.50
#23	MiniMax M2.7 MiniMax	87.4	—	—	—	87.4	2026	—	llm	Open weights	—	—	205K	50	1.32	$0.28	$1.20
#24	GPT-5 OpenAI	87.3	—	—	—	87.3	2025	—	llm	API only	—	2024	400K	100	2.00	$1.25	$10.00
#25	DeepSeek V3.2 Speciale DeepSeek	87.1	—	—	—	87.1	2025	—	llm	Open weights	—	—	164K	—	—	$0.29	$0.43

Ranked on Reasoning. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.