298 models in catalog

AI models

Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.

Leaderboard →Labs →Benchmarks →

Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?

Overview Reasoning Coding Math Agents Multimodal General Long Context

Rank	Model	Reason idx ↓	BIG-Bench Hard	ARC-AGI-2	DROP	GPQA Diamond	Released	Country	Type	Access	Params	Cutoff	Context	Speed	Latency	In $/M	Out $/M
#51	Grok-3 xAI	84.6	—	—	—	84.6	2025	—	multimodal	API only	—	2024	128K	100	0.70	$3.00	$15.00
#52	Kimi K2 Thinking Moonshot AI	84.5	—	—	—	84.5	2025	—	llm	Open weights	1T (32B active)	—	262K	100	1.00	$0.60	$2.50
#53	Qwen3.5-35B-A3B Alibaba	84.5	—	—	—	84.5	2026	—	multimodal	Open weights	—	—	262K	121	1.07	$0.14	$1.00
#54	Qwen3.6 27B Alibaba	84.2	—	—	—	84.2	2026	—	multimodal	Open weights	—	—	262K	64	1.40	$0.29	$3.20
#55	Qwen3.6 35B A3B Alibaba	84.1	—	—	—	84.1	2026	—	multimodal	Open weights	—	—	262K	169	1.47	$0.14	$1.00
#56	DeepSeek-V3.2 DeepSeek	84	—	—	—	84	2025	—	llm	Open weights	671B (37B active)	—	131K	—	—	$0.25	$0.38
#57	GPT-5 Codex OpenAI	83.7	—	—	—	83.7	2025	—	multimodal	API only	—	2024	400K	180	6.64	$1.25	$10.00
#58	Claude Sonnet 4.5 Anthropic	83.4	—	—	—	83.4	2025	—	llm	API only	—	2025	1M	42	0.40	$3.00	$15.00
#59	Step 3.5 Flash StepFun	83.1	—	—	—	83.1	2026	—	llm	Open weights	—	—	262K	194	0.85	$0.09	$0.30
#60	MiniMax M2.1 MiniMax	83	—	—	—	83	2025	—	llm	Open weights	—	—	205K	92	1.14	$0.29	$0.95
#61	JT-35B-FlashNew China Mobile	82.9	—	—	—	82.9	2026	—	llm	—	—	—	—	—	—	$0.00	$0.00
#62	MiMo-V2-Omni Xiaomi	82.8	—	—	—	82.8	2026	—	multimodal	API only	—	—	262K	108	1.36	$0.40	$2.00
#63	Gemini 2.5 Flash Google	82.8	—	—	—	82.8	2025	—	multimodal	API only	—	2025	1M	85	0.70	$0.30	$2.50
#64	Step 3.5 Flash 2603 StepFun	82.6	—	—	—	82.6	2026	—	llm	—	—	—	—	197	0.90	$0.00	$0.00
#65	Qwen3.5 Omni Plus Alibaba	82.6	—	—	—	82.6	2026	—	llm	—	—	—	—	54	1.28	$0.40	$4.80
#66	GPT-5 mini OpenAI	82.3	—	—	—	82.3	2025	—	llm	API only	—	2024	400K	200	1.00	$0.25	$2.00
#67	Gemini 3.1 Flash LiteNew Google	82.2	—	—	—	82.2	2026	—	multimodal	API only	—	—	1M	342	5.35	$0.25	$1.50
#68	GPT-5.4 nano OpenAI	81.7	—	—	—	81.7	2026	—	llm	API only	—	2025	400K	157	0.55	$0.20	$1.25
#69	o4-mini OpenAI	81.4	—	—	—	81.4	2025	—	multimodal	API only	—	2024	200K	115	5.20	$1.10	$4.40
#70	GPT-5.1-Codex-Mini OpenAI	81.3	—	—	—	81.3	2025	—	multimodal	API only	—	—	400K	175	9.50	$0.25	$2.00
#71	Nova 2 Lite Amazon	81.1	—	—	—	81.1	2025	—	multimodal	API only	—	—	1M	229	0.89	$0.30	$2.50
#72	ERNIE 4.5 300B A47B Baidu	81.1	—	—	—	81.1	2025	—	llm	Open weights	—	2025	131K	24	1.53	$0.28	$1.10
#73	GLM-4.6 Zhipu AI	81	—	—	—	81	2025	—	llm	Open weights	357B (MoE)	2025	203K	85	0.70	$0.43	$1.74
#74	DeepSeek-R1-0528 DeepSeek	81	—	—	—	81	2025	—	llm	Open weights	671000000000	—	131K	45	0.30	$0.55	$2.19
#75	GLM 5V Turbo Zhipu AI	80.9	—	—	—	80.9	2026	—	multimodal	API only	—	—	203K	—	—	$1.20	$4.00

Ranked on Reasoning. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.