298 models in catalog

AI models

Every model we track: frontier flagships, open-weights specialists, and narrow benchmark-only releases. Filter by lab, country, access, modality, or release window. The catalog is sorted newest first by default; head to the leaderboard for the ranked view.


Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter


Rank	Model	Lab	Agents idx ↓	τ²-bench	BFCL	τ²-bench Airline	τ²-bench Retail	BrowseComp	TAU-bench Airline	TAU-bench Retail	Released	Country	Type	Access	Params	Cutoff	Context	Speed	Latency	In $/M	Out $/M
#151	Qwen3-Next-80B-A3B	Alibaba	41.5	41.5	—	—	—	—	—	—	2025	—	llm	Open weights	80B (3B active)	—	262K	147	1.14	$0.50	$6.00
#152	Mistral Small 4	Mistral AI	41.2	41.2	—	—	—	—	—	—	2026	—	multimodal	Open weights	—	—	262K	145	0.51	$0.15	$0.60
#153	Nova Pro	Amazon	41.2	14	68.4	—	—	—	—	—	2024	—	multimodal	API only	—	—	300K	100	0.50	$0.80	$3.20
#154	NVIDIA Nemotron 3 Nano 30B A3B	NVIDIA	40.9	40.9	—	—	—	—	—	—	2025	—	llm	—	—	—	—	148	0.30	$0.10	$0.20
#155	Mistral Medium 3.1	Mistral AI	40.6	40.6	—	—	—	—	—	—	2025	—	multimodal	API only	—	2025	131K	47	0.69	$0.40	$2.00
#156	o3-mini	OpenAI	40.4	31.3	—	—	—	—	32.4	57.6	2025	—	llm	API only	—	2023	200K	115	5.20	$1.10	$4.40
#157	Nova Premier	Amazon	38.3	38.3	—	—	—	—	—	—	2025	—	llm	—	—	—	—	40	1.31	$2.50	$12.50
#158	Devstral Small	Mistral AI	38	38	—	—	—	—	—	—	2025	—	llm	—	—	—	—	190	0.42	$0.10	$0.30
#159	DeepSeek V3.1 Terminus	DeepSeek	37.1	37.1	—	—	—	—	—	—	2025	—	llm	Open weights	—	2025	164K	—	—	$0.27	$0.95
#160	DeepSeek V3.2 Exp	DeepSeek	37	33.9	—	—	—	40.1	—	—	2025	—	llm	Open weights	—	2025	164K	100	0.70	$0.27	$0.41
#161	GPT-5 nano	OpenAI	36.5	36.5	—	—	—	—	—	—	2025	—	llm	API only	—	2024	400K	500	0.30	$0.05	$0.40
#162	Pixtral Large	Mistral AI	36.5	36.5	—	—	—	—	—	—	2024	—	multimodal	API only	—	2024	131K	0	0.50	$2.00	$6.00
#163	Qwen3 VL 235B A22B Instruct	Alibaba	35.1	35.1	—	—	—	—	—	—	2025	—	multimodal	Open weights	—	2025	262K	51	1.20	$0.20	$0.88
#164	Nova Micro	Amazon	35.1	14	56.2	—	—	—	—	—	2024	—	llm	API only	—	—	128K	100	0.50	$0.03	$0.14
#165	Qwen3 Coder 30B A3B Instruct	Alibaba	34.5	34.5	—	—	—	—	—	—	2025	—	llm	Open weights	—	2025	160K	97	1.49	$0.07	$0.27
#166	Qwen2.5 72B Instruct	Alibaba	34.5	34.5	—	—	—	—	—	—	2024	—	llm	Open weights	—	2024	131K	100	0.37	$0.36	$0.40
#167	Qwen3 14B	Alibaba	34.5	34.5	—	—	—	—	—	—	2025	—	llm	Open weights	—	2025	132K	62	1.01	$0.10	$0.24
#168	Sarvam 30B	Sarvam	34.5	34.5	—	—	—	—	—	—	2026	—	llm	—	—	—	—	214	1.17	$0.00	$0.00
#169	MiniMax M1 80k	MiniMax	34.2	34.2	—	—	—	—	—	—	2025	—	llm	—	—	—	—	—	—	$0.60	$2.20
#170	DeepSeek-V3.1	DeepSeek	33.7	37.4	—	—	—	30	—	—	2025	—	llm	Open weights	671B (37B active)	2025	164K	—	—	$0.21	$0.79
#171	Mistral Large 2	Mistral AI	33	33	—	—	—	—	—	—	2024	—	llm	Open weights	123B	—	128K	42	0.40	$2.00	$6.00
#172	Claude 3.5 Haiku	Anthropic	32.8	24.6	—	—	—	—	22.8	51	2024	—	llm	API only	—	2024	200K	104	0.30	$0.80	$4.00
#173	Ling-1T	InclusionAI	32.7	32.7	—	—	—	—	—	—	2025	—	llm	—	—	—	—	—	—	$0.00	$0.00
#174	Solar Pro 2	Upstage	31.9	31.9	—	—	—	—	—	—	2025	—	llm	—	—	—	—	—	—	$0.00	$0.00
#175	Gemini 2.5 Flash	Google	31.6	31.6	—	—	—	—	—	—	2025	—	multimodal	API only	—	2025	1M	85	0.70	$0.30	$2.50

Ranked by Agents index, highest first. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations; see each model's page for sources.
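
The page does not say how the Agents idx is computed. In the rows shown here, though, each value matches the unweighted mean of whichever agent benchmark scores are reported for that model, with empty cells skipped. The Python sketch below checks that reading against four rows copied from the table. The function name `agents_index` and the averaging rule itself are assumptions inferred from the data, not the site's documented methodology.

```python
from statistics import mean

# Scores copied from four table rows; None marks a cell shown as a dash.
# Column order: tau2-bench, BFCL, BrowseComp, TAU-bench Airline, TAU-bench Retail.
# (The tau2-bench Airline/Retail columns are empty for every row shown, so they are omitted.)
ROWS = {
    "Nova Pro": (41.2, [14, 68.4, None, None, None]),
    "o3-mini": (40.4, [31.3, None, None, 32.4, 57.6]),
    "DeepSeek V3.2 Exp": (37.0, [33.9, None, 40.1, None, None]),
    "Claude 3.5 Haiku": (32.8, [24.6, None, None, 22.8, 51]),
}


def agents_index(scores):
    """Unweighted mean of the reported scores, skipping missing cells.

    This formula is an assumption inferred from the table, not a documented rule.
    Returns None when no score is reported.
    """
    reported = [s for s in scores if s is not None]
    return mean(reported) if reported else None


for name, (listed, scores) in ROWS.items():
    computed = agents_index(scores)
    print(f"{name}: listed {listed}, computed {computed:.2f}")
```

Under this reading, a model with a single reported benchmark simply inherits that score as its index, which is why most rows show identical values in the Agents idx and τ²-bench columns.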