298 models in catalog

AI models

Every model we track: frontier flagships, open-weights specialists, and narrow benchmark-only releases. Filter by lab, country, access, modality, or release window. Models are sorted newest first by default; head to the leaderboard for the ranked view.
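The filter-then-sort behaviour described above can be sketched in a few lines. This is only an illustration, not the site's implementation: `ModelEntry` and `filter_models` are hypothetical names, the release window is reduced to a minimum year because only the year is shown in the listing, and the country filter is left out because that column is empty in the rows shown here. The sample entries are copied from the table.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ModelEntry:
    # Hypothetical record type; field names are assumptions.
    name: str
    lab: str
    access: Optional[str]   # None where the table shows no access value
    modality: str           # "llm" or "multimodal" in the listing
    released: int           # release year, the only granularity shown


# A few rows copied from the table.
CATALOG = [
    ModelEntry("Phi 4", "Microsoft", "Open weights", "llm", 2025),
    ModelEntry("Claude 3.5 Haiku", "Anthropic", "API only", "llm", 2024),
    ModelEntry("Gemini 2.0 Flash", "Google", "API only", "multimodal", 2024),
    ModelEntry("Sarvam 30B", "Sarvam", None, "llm", 2026),
]


def filter_models(
    models: List[ModelEntry],
    lab: Optional[str] = None,
    access: Optional[str] = None,
    modality: Optional[str] = None,
    released_from: Optional[int] = None,
) -> List[ModelEntry]:
    """Keep entries matching every filter that was supplied, newest first."""
    kept = []
    for m in models:
        if lab is not None and m.lab != lab:
            continue
        if access is not None and m.access != access:
            continue
        if modality is not None and m.modality != modality:
            continue
        if released_from is not None and m.released < released_from:
            continue
        kept.append(m)
    # Default ordering: newest release first.
    return sorted(kept, key=lambda m: m.released, reverse=True)


if __name__ == "__main__":
    for entry in filter_models(CATALOG, access="API only"):
        print(entry.name, entry.released)
```

Filters left as `None` are simply skipped, so calling `filter_models(CATALOG)` with no arguments returns the whole catalog in the default newest-first order.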


Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter


Rank	Model	Lab	Reasoning index ↓	BIG-Bench Hard	ARC-AGI-2	DROP	GPQA Diamond	Released	Country	Type	Access	Params	Cutoff	Context	Speed	Latency	In $/M	Out $/M
#176	Qwen3 30B A3B 2507 Instruct	Alibaba	65.9	—	—	—	65.9	2025	—	llm	—	—	—	—	122	1.25	$0.20	$0.40
#177	Qwen3 30B A3B	Alibaba	65.8	—	—	—	65.8	2025	—	llm	Open weights	—	2025	131K	122	0.66	$0.09	$0.45
#178	Phi 4	Microsoft	65.8	—	—	75.5	56.1	2025	—	llm	Open weights	—	2024	16K	33	0.20	$0.07	$0.14
#179	Solar Open 100B	Upstage	65.7	—	—	—	65.7	2025	—	llm	—	—	—	—	—	—	$0.00	$0.00
#180	Ling-flash-2.0	InclusionAI	65.7	—	—	—	65.7	2025	—	llm	—	—	—	—	91	1.61	$0.10	$0.60
#181	QwQ-32B	Alibaba	65.2	—	—	—	65.2	2025	—	llm	Open weights	32.5B	2024	—	31	0.45	$0.70	$1.00
#182	DeepSeek R1 Distill Llama 70B	DeepSeek	65.2	—	—	—	65.2	2025	—	llm	Open weights	70.6B	—	128K	37	0.65	$0.10	$0.40
#183	GPT-4.1 Mini	OpenAI	65.0	—	—	—	65.0	2025	—	multimodal	API only	—	2024	1M	150	5.00	$0.40	$1.60
#184	Gemini 2.5 Flash Lite	Google	64.6	—	—	—	64.6	2025	—	multimodal	API only	—	2025	1M	6	0.44	$0.10	$0.40
#185	Magistral Small 1	Mistral AI	64.1	—	—	—	64.1	2025	—	llm	—	—	—	—	—	—	$0.00	$0.00
#186	LongCat Flash Lite	LongCat	63.6	—	—	—	63.6	2026	—	llm	—	—	—	—	110	5.59	$0.00	$0.00
#187	Sarvam 30B	Sarvam	63.3	—	—	—	63.3	2026	—	llm	—	—	—	—	214	1.17	$0.00	$0.00
#188	Claude 3.5 Haiku	Anthropic	62.4	—	—	83.1	41.6	2024	—	llm	API only	—	2024	200K	104	0.30	$0.80	$4.00
#189	Gemini 2.0 Flash	Google	62.1	—	—	—	62.1	2024	—	multimodal	API only	—	2024	1M	183	0.40	$0.10	$0.40
#190	Qwen3 Omni 30B A3B Instruct	Alibaba	62.0	—	—	—	62.0	2025	—	llm	—	—	—	—	103	1.04	$0.30	$1.00
#191	Qwen3 Coder 480B A35B Instruct	Alibaba	61.8	—	—	—	61.8	2025	—	llm	—	—	—	—	69	1.68	$0.30	$1.80
#192	Claude 3 Haiku	Anthropic	61.8	73.7	—	78.4	33.3	2024	—	multimodal	API only	—	2023	200K	104	0.40	$0.25	$1.25
#193	Gemini 3 Pro	Google	61.5	—	31.1	—	91.9	2025	—	multimodal	API only	—	—	1M	141	27.49	$2.00	$12.00
#194	HyperCLOVA X SEED Think	Naver	61.5	—	—	—	61.5	2025	—	llm	—	—	—	—	—	—	$0.00	$0.00
#195	DeepSeek R1 0528 Qwen3 8B	DeepSeek	61.2	—	—	—	61.2	2025	—	llm	—	—	—	—	—	—	$0.00	$0.00
#196	Olmo 3 32B Think	Allen Institute for AI	61.0	—	—	—	61.0	2025	—	llm	Open weights	—	—	66K	—	—	$0.15	$0.50
#197	Llama 3.1 70B Instruct	Meta	60.7	—	—	79.6	41.7	2024	—	llm	Open weights	—	2023	131K	1204	0.20	$0.40	$0.40
#198	Qwen3 14B	Alibaba	60.4	—	—	—	60.4	2025	—	llm	Open weights	—	2025	132K	62	1.01	$0.10	$0.24
#199	Tri-21B-Think	Trillion Labs	60.1	—	—	—	60.1	2026	—	llm	—	—	—	—	—	—	$0.00	$0.00
#200	Devstral 2	Mistral AI	59.4	—	—	—	59.4	2025	—	llm	Open weights	—	—	262K	51	0.64	$0.40	$2.00
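
The page does not state how the reasoning index is computed. In the rows above, though, it matches the plain average of whichever of the four benchmark scores are present, to within rounding. The sketch below checks that observation against a few rows copied from the table. It is an inference from the displayed numbers, not a documented formula, and `reasoning_index` is a hypothetical helper name.

```python
from typing import Dict, Optional, Sequence, Tuple


def reasoning_index(scores: Sequence[Optional[float]]) -> Optional[float]:
    """Mean of the scores that are present; None when every score is missing.

    Assumption: missing benchmarks are skipped rather than counted as zero.
    """
    present = [s for s in scores if s is not None]
    if not present:
        return None
    return sum(present) / len(present)


# (BIG-Bench Hard, ARC-AGI-2, DROP, GPQA Diamond) and the listed index,
# copied from the table; None stands for an empty cell.
ROWS: Dict[str, Tuple[Tuple[Optional[float], ...], float]] = {
    "Phi 4": ((None, None, 75.5, 56.1), 65.8),
    "QwQ-32B": ((None, None, None, 65.2), 65.2),
    "Claude 3 Haiku": ((73.7, None, 78.4, 33.3), 61.8),
    "Gemini 3 Pro": ((None, 31.1, None, 91.9), 61.5),
}

if __name__ == "__main__":
    for name, (scores, listed) in ROWS.items():
        computed = reasoning_index(scores)
        print(f"{name}: computed {computed:.2f}, listed {listed}")
```

If the index is indeed an average of available scores, that would explain why Gemini 3 Pro ranks #193 despite a 91.9 on GPQA Diamond: its ARC-AGI-2 score of 31.1 pulls the mean down, while most neighbouring rows are scored on a single benchmark.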

Ranked by reasoning index. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations; see each model for sources.
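
One way to produce per-column colors of the kind described above is to rescale each column to the range 0 to 1 and interpolate from red through yellow to green. The sketch below assumes min-max scaling and three arbitrary RGB anchor colors. The page does not say which scaling or palette it uses, so `column_colors`, the palette, and the `higher_is_better` flag are all illustrative.

```python
from typing import List, Optional, Sequence, Tuple

RGB = Tuple[int, int, int]

# Assumed anchor colors; the site's real palette is not given.
RED: RGB = (220, 53, 69)
YELLOW: RGB = (255, 193, 7)
GREEN: RGB = (40, 167, 69)


def _lerp(a: RGB, b: RGB, t: float) -> RGB:
    """Linear interpolation between two RGB colors, t in [0, 1]."""
    return tuple(round(x + (y - x) * t) for x, y in zip(a, b))


def column_colors(
    values: Sequence[Optional[float]],
    higher_is_better: bool = True,
) -> List[Optional[RGB]]:
    """Color each cell by its standing within the column; None stays uncolored."""
    present = [v for v in values if v is not None]
    if not present:
        return [None] * len(values)
    lo, hi = min(present), max(present)
    colors: List[Optional[RGB]] = []
    for v in values:
        if v is None:
            colors.append(None)
            continue
        # Min-max scale to [0, 1]; a constant column maps to the midpoint.
        t = 0.5 if hi == lo else (v - lo) / (hi - lo)
        if not higher_is_better:  # e.g. latency or price, where lower is better
            t = 1.0 - t
        if t < 0.5:
            colors.append(_lerp(RED, YELLOW, t / 0.5))
        else:
            colors.append(_lerp(YELLOW, GREEN, (t - 0.5) / 0.5))
    return colors


if __name__ == "__main__":
    # DROP scores from the table, plus one empty cell.
    drop = [75.5, 83.1, 78.4, 79.6, None]
    for value, color in zip(drop, column_colors(drop)):
        print(value, color)
```

Because the scaling is relative to each column, a green cell only means "high among the models listed in that column", not "high in absolute terms".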