298 models in catalog
AI models
Every model we track: frontier flagships, open-weights specialists, and narrow benchmark-only releases. Filter by lab, country, access, modality, or release window. The catalog is sorted newest-first by default; head to the leaderboard for the ranked view.
Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter
| Rank | Model | Reason idx ↓ | BIG-Bench Hard | ARC-AGI-2 | DROP | GPQA Diamond | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #126 | — | 73.3 | — | — | — | 73.3 | 2025 | — | llm | — | — | — | — | 93 | 1.26 | $0.70 | $8.40 |
| #127 | — | 73.1 | 86.9 | — | 85.4 | 46.9 | 2024 | — | multimodal | API only | — | — | 300K | 100 | 0.50 | $0.80 | $3.20 |
| #128 | — | 73 | — | — | — | 73 | 2025 | — | llm | API only | — | 2025 | 200K | 100 | 0.30 | $1.00 | $5.00 |
| #129 | — | 72.9 | — | 58.3 | — | 87.5 | 2026 | — | llm | API only | — | — | 1M | 75 | 1.13 | $3.00 | $15.00 |
| #130 | — | 72.9 | — | — | — | 72.9 | 2025 | — | llm | Open weights | — | 2025 | 262K | 161 | 1.14 | $0.09 | $1.10 |
| #131 | — | 72.7 | — | 52.9 | — | 92.4 | 2025 | — | llm | API only | — | — | 400K | 73 | 0.69 | $1.75 | $14.00 |
| #132 | — | 72.7 | — | — | — | 72.7 | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #133 | — | 72.7 | — | — | — | 72.7 | 2025 | — | llm | — | — | — | — | 34 | 0.74 | $1.00 | $3.00 |
| #134 | — | 72.6 | — | — | — | 72.6 | 2025 | — | llm | — | — | — | — | 37 | 1.81 | $0.20 | $0.60 |
| #135 | — | 72.6 | — | — | — | 72.6 | 2025 | — | llm | — | — | — | — | 102 | 1.05 | $0.30 | $1.00 |
| #136 | InclusionAI | 72.5 | — | — | — | 72.5 | 2025 | — | llm | — | — | — | — | — | — | $0.10 | $0.60 |
| #137 | — | 72.4 | — | — | — | 72.4 | 2026 | — | llm | API only | — | — | 128K | — | — | $0.15 | $0.60 |
| #138 | Korea Telecom | 72.2 | — | — | — | 72.2 | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #139 | — | 72 | — | — | — | 72 | 2025 | — | llm | — | — | — | — | 122 | 1.14 | $0.20 | $0.80 |
| #140 | — | 71.9 | — | — | — | 71.9 | 2025 | — | multimodal | Open weights | — | — | 131K | 44 | 1.31 | $0.30 | $0.90 |
| #141 | InclusionAI | 71.9 | — | — | — | 71.9 | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #142 | — | 71.5 | — | — | — | 71.5 | 2025 | — | llm | Open weights | 21B (3.6B active) | 2024 | 131K | 1000 | 0.38 | $0.03 | $0.14 |
| #143 | — | 71.5 | — | — | — | 71.5 | 2025 | — | llm | Open weights | 671B total / 37B active (MoE) | — | 128K | 189 | 0.07 | $0.55 | $2.19 |
| #144 | — | 71.4 | — | — | — | 71.4 | 2025 | — | multimodal | API only | — | — | 128K | 50 | 20.00 | $75.00 | $150.00 |
| #145 | ServiceNow | 71.3 | — | — | — | 71.3 | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #146 | MBZUAI Institute of Foundation Models | 71.3 | — | — | — | 71.3 | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #147 | — | 71.2 | — | — | — | 71.2 | 2025 | — | llm | API only | — | 2024 | 400K | 500 | 0.30 | $0.05 | $0.40 |
| #148 | — | 71.2 | — | — | — | 71.2 | 2025 | — | multimodal | Open weights | — | 2025 | 262K | 51 | 1.20 | $0.20 | $0.88 |
| #149 | — | 70.7 | — | — | — | 70.7 | 2025 | — | llm | — | — | — | — | 151 | 1.18 | $0.30 | $1.90 |
| #150 | — | 70.1 | — | — | — | 70.1 | 2024 | — | multimodal | API only | — | 2023 | 128K | 132 | 0.50 | $2.50 | $10.00 |
Ranked on Reasoning. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations; see each model's page for sources.
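The page does not say how the Reason idx column is computed. The rows shown are consistent with a plain average of whichever benchmark scores are available for a model, so the sketch below works under that assumption only; the function name `reasoning_index` and the dictionary layout are hypothetical, and the catalog's actual methodology may differ.

```python
from statistics import mean
from typing import Optional


def reasoning_index(scores: dict[str, Optional[float]]) -> Optional[float]:
    """Average the benchmark scores that are present.

    ASSUMPTION: the index is an unweighted mean of available benchmarks.
    This is inferred from the table rows, not stated by the source.
    Returns None when every benchmark is missing.
    """
    available = [s for s in scores.values() if s is not None]
    return mean(available) if available else None


# Benchmark values copied from rows #127 and #129; None stands for a "—" cell.
row_127 = {"BIG-Bench Hard": 86.9, "ARC-AGI-2": None, "DROP": 85.4, "GPQA Diamond": 46.9}
row_129 = {"BIG-Bench Hard": None, "ARC-AGI-2": 58.3, "DROP": None, "GPQA Diamond": 87.5}

print(reasoning_index(row_127))
print(reasoning_index(row_129))
```

Under this assumption, row #127 averages to roughly 73.07 against a listed 73.1, and row #129 to 72.9 against a listed 72.9. Rows with only a GPQA Diamond score show an index equal to that score, which fits the same pattern.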