298 models in catalog
AI models
Every model we track: frontier flagships, open-weights specialists, and narrow benchmark-only releases. Filter by lab, country, access, modality, or release window. Results are sorted newest-first by default; head to the leaderboard for the ranked view.
Updated May 29, 2026 · Benchmarks via Artificial Analysis; specs and pricing via OpenRouter
| Rank | Model | General index | Multi-IF | LiveBench | Arena Hard | Humanity’s Last Exam | IFEval | SimpleQA | MMLU-Pro | MMLU | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #1 |  | 91.1 | — | — | — | 19.8 | — | 97.1 | 85 | — | 2025 | — | llm | Open weights | — | 2025 | 164K | 100 | 0.70 | $0.27 | $0.41 |
| #2 |  | 90 | — | — | — | 20 | — | 95 | 85 | — | 2025 | — | llm | API only | — | — | 2M | 90 | — | $0.20 | $0.50 |
| #3 |  | 89.8 | — | — | — | 37.5 | — | — | 89.8 | — | 2025 | — | multimodal | API only | — | — | 1M | 141 | 27.49 | $2.00 | $12.00 |
| #4 |  | 89.5 | — | — | — | 28.4 | — | — | 89.5 | — | 2025 | — | llm | API only | — | — | 200K | 58 | 1.50 | $5.00 | $25.00 |
| #5 |  | 89 | — | — | — | 34.7 | — | — | 89 | — | 2025 | — | multimodal | API only | — | — | 1M | 191 | 1.05 | $0.50 | $3.00 |
| #6 |  | 88.7 | — | — | — | 17.7 | — | 92.3 | 85 | — | 2025 | — | llm | Open weights | 671B | — | 131K | 45 | 0.30 | $0.55 | $2.19 |
| #7 |  | 88.6 | — | — | — | 15.9 | — | 93.4 | 83.7 | — | 2025 | — | llm | Open weights | 671B (37B active) | 2025 | 164K | — | — | $0.21 | $0.79 |
| #8 |  | 88.5 | — | — | — | 10.3 | 93.2 | — | 83.7 | 86.1 | 2025 | — | llm | API only | — | — | 200K | 101 | 0.40 | $3.00 | $15.00 |
| #9 |  | 88 | — | — | — | 11.9 | — | — | 88 | — | 2025 | — | llm | API only | — | 2025 | 200K | 120 | 0.40 | $15.00 | $75.00 |
| #10 |  | 87.5 | — | — | — | 35.9 | — | — | 87.5 | — | 2026 | — | llm | Open weights | 1.6T (49B active) | — | 1M | 30 | 1.16 | $0.44 | $0.87 |
| #11 |  | 87.5 | — | — | — | 17.3 | — | — | 87.5 | — | 2025 | — | llm | API only | — | 2025 | 1M | 42 | 0.40 | $3.00 | $15.00 |
| #12 |  | 87.5 | — | — | — | 22.2 | — | — | 87.5 | — | 2025 | — | llm | Open weights | — | — | 205K | 92 | 1.14 | $0.29 | $0.95 |
| #13 |  | 87.4 | — | — | — | 35.4 | — | — | 87.4 | — | 2025 | — | llm | API only | — | — | 400K | 73 | 0.69 | $1.75 | $14.00 |
| #14 |  | 87.3 | — | — | — | 11.7 | — | — | 87.3 | 88.8 | 2025 | — | llm | API only | — | 2025 | 200K | 120 | 0.40 | $15.00 | $75.00 |
| #15 |  | 87.1 | — | — | — | 24.8 | — | — | 87.1 | 92.5 | 2025 | — | llm | API only | — | 2024 | 400K | 100 | 2.00 | $1.25 | $10.00 |
| #16 |  | 87 | — | — | — | 26.5 | — | — | 87 | — | 2025 | — | llm | API only | — | — | 400K | 115 | 0.77 | $1.25 | $10.00 |
| #17 |  | 86.6 | — | — | — | 40 | — | — | 86.6 | — | 2025 | — | llm | API only | — | 2024 | 256K | 100 | 0.70 | $3.00 | $15.00 |
| #18 |  | 86.5 | — | — | — | 25.6 | — | — | 86.5 | — | 2025 | — | multimodal | API only | — | 2024 | 400K | 180 | 6.64 | $1.25 | $10.00 |
| #19 |  | 86.3 | — | — | — | 26.1 | — | — | 86.3 | — | 2025 | — | llm | Open weights | — | — | 164K | — | — | $0.29 | $0.43 |
| #20 |  | 86.2 | — | — | — | 22.2 | — | — | 86.2 | — | 2025 | — | llm | Open weights | 671B (37B active) | — | 131K | — | — | $0.25 | $0.38 |
| #21 |  | 86 | — | — | — | 23.4 | — | — | 86 | — | 2025 | — | multimodal | API only | — | — | 400K | 188 | 4.16 | $1.25 | $10.00 |
| #22 |  | 86 | — | — | — | 8.1 | 89.5 | — | 82.5 | — | 2025 | — | llm | Open weights | 253B | 2023 | — | 42 | 0.72 | $0.60 | $1.80 |
| #23 |  | 85.6 | — | — | — | 25.1 | — | — | 85.6 | — | 2025 | — | llm | Open weights | — | — | 203K | 98 | 0.83 | $0.40 | $1.75 |
| #24 |  | 85.4 | — | — | — | 17.6 | — | — | 85.4 | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #25 |  | 85.4 | — | — | — | 13.3 | — | — | 85.4 | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
Ranked on the General index. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations; see each model's page for sources.
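The page does not say how the cell colors are computed. As a rough illustration only, the sketch below assumes min-max scaling within a column followed by a linear red → yellow → green ramp; the function names `relative_standing` and `standing_to_rgb` are hypothetical, and missing cells (shown as "—" in the table) are passed as `None`.

```python
from typing import List, Optional, Tuple


def relative_standing(values: List[Optional[float]]) -> List[Optional[float]]:
    """Min-max scale the present values of one column to [0, 1].

    Assumption: higher is better. Missing cells (None) stay None.
    """
    present = [v for v in values if v is not None]
    if not present:
        return [None] * len(values)
    lo, hi = min(present), max(present)
    span = hi - lo
    scaled = []
    for v in values:
        if v is None:
            scaled.append(None)
        elif span == 0:
            scaled.append(0.5)  # all values equal: put them mid-scale
        else:
            scaled.append((v - lo) / span)
    return scaled


def standing_to_rgb(t: float) -> Tuple[int, int, int]:
    """Map t in [0, 1] onto a red -> yellow -> green ramp."""
    t = min(1.0, max(0.0, t))
    if t <= 0.5:
        # First half: red (255, 0, 0) to yellow (255, 255, 0).
        return (255, round(510 * t), 0)
    # Second half: yellow (255, 255, 0) to green (0, 255, 0).
    return (round(255 * (1 - (t - 0.5) * 2)), 255, 0)


# Humanity's Last Exam scores from the first three rows, plus one missing cell.
column = [19.8, 20.0, 37.5, None]
colors = [None if t is None else standing_to_rgb(t)
          for t in relative_standing(column)]
```

For columns where lower is better (latency or price, for example), the scaled value would presumably be inverted (`1 - t`) before mapping to a color; that too is an assumption, not something the page states.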