298 models in catalog
AI models
Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.
Leaderboard →Labs →Benchmarks →
Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?
| Rank | Model | Reason idx ↓ | BIG-Bench Hard | ARC-AGI-2 | DROP | GPQA Diamond | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #1 | 94.2 | — | — | — | 94.2 | 2026 | — | llm | API only | — | — | 1M | 49 | 1.42 | $5.00 | $25.00 | |
| #2 | 93.5 | — | — | — | 93.5 | 2026 | — | llm | API only | — | 2025 | 1.1M | 67 | 0.97 | $5.00 | $30.00 | |
| #3 | Qwen3.7 MaxNew | 92.3 | — | — | — | 92.3 | 2026 | — | llm | API only | — | — | 1M | 203 | 1.59 | $1.25 | $3.75 |
| #4 | 92.2 | — | — | — | 92.2 | 2026 | — | multimodal | API only | — | 2025 | 1M | 221 | 9.75 | $1.50 | $9.00 | |
| #5 | 92 | — | — | — | 92 | 2026 | — | llm | API only | — | — | 1M | 66 | 6.54 | $5.00 | $25.00 | |
| #6 | 92 | — | — | — | 92 | 2026 | — | llm | API only | — | — | 1.1M | 84 | 0.63 | $2.50 | $15.00 | |
| #7 | 91.5 | — | — | — | 91.5 | 2026 | — | multimodal | API only | — | — | 400K | 73 | 81.08 | $1.75 | $14.00 | |
| #8 | 91.1 | — | — | — | 91.1 | 2026 | — | llm | Open weights | 1T (32B active) | — | 262K | 57 | 1.20 | $0.73 | $3.49 | |
| #9 | 91.1 | — | — | — | 91.1 | 2026 | — | llm | — | — | — | — | 105 | 0.70 | $2.00 | $6.00 | |
| #10 | 90.4 | — | — | — | 90.4 | 2025 | — | multimodal | API only | — | — | 1M | 191 | 1.05 | $0.50 | $3.00 | |
| #11 | 90.1 | — | — | — | 90.1 | 2026 | — | llm | Open weights | 1.6T (49B active) | — | 1M | 30 | 1.16 | $0.44 | $0.87 | |
| #12 | Grok 4.3New | 90.1 | — | — | — | 90.1 | 2026 | — | llm | API only | — | — | 1M | 88 | 0.52 | $1.25 | $2.50 |
| #13 | 89.9 | — | — | — | 89.9 | 2026 | — | multimodal | API only | — | — | 400K | 106 | 2.08 | $1.75 | $14.00 | |
| #14 | 89.4 | — | — | — | 89.4 | 2026 | — | llm | Open weights | 284B (13B active) | — | 1M | 109 | 0.76 | $0.10 | $0.20 | |
| #15 | 89.3 | — | — | — | 89.3 | 2026 | — | multimodal | Open weights | — | — | 262K | 53 | 1.82 | $0.39 | $2.34 | |
| #16 | 88.8 | — | — | — | 88.8 | 2026 | — | llm | API only | — | — | 262K | 36 | 2.79 | $1.04 | $6.24 | |
| #17 | 88.5 | — | — | — | 88.5 | 2026 | — | llm | — | — | — | — | 97 | 0.62 | $2.00 | $6.00 | |
| #18 | 88.4 | — | — | — | 88.4 | 2026 | — | multimodal | API only | — | — | — | — | — | $0.00 | $0.00 | |
| #19 | 88.2 | — | — | — | 88.2 | 2026 | — | multimodal | API only | — | — | 1M | 52 | 1.73 | $0.33 | $1.95 | |
| #20 | 88.1 | — | — | — | 88.1 | 2025 | — | llm | API only | — | — | 400K | 115 | 0.77 | $1.25 | $10.00 | |
| #21 | 87.9 | — | — | — | 87.9 | 2026 | — | multimodal | Open weights | 1T (32B active) | — | 262K | 35 | 1.33 | $0.40 | $1.90 | |
| #22 | 87.5 | — | — | — | 87.5 | 2026 | — | llm | API only | — | 2025 | 400K | 162 | 0.63 | $0.75 | $4.50 | |
| #23 | 87.4 | — | — | — | 87.4 | 2026 | — | llm | Open weights | — | — | 205K | 50 | 1.32 | $0.28 | $1.20 | |
| #24 | 87.3 | — | — | — | 87.3 | 2025 | — | llm | API only | — | 2024 | 400K | 100 | 2.00 | $1.25 | $10.00 | |
| #25 | 87.1 | — | — | — | 87.1 | 2025 | — | llm | Open weights | — | — | 164K | — | — | $0.29 | $0.43 |
Ranked on Reasoning. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.