298 models in catalog
AI models
Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.
Leaderboard →Labs →Benchmarks →
Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?
| Rank | Model | Reason idx ↓ | BIG-Bench Hard | ARC-AGI-2 | DROP | GPQA Diamond | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #51 | 84.6 | — | — | — | 84.6 | 2025 | — | multimodal | API only | — | 2024 | 128K | 100 | 0.70 | $3.00 | $15.00 | |
| #52 | 84.5 | — | — | — | 84.5 | 2025 | — | llm | Open weights | 1T (32B active) | — | 262K | 100 | 1.00 | $0.60 | $2.50 | |
| #53 | 84.5 | — | — | — | 84.5 | 2026 | — | multimodal | Open weights | — | — | 262K | 121 | 1.07 | $0.14 | $1.00 | |
| #54 | 84.2 | — | — | — | 84.2 | 2026 | — | multimodal | Open weights | — | — | 262K | 64 | 1.40 | $0.29 | $3.20 | |
| #55 | 84.1 | — | — | — | 84.1 | 2026 | — | multimodal | Open weights | — | — | 262K | 169 | 1.47 | $0.14 | $1.00 | |
| #56 | 84 | — | — | — | 84 | 2025 | — | llm | Open weights | 671B (37B active) | — | 131K | — | — | $0.25 | $0.38 | |
| #57 | 83.7 | — | — | — | 83.7 | 2025 | — | multimodal | API only | — | 2024 | 400K | 180 | 6.64 | $1.25 | $10.00 | |
| #58 | 83.4 | — | — | — | 83.4 | 2025 | — | llm | API only | — | 2025 | 1M | 42 | 0.40 | $3.00 | $15.00 | |
| #59 | StepFun | 83.1 | — | — | — | 83.1 | 2026 | — | llm | Open weights | — | — | 262K | 194 | 0.85 | $0.09 | $0.30 |
| #60 | 83 | — | — | — | 83 | 2025 | — | llm | Open weights | — | — | 205K | 92 | 1.14 | $0.29 | $0.95 | |
| #61 | JT-35B-FlashNew China Mobile | 82.9 | — | — | — | 82.9 | 2026 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #62 | 82.8 | — | — | — | 82.8 | 2026 | — | multimodal | API only | — | — | 262K | 108 | 1.36 | $0.40 | $2.00 | |
| #63 | 82.8 | — | — | — | 82.8 | 2025 | — | multimodal | API only | — | 2025 | 1M | 85 | 0.70 | $0.30 | $2.50 | |
| #64 | StepFun | 82.6 | — | — | — | 82.6 | 2026 | — | llm | — | — | — | — | 197 | 0.90 | $0.00 | $0.00 |
| #65 | 82.6 | — | — | — | 82.6 | 2026 | — | llm | — | — | — | — | 54 | 1.28 | $0.40 | $4.80 | |
| #66 | 82.3 | — | — | — | 82.3 | 2025 | — | llm | API only | — | 2024 | 400K | 200 | 1.00 | $0.25 | $2.00 | |
| #67 | 82.2 | — | — | — | 82.2 | 2026 | — | multimodal | API only | — | — | 1M | 342 | 5.35 | $0.25 | $1.50 | |
| #68 | 81.7 | — | — | — | 81.7 | 2026 | — | llm | API only | — | 2025 | 400K | 157 | 0.55 | $0.20 | $1.25 | |
| #69 | 81.4 | — | — | — | 81.4 | 2025 | — | multimodal | API only | — | 2024 | 200K | 115 | 5.20 | $1.10 | $4.40 | |
| #70 | 81.3 | — | — | — | 81.3 | 2025 | — | multimodal | API only | — | — | 400K | 175 | 9.50 | $0.25 | $2.00 | |
| #71 | 81.1 | — | — | — | 81.1 | 2025 | — | multimodal | API only | — | — | 1M | 229 | 0.89 | $0.30 | $2.50 | |
| #72 | 81.1 | — | — | — | 81.1 | 2025 | — | llm | Open weights | — | 2025 | 131K | 24 | 1.53 | $0.28 | $1.10 | |
| #73 | 81 | — | — | — | 81 | 2025 | — | llm | Open weights | 357B (MoE) | 2025 | 203K | 85 | 0.70 | $0.43 | $1.74 | |
| #74 | 81 | — | — | — | 81 | 2025 | — | llm | Open weights | 671000000000 | — | 131K | 45 | 0.30 | $0.55 | $2.19 | |
| #75 | 80.9 | — | — | — | 80.9 | 2026 | — | multimodal | API only | — | — | 203K | — | — | $1.20 | $4.00 |
Ranked on Reasoning. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.