298 models in catalog
AI models
Every model we track: frontier flagships, open-weights specialists, and narrow benchmark-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.
Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter
| Rank | Model | Agents idx ↓ | τ²-bench | BFCL | τ²-bench Airline | τ²-bench Retail | BrowseComp | TAU-bench Airline | TAU-bench Retail | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #51 | — | 89.5 | 89.5 | — | — | — | — | — | — | 2026 | — | llm | API only | — | — | 256K | 108 | 1.36 | $0.30 | $1.20 |
| #52 | — | 89.2 | 89.2 | — | — | — | — | — | — | 2026 | — | multimodal | Open weights | — | — | 262K | 121 | 1.07 | $0.14 | $1.00 |
| #53 | — | 88.6 | 88.6 | — | — | — | — | — | — | 2026 | — | llm | API only | — | — | 1M | 49 | 1.42 | $5.00 | $25.00 |
| #54 | — | 88.6 | 88.6 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | 108 | 2.19 | $0.30 | $1.20 |
| #55 | — | 88.3 | 88.3 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | 54 | 1.28 | $0.40 | $4.80 |
| #56 | — | 88 | 88 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | 110 | 1.51 | $0.40 | $2.00 |
| #57 | OpenBMB | 87.7 | 87.7 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #58 | StepFun | 87.4 | 87.4 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | 197 | 0.90 | $0.00 | $0.00 |
| #59 | Naver | 87.4 | 87.4 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #60 | — | 87.1 | 87.1 | — | — | — | — | — | — | 2026 | — | llm | API only | — | — | 1.1M | 84 | 0.63 | $2.50 | $15.00 |
| #61 | — | 87.1 | 87.1 | — | — | — | — | — | — | 2025 | — | multimodal | API only | — | — | 1M | 141 | 27.49 | $2.00 | $12.00 |
| #62 | — | 86.8 | 86.8 | — | — | — | — | — | — | 2025 | — | multimodal | API only | — | 2024 | 400K | 180 | 6.64 | $1.25 | $10.00 |
| #63 | — | 86.8 | 86.8 | — | — | — | — | — | — | 2025 | — | llm | Open weights | 230B (10B active) | — | 205K | 91 | 1.19 | $0.26 | $1.00 |
| #64 | — | 86.8 | 86.8 | — | — | — | — | — | — | 2026 | — | multimodal | Open weights | — | — | 262K | 51 | 0.33 | $0.04 | $0.15 |
| #65 | Korea Telecom | 86.5 | 86.5 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #66 | — | 86.3 | 86.3 | — | — | — | — | — | — | 2026 | — | llm | API only | — | — | 128K | — | — | $0.15 | $0.60 |
| #67 | — | 86 | 86 | — | — | — | — | — | — | 2026 | — | multimodal | API only | — | — | 400K | 73 | 81.08 | $1.75 | $14.00 |
| #68 | InclusionAI | 86 | 86 | — | — | — | — | — | — | 2026 | — | llm | API only | — | — | 262K | — | — | $0.01 | $0.03 |
| #69 | — | 85.4 | 85.4 | — | — | — | — | — | — | 2025 | — | llm | Open weights | — | — | 205K | 92 | 1.14 | $0.29 | $0.95 |
| #70 | — | 84.8 | 84.8 | — | — | — | — | — | — | 2025 | — | llm | API only | — | — | 400K | 73 | 0.69 | $1.75 | $14.00 |
| #71 | — | 84.8 | 84.8 | — | — | — | — | — | — | 2026 | — | llm | Open weights | — | — | 205K | 50 | 1.32 | $0.28 | $1.20 |
| #72 | — | 84.5 | 84.5 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | 235 | 0.99 | $0.10 | $0.80 |
| #73 | — | 83.9 | 83.9 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #74 | — | 83.6 | 83.6 | — | — | — | — | — | — | 2026 | — | llm | API only | — | — | 262K | 45 | 1.47 | $0.78 | $3.90 |
| #75 | — | 83.3 | 83.3 | — | — | — | — | — | — | 2026 | — | llm | API only | — | 2025 | 400K | 162 | 0.63 | $0.75 | $4.50 |
Ranked on the Agents index (the column marked ↓). Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations; see each model for sources.
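One way to read "relative standing within each column" is min-max scaling followed by a three-way bucket. The sketch below is an assumption for illustration, not the site's documented method: the function names `relative_standing` and `standing_color` and the one-third thresholds are hypothetical. The sample values are the Agents index scores at ranks #51, #62, and #75, plus one missing cell.

```python
def relative_standing(values):
    """Scale each non-missing value to a 0.0-1.0 position within its column.

    Missing cells (shown as "—" in the table) are passed as None and stay None.
    """
    present = [v for v in values if v is not None]
    if not present:
        return [None] * len(values)
    lo, hi = min(present), max(present)
    span = hi - lo
    # If every value is identical, place them all mid-scale (assumed convention).
    return [
        None if v is None else (0.5 if span == 0 else (v - lo) / span)
        for v in values
    ]


def standing_color(position):
    """Bucket a 0.0-1.0 position into red, yellow, or green (assumed thresholds)."""
    if position is None:
        return None
    if position < 1 / 3:
        return "red"
    if position < 2 / 3:
        return "yellow"
    return "green"


# Agents index at ranks #51, #62, #75, plus one missing cell.
agents_idx = [89.5, 86.8, 83.3, None]
colors = [standing_color(p) for p in relative_standing(agents_idx)]
print(colors)
```

For columns where lower is better, such as Latency or the price columns, the position would presumably need to be inverted (`1 - position`) before bucketing; the page does not say how it handles this.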