298 models in catalog
AI models
Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.
Leaderboard →Labs →Benchmarks →
Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?
| Rank | Model | Agents idx ↓ | τ²-bench | BFCL | τ²-bench Airline | τ²-bench Retail | BrowseComp | TAU-bench Airline | TAU-bench Retail | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #126 | 52.4 | 43 | — | — | — | 26.4 | 60.4 | 79.7 | 2025 | — | llm | Open weights | 355B (32B active) | 2024 | 131K | 85 | 0.70 | $0.60 | $2.20 | |
| #127 | 52 | 52 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | 42 | 0.50 | $2.00 | $5.00 | |
| #128 | 51.6 | 46.5 | — | — | — | 21.3 | 60.8 | 77.9 | 2025 | — | llm | Open weights | — | 2024 | 131K | 63 | 1.68 | $0.13 | $0.85 | |
| #129 | 50.1 | 29.8 | 70.3 | — | — | — | — | — | 2025 | — | llm | Open weights | — | 2025 | 131K | 328 | 0.93 | $0.08 | $0.28 | |
| #130 | 50 | 15.2 | 84.8 | — | — | — | — | — | 2024 | — | llm | Open weights | — | 2023 | 131K | 1204 | 0.20 | $0.40 | $0.40 | |
| #131 | 49.5 | 33.3 | — | 44 | 71.3 | — | — | — | 2025 | — | llm | Open weights | 235000000000 | — | 131K | 63 | 1.18 | $0.15 | $0.80 | |
| #132 | 49.4 | 49.4 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | — | — | $5.00 | $30.00 | |
| #133 | 49.4 | 49.4 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | 37 | 1.81 | $0.20 | $0.60 | |
| #134 | 49 | 27.2 | 70.8 | — | — | — | — | — | 2025 | — | llm | Open weights | — | 2025 | 131K | 68 | 0.78 | $0.46 | $1.82 | |
| #135 | 48.8 | 48.8 | — | — | — | — | — | — | 2025 | — | multimodal | API only | — | 2024 | 128K | 100 | 0.70 | $3.00 | $15.00 | |
| #136 | 48.2 | 52.9 | — | — | — | — | 36 | 55.8 | 2025 | — | multimodal | API only | — | 2024 | 1M | 150 | 5.00 | $0.40 | $1.60 | |
| #137 | 48.2 | 28.9 | — | 45.5 | 63.4 | — | 42.8 | 60.3 | 2024 | — | multimodal | API only | — | 2023 | 128K | 132 | 0.50 | $2.50 | $10.00 | |
| #138 | 48.2 | 48.2 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 | |
| #139 | 47.6 | 26 | 69.1 | — | — | — | — | — | 2025 | — | llm | Open weights | — | 2025 | 131K | 122 | 0.66 | $0.09 | $0.45 | |
| #140 | 47.1 | 47.1 | — | — | — | — | — | — | 2025 | — | llm | Open weights | 671000000000 | — | 164K | — | — | $0.28 | $1.14 | |
| #141 | Sarvam | 46.8 | 46.8 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | 128 | 1.29 | $0.00 | $0.00 |
| #142 | Motif Technologies | 46.5 | 46.5 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #143 | 46.3 | 16.4 | 76.1 | — | — | — | — | — | 2024 | — | llm | Open weights | — | 2023 | 131K | 2047 | 0.20 | $0.02 | $0.05 | |
| #144 | 45.9 | 21.6 | — | 45.5 | 57.3 | — | 44 | 60.9 | 2025 | — | llm | Open weights | — | 2025 | 262K | 161 | 1.14 | $0.09 | $1.10 | |
| #145 | 45.6 | 45.6 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | 93 | 1.26 | $0.70 | $8.40 | |
| #146 | 45.3 | 45.3 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | 301 | 0.58 | $0.10 | $0.30 | |
| #147 | 43.6 | 43.6 | — | — | — | — | — | — | 2026 | — | multimodal | Open weights | — | — | 262K | 66 | 0.71 | $0.06 | $0.33 | |
| #148 | 43.6 | 43.6 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | 69 | 1.68 | $0.30 | $1.80 | |
| #149 | 42.1 | 42.1 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 | |
| #150 | 42.1 | 17.5 | 66.6 | — | — | — | — | — | 2024 | — | multimodal | API only | — | — | 300K | 100 | 0.50 | $0.06 | $0.24 |
Ranked on Agents. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.