298 models in catalog
AI models
Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.
Leaderboard →Labs →Benchmarks →
Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?
| Rank | Model | Agents idx ↓ | τ²-bench | BFCL | τ²-bench Airline | τ²-bench Retail | BrowseComp | TAU-bench Airline | TAU-bench Retail | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #151 | 41.5 | 41.5 | — | — | — | — | — | — | 2025 | — | llm | Open weights | 80B (3B active) | — | 262K | 147 | 1.14 | $0.50 | $6.00 | |
| #152 | 41.2 | 41.2 | — | — | — | — | — | — | 2026 | — | multimodal | Open weights | — | — | 262K | 145 | 0.51 | $0.15 | $0.60 | |
| #153 | 41.2 | 14 | 68.4 | — | — | — | — | — | 2024 | — | multimodal | API only | — | — | 300K | 100 | 0.50 | $0.80 | $3.20 | |
| #154 | 40.9 | 40.9 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | 148 | 0.30 | $0.10 | $0.20 | |
| #155 | 40.6 | 40.6 | — | — | — | — | — | — | 2025 | — | multimodal | API only | — | 2025 | 131K | 47 | 0.69 | $0.40 | $2.00 | |
| #156 | 40.4 | 31.3 | — | — | — | — | 32.4 | 57.6 | 2025 | — | llm | API only | — | 2023 | 200K | 115 | 5.20 | $1.10 | $4.40 | |
| #157 | 38.3 | 38.3 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | 40 | 1.31 | $2.50 | $12.50 | |
| #158 | 38 | 38 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | 190 | 0.42 | $0.10 | $0.30 | |
| #159 | 37.1 | 37.1 | — | — | — | — | — | — | 2025 | — | llm | Open weights | — | 2025 | 164K | — | — | $0.27 | $0.95 | |
| #160 | 37 | 33.9 | — | — | — | 40.1 | — | — | 2025 | — | llm | Open weights | — | 2025 | 164K | 100 | 0.70 | $0.27 | $0.41 | |
| #161 | 36.5 | 36.5 | — | — | — | — | — | — | 2025 | — | llm | API only | — | 2024 | 400K | 500 | 0.30 | $0.05 | $0.40 | |
| #162 | 36.5 | 36.5 | — | — | — | — | — | — | 2024 | — | multimodal | API only | — | 2024 | 131K | 0 | 0.50 | $2.00 | $6.00 | |
| #163 | 35.1 | 35.1 | — | — | — | — | — | — | 2025 | — | multimodal | Open weights | — | 2025 | 262K | 51 | 1.20 | $0.20 | $0.88 | |
| #164 | 35.1 | 14 | 56.2 | — | — | — | — | — | 2024 | — | llm | API only | — | — | 128K | 100 | 0.50 | $0.03 | $0.14 | |
| #165 | 34.5 | 34.5 | — | — | — | — | — | — | 2025 | — | llm | Open weights | — | 2025 | 160K | 97 | 1.49 | $0.07 | $0.27 | |
| #166 | 34.5 | 34.5 | — | — | — | — | — | — | 2024 | — | llm | Open weights | — | 2024 | 131K | 100 | 0.37 | $0.36 | $0.40 | |
| #167 | 34.5 | 34.5 | — | — | — | — | — | — | 2025 | — | llm | Open weights | — | 2025 | 132K | 62 | 1.01 | $0.10 | $0.24 | |
| #168 | Sarvam | 34.5 | 34.5 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | 214 | 1.17 | $0.00 | $0.00 |
| #169 | 34.2 | 34.2 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | — | — | $0.60 | $2.20 | |
| #170 | 33.7 | 37.4 | — | — | — | 30 | — | — | 2025 | — | llm | Open weights | 671B (37B active) | 2025 | 164K | — | — | $0.21 | $0.79 | |
| #171 | 33 | 33 | — | — | — | — | — | — | 2024 | — | llm | Open weights | 123B | — | 128K | 42 | 0.40 | $2.00 | $6.00 | |
| #172 | 32.8 | 24.6 | — | — | — | — | 22.8 | 51 | 2024 | — | llm | API only | — | 2024 | 200K | 104 | 0.30 | $0.80 | $4.00 | |
| #173 | InclusionAI | 32.7 | 32.7 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #174 | 31.9 | 31.9 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 | |
| #175 | 31.6 | 31.6 | — | — | — | — | — | — | 2025 | — | multimodal | API only | — | 2025 | 1M | 85 | 0.70 | $0.30 | $2.50 |
Ranked on Agents. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.