298 models in catalog
AI models
Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.
Leaderboard →Labs →Benchmarks →
Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?
| Rank | Model | Agents idx ↓ | τ²-bench | BFCL | τ²-bench Airline | τ²-bench Retail | BrowseComp | TAU-bench Airline | TAU-bench Retail | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #101 | 68.9 | 80.7 | — | 64.8 | 80.2 | 49.7 | — | — | 2025 | — | llm | API only | — | 2024 | 200K | 50 | 20.00 | $2.00 | $8.00 | |
| #102 | 68.4 | 64.6 | — | — | — | — | 60 | 80.5 | 2025 | — | llm | API only | — | 2025 | 1M | 101 | 0.40 | $3.00 | $15.00 | |
| #103 | ServiceNow | 68.4 | 68.4 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
| #104 | 67.8 | 67.8 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | 211 | 1.01 | $0.30 | $0.80 | |
| #105 | 67.2 | 54.7 | — | 63.6 | 83.2 | — | — | — | 2025 | — | llm | API only | — | 2025 | 200K | 100 | 0.30 | $1.00 | $5.00 | |
| #106 | 66.8 | 65.8 | — | — | — | — | — | 67.8 | 2025 | — | llm | Open weights | 117B (5.1B active) | 2024 | 131K | 500 | 0.50 | $0.04 | $0.18 | |
| #107 | 66.4 | — | 66.4 | — | — | — | — | — | 2025 | — | llm | Open weights | 32500000000 | 2024 | — | 31 | 0.45 | $0.70 | $1.00 | |
| #108 | 65.5 | 65.5 | — | — | — | — | — | — | 2026 | — | multimodal | Open weights | — | — | 262K | 36 | 0.79 | $0.12 | $0.37 | |
| #109 | 65.2 | 65.2 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | 120 | 0.23 | $0.00 | $0.10 | |
| #110 | 64.8 | 54.7 | — | — | — | — | 58.4 | 81.2 | 2025 | — | llm | API only | — | — | 200K | 101 | 0.40 | $3.00 | $15.00 | |
| #111 | 62.9 | 62.9 | — | — | — | — | — | — | 2025 | — | multimodal | API only | — | — | 400K | 175 | 9.50 | $0.25 | $2.00 | |
| #112 | 61.1 | 61.1 | — | — | — | — | — | — | 2025 | — | llm | Open weights | 1T (32B active) | 2024 | 131K | 26 | 1.51 | $0.57 | $2.30 | |
| #113 | 61.1 | 62.6 | — | — | — | — | 50 | 70.8 | 2024 | — | llm | API only | — | 2023 | 200K | 66 | 0.54 | $15.00 | $60.00 | |
| #114 | 61 | 76.9 | — | — | — | 45.1 | — | — | 2025 | — | llm | Open weights | 357B (MoE) | 2025 | 203K | 85 | 0.70 | $0.43 | $1.74 | |
| #115 | 59.2 | — | — | — | — | — | 50 | 68.4 | 2025 | — | multimodal | API only | — | — | 128K | 50 | 20.00 | $75.00 | $150.00 | |
| #116 | 58.2 | 58.2 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 | |
| #117 | 57.5 | 60.2 | — | — | — | — | — | 54.8 | 2025 | — | llm | Open weights | 21B (3.6B active) | 2024 | 131K | 1000 | 0.38 | $0.03 | $0.14 | |
| #118 | 57 | 55.6 | — | — | — | 51.5 | 49.2 | 71.8 | 2025 | — | multimodal | API only | — | 2024 | 200K | 115 | 5.20 | $1.10 | $4.40 | |
| #119 | 55.4 | 65.8 | — | — | — | 44.9 | — | — | 2025 | — | llm | API only | — | — | 2M | 90 | — | $0.20 | $0.50 | |
| #120 | 54.8 | 47.1 | — | — | — | — | 49.4 | 68 | 2025 | — | multimodal | API only | — | 2024 | 1M | 100 | 10.00 | $2.00 | $8.00 | |
| #121 | 54.1 | 54.1 | — | — | — | — | — | — | 2025 | — | multimodal | API only | — | 2025 | 1M | 85 | 0.70 | $1.25 | $10.00 | |
| #122 | 54.1 | 54.1 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | 34 | 1.75 | $0.80 | $6.20 | |
| #123 | 53.8 | 19 | 88.5 | — | — | — | — | — | 2024 | — | llm | Open weights | 405000000000 | — | 128K | 100 | 0.40 | $0.89 | $0.89 | |
| #124 | 53.2 | 53.2 | — | — | — | — | — | — | 2025 | — | llm | — | — | — | — | 59 | 1.21 | $0.40 | $2.20 | |
| #125 | 53.2 | 53.2 | — | — | — | — | — | — | 2026 | — | llm | — | — | — | — | — | — | $0.00 | $0.00 |
Ranked on Agents. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.