AI War Tracker
298 models in catalog

AI models

Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.

Leaderboard →Labs →Benchmarks →

Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?

RankModelMulti idxAI2DMMMU-ProChartQADocVQAMathVistaMMMUReleasedCountryTypeAccessParamsCutoffContextSpeedLatencyIn $/MOut $/M
OpenAI82.984.381.62025multimodalAPI only2024200K1155.20$1.10$4.40
OpenAI8276.486.882.92025llmAPI only2024200K5020.00$2.00$8.00
Mistral AI81.793.888.193.369.4642024multimodalAPI only2024131K00.50$2.00$6.00
Amazon81.589.293.561.72024multimodalAPI only300K1000.50$0.80$3.20
OpenAI81.378.484.22025llmAPI only2024400K1002.00$1.25$10.00
Meta80.888.894.470.769.42025multimodalOpen weights109B total / 17B active (MoE)202410M7760.31$0.08$0.30
#7Google79.779.72025multimodalAPI only20251M850.70$0.30$2.50
Index 41.9 = (44.3 + 47.8 + 43.8 + 31.6 / 4) — equal-weighted mean of 4 components.
General25%
44.3
  • SimpleQA26.9
  • AA-LCR61.7
  • LongBench-v2
  • IFBench
Reasoning25%
47.8
  • GPQA Diamond82.8
  • Humanity’s Last Exam12.7
  • FrontierMath
  • ARC-AGI-2
Coding25%
43.8
  • SWE-bench Verified60.4
  • Terminal-Bench13.6
  • Aider Polyglot61.9
  • SciCode39.4
Tool use & agents25%
31.6
  • TAU-bench Retail
  • τ²-bench31.6
  • BFCL
  • BrowseComp
Google79.679.62025multimodalAPI only20251M850.70$1.25$10.00
Amazon78.586.892.456.22024multimodalAPI only300K1000.50$0.06$0.24
Meta78.259.69094.473.773.42025multimodalOpen weights400B total / 17B active (MoE)20241M6390.20$0.15$0.60
xAI78782025multimodalAPI only2024128K1000.70$3.00$15.00
OpenAI77.794.259.985.792.861.472.22024multimodalAPI only2023128K1320.50$2.50$10.00
Anthropic77.377.32026llmAPI only1M481.65$5.00$25.00
Anthropic75752025llmAPI only200K1010.40$3.00$15.00
OpenAI74.771.877.62024llmAPI only2023200K660.54$15.00$60.00
Anthropic74.474.42025llmAPI only20251M1010.40$3.00$15.00
OpenAI73.872.375.22025multimodalAPI only128K5020.00$75.00$150.00
OpenAI73.572.274.82025multimodalAPI only20241M10010.00$2.00$8.00
OpenAI72.973.172.72025multimodalAPI only20241M1505.00$0.40$1.60
Google72.972.92025multimodalAPI only20251M60.44$0.10$0.40
Google70.770.72024multimodalAPI only20241M1830.40$0.10$0.40
Meta66.491.13383.488.451.550.72024multimodalOpen weights106000000002023128K1680.20$0.05$0.05
OpenAI55.856.255.42025multimodalAPI only20241M2002.00$0.10$0.40

Ranked on Multimodal. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.