AI War Tracker
298 models in catalog

AI models

Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.

Leaderboard →Labs →Benchmarks →

Updated May 30, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?

RankModelMath idxMATH-500FrontierMathHMMT 2025GSM8KMGSMAIME 2024AIME 2025MATHReleasedCountryTypeAccessParamsCutoffContextSpeedLatencyIn $/MOut $/M
#1OpenAI1001002025llmAPI only400K730.69$1.75$14.00
Index 69.4 = (72.7 + 60.2 + 59.7 + 84.8 / 4) — equal-weighted mean of 4 components.
General25%
72.7
  • SimpleQA
  • AA-LCR72.7
  • LongBench-v2
  • IFBench
Reasoning25%
60.2
  • GPQA Diamond92.4
  • Humanity’s Last Exam35.4
  • FrontierMath
  • ARC-AGI-252.9
Coding25%
59.7
  • SWE-bench Verified80
  • Terminal-Bench47
  • Aider Polyglot
  • SciCode52.1
Tool use & agents25%
84.8
  • TAU-bench Retail
  • τ²-bench84.8
  • BFCL
  • BrowseComp
OpenAI98.798.72025multimodalAPI only2024400K1806.64$1.25$10.00
Google97972025multimodalAPI only1M1911.05$0.50$3.00
DeepSeek96.796.72025llmOpen weights164K$0.29$0.43
Xiaomi96.396.32025llmOpen weights262K1451.34$0.10$0.30
Anthropic96.396.32025llmAPI only2025200K1000.30$1.00$5.00
Google95.795.72025multimodalAPI only1M14127.49$2.00$12.00
OpenAI95.795.72025multimodalAPI only400K1884.16$1.25$10.00
xAI95.49991.72025llmAPI only2024256K1000.70$3.00$15.00
#10Zhipu AI95952025llmOpen weights203K980.83$0.40$1.75
Index 63.5 = (64.0 + 55.5 + 38.5 + 95.9 / 4) — equal-weighted mean of 4 components.
General25%
64
  • SimpleQA
  • AA-LCR64
  • LongBench-v2
  • IFBench
Reasoning25%
55.5
  • GPQA Diamond85.9
  • Humanity’s Last Exam25.1
  • FrontierMath
  • ARC-AGI-2
Coding25%
38.5
  • SWE-bench Verified
  • Terminal-Bench31.8
  • Aider Polyglot
  • SciCode45.1
Tool use & agents25%
95.9
  • TAU-bench Retail
  • τ²-bench95.9
  • BFCL
  • BrowseComp
OpenAI9598.993.492.72025multimodalAPI only2024200K1155.20$1.10$4.40
Moonshot AI94.794.72025llmOpen weights1T (32B active)262K1001.00$0.60$2.50
Kuaishou94.794.72025llm1082.19$0.30$1.20
Alibaba94.798.4912025llm591.21$0.40$2.20
Amazon94.394.32025multimodalAPI only1M2290.89$0.30$2.50
OpenAI94942025llmAPI only400K1150.77$1.25$10.00
Zhipu AI93.993.92025llmOpen weights357B (MoE)2025203K850.70$0.43$1.74
OpenAI93.493.42025llmOpen weights117B (5.1B active)2024131K5000.50$0.04$0.18
xAI92.793.3922025llmAPI only2M90$0.20$0.50
Google92.296.79288922025multimodalAPI only20251M850.70$1.25$10.00
DeepSeek92922025llmOpen weights671B (37B active)131K$0.25$0.38
xAI9299.284.72025llm330.52$0.30$0.50
OpenAI91.791.72025multimodalAPI only400K1759.50$0.25$2.00
Anthropic91.391.32025llmAPI only200K581.50$5.00$25.00
xAI91.28793.393.32025multimodalAPI only2024128K1000.70$3.00$15.00

Ranked on Math. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.