AI War Tracker
298 models in catalog

AI models

Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.

Leaderboard →Labs →Benchmarks →

Updated May 30, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?

RankModelIndexGeneralReasonCodingAgentsMathMultiLong ctxGPQA DiamondDROPARC-AGI-2BIG-Bench HardSciCodeTerminal-BenchLiveCodeBenchSWE-bench VerifiedAider PolyglotHumanEvalAider Polyglot EditMBPPMultiPL-ESWE-bench ProAIME 2025MATH-500AIME 2024MATHGSM8KMGSMHMMT 2025FrontierMathτ²-benchTAU-bench RetailTAU-bench AirlineBFCLBrowseCompτ²-bench Airlineτ²-bench RetailMMMUMathVistaChartQADocVQAMMMU-ProAI2DHumanity’s Last ExamMMLU-ProMMLUIFEvalSimpleQAMulti-IFLiveBenchArena HardAA-LCRLongBench-v2ReleasedCountryTypeAccessParamsCutoffContextSpeedLatencyIn $/MOut $/M
Alibaba40.551.340.129.441.369.551.372.930.77.668.449.887.869.521.660.94445.557.37.380.687.675.851.32025llmOpen weights2025262K1611.14$0.09$1.10
Alibaba42.560.343.824.341.584.360.375.938.89.878.484.341.511.782.460.32025llmOpen weights80B (3B active)262K1471.14$0.50$6.00
InclusionAI14.46.730.67.213.249.36.756.213.50.842.949.313.2567.16.72025llm$0.00$0.00
Moonshot AI48.552.341.127.173.464.752.375.830.723.56194.557.37289.173.46.382.590.252.32025llmAPI only1000000000000262K161.94$0.60$2.50
#180Swiss AI Initiative8.1016.42.912.9027.25.7012.95.502025llm$0.80$2.90
Index 8.1 = (0.0 + 16.4 + 2.9 + 12.9 / 4) — equal-weighted mean of 4 components.
General25%
0
  • SimpleQA
  • AA-LCR0
  • LongBench-v2
  • IFBench
Reasoning25%
16.4
  • GPQA Diamond27.2
  • Humanity’s Last Exam5.5
  • FrontierMath
  • ARC-AGI-2
Coding25%
2.9
  • SWE-bench Verified
  • Terminal-Bench0
  • Aider Polyglot
  • SciCode5.7
Tool use & agents25%
12.9
  • TAU-bench Retail
  • τ²-bench12.9
  • BFCL
  • BrowseComp
Swiss AI Initiative7.2015.32.111.4025.64.1011.4502025llm$0.10$0.20
xAI47.748.340.126.875.743.348.372.736.217.465.743.375.77.579.348.32025llm$0.00$0.00
Nous Research2820.741.52326.669.720.772.734.611.468.669.726.610.382.920.72025llm340.74$1.00$3.00
Nous Research21.96.738.919.322.568.76.769.934.14.565.368.722.57.981.16.72025llm600.67$0.10$0.40
DeepSeek50.973.445.451.233.749.953.374.939.131.356.46668.449.866.333.537.43015.983.793.453.32025llmOpen weights671B (37B active)2025164K$0.21$0.79
ByteDance42.457.740.821.749.484.757.772.636.56.876.584.749.49.181.557.72025llm371.81$0.20$0.60
NVIDIA22.222.730.811.823.469.722.757221.572.469.723.44.674.222.72025llm1290.26$0.00$0.20
Google5.6013.309.12.3022.4000.32.39.14.25.502025llm$0.00$0.00
Mistral AI28.519.731.622.240.638.319.758.833.810.640.638.340.64.468.319.72025multimodalAPI only2025131K470.69$0.40$2.00
Zhipu AI18.6037.214.522.573068.422.16.860.47322.55.978.802025multimodalOpen weights202466K850.70$0.60$1.80
AI21 Labs15.717.321.410.613.531.217.33918.82.318.12.36013.53.857.717.32025llmOpen weights2024256K480.97$2.00$8.00
OpenAI63.375.646.160.970.778.481.375.687.342.937.984.674.98893.494.699.484.793.326.386.554.962.681.184.278.424.887.192.575.62025llmAPI only2024400K1002.00$1.25$10.00
OpenAI54.26840.437.271.1676882.34133.383.891.187.822.171.116.783.7682025llmAPI only2024400K2001.00$0.25$2.00
OpenAI33.841.729.82736.556.841.771.236.617.478.985.275.69.636.58.77841.72025llmAPI only2024400K5000.30$0.05$0.40
Alibaba28.337.736.313.625.482.737.766.725.61.564.182.725.45.974.337.72025llm$0.00$0.00
Alibaba18.47.328.211.326.652.37.351.718.14.537.752.326.64.767.27.32025llm$0.00$0.00
Anthropic60.666.346.452.976.97866.380.940.943.365.474.57871.482.45611.98866.32025llmAPI only2025200K1200.40$15.00$75.00
OpenAI52.350.75041.766.893.450.780.938.923.587.862.441.893.465.867.81980.89050.72025llmOpen weights117B (5.1B active)2024131K5000.50$0.04$0.18
OpenAI38.93144.422.557.589.33171.534.410.677.789.360.254.817.374.885.3312025llmOpen weights21B (3.6B active)2024131K10000.38$0.03$0.14
Alibaba28.22927.821.534.559.22951.627.815.240.32989.334.5470.6292025llmOpen weights2025160K971.49$0.07$0.27

Score columns under Index are the v1.2 weighted components (25% each) that feed it. Reference per-category averages (not in the index) follow. Every individual benchmark in our catalog is also shown — grouped by category, ordered by coverage. Hover any header for details — click to sort. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.