AI War Tracker
298 models in catalog

AI models

Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.

Leaderboard →Labs →Benchmarks →

Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?

RankModelIndexGeneralReasonCodingAgentsMathMultiLong ctxGPQA DiamondDROPARC-AGI-2BIG-Bench HardSciCodeTerminal-BenchLiveCodeBenchSWE-bench VerifiedAider PolyglotHumanEvalAider Polyglot EditMBPPMultiPL-ESWE-bench ProAIME 2025MATH-500AIME 2024MATHGSM8KMGSMHMMT 2025FrontierMathτ²-benchTAU-bench RetailTAU-bench AirlineBFCLBrowseCompτ²-bench Airlineτ²-bench RetailMMMUMathVistaChartQADocVQAMMMU-ProAI2DHumanity’s Last ExamMMLU-ProMMLUIFEvalSimpleQAMulti-IFLiveBenchArena HardAA-LCRLongBench-v2ReleasedCountryTypeAccessParamsCutoffContextSpeedLatencyIn $/MOut $/M
InclusionAI422532.824.2862559.327.121.2866.2252026llmAPI only262K$0.01$0.03
Moonshot AI69.569.763.548.795.969.791.153.543.958.695.935.969.72026llmOpen weights1T (32B active)262K571.20$0.73$3.49
Anthropic72.870.366.965.588.670.394.254.554.587.688.639.670.32026llmAPI only1M491.42$5.00$25.00
China Mobile41.111.737.122.79311.767.627.218.2936.611.72026llm$0.00$0.00
LG AI Research49.349.345.524.378.149.379.42820.578.111.649.32026llm$0.00$0.00
Meta68.569.764.248.591.569.788.451.545.591.539.969.72026multimodalAPI only$0.00$0.00
Zhipu AI65.262.357.443.597.762.386.843.843.297.72862.32026llmOpen weights203K530.78$0.98$3.08
xAI63.65861.741.8935891.145.637.99332.2582026llm1050.70$2.00$6.00
Google45.255.748.832.543.655.779.2402543.618.355.72026multimodalOpen weights262K660.71$0.06$0.33
Google26.130.731.216.42630.757.624.48.3264.730.72026llm$0.00$0.00
Alibaba66.769.75742.397.769.788.240.743.997.725.769.72026multimodalAPI only1M521.73$0.33$1.95
StepFun57.554.352.635.687.454.382.638.532.687.422.654.32026llm1970.90$0.00$0.00
Google55.46254.239.965.56285.743.436.465.522.7622026multimodalOpen weights262K360.79$0.12$0.37
Google18.315241222.21543.320.9322.24.8152026llm$0.00$0.00
Zhipu AI61.56148.438.198.56180.943.532.698.515.8612026multimodalAPI only203K$1.20$4.00
Arcee AI49.4334529.490.13375.236.122.790.114.7332026llmOpen weights262K1290.61$0.22$0.85
Alibaba55.152.748.330.988.352.782.640.521.288.313.952.72026llm541.28$0.40$4.80
#43Alibaba46.54440.716.984.54474.225.58.384.57.1442026llm2350.99$0.10$0.80
Index 46.5 = (44.0 + 40.7 + 16.9 + 84.5 / 4) — equal-weighted mean of 4 components.
General25%
44
  • SimpleQA
  • AA-LCR44
  • LongBench-v2
  • IFBench
Reasoning25%
40.7
  • GPQA Diamond74.2
  • Humanity’s Last Exam7.1
  • FrontierMath
  • ARC-AGI-2
Coding25%
16.9
  • SWE-bench Verified
  • Terminal-Bench8.3
  • Aider Polyglot
  • SciCode25.5
Tool use & agents25%
84.5
  • TAU-bench Retail
  • τ²-bench84.5
  • BFCL
  • BrowseComp
Kuaishou62.56650.843.889.56685.538.349.289.516662026llmAPI only256K1081.36$0.30$1.20
Xiaomi60.663.75337.68863.785.539.535.68820.463.72026llm1101.51$0.40$2.00
NVIDIA39.73443.62853.23475.834.821.253.211.4342026llm$0.00$0.00
Xiaomi63.860.757.741.79560.78742.540.99528.360.72026llmAPI only1M602.01$1.00$3.00
MiniMax63.668.757.843.284.868.787.44739.484.828.168.72026llmOpen weights205K501.32$0.28$1.20
Xiaomi61.366.751.435.891.266.782.836.734.891.219.966.72026multimodalAPI only262K1081.36$0.40$2.00
OpenAI65.269.357.151.183.369.387.549.952.383.326.669.32026llmAPI only2025400K1620.63$0.75$4.50

Score columns under Index are the v1.2 weighted components (25% each) that feed it. Reference per-category averages (not in the index) follow. Every individual benchmark in our catalog is also shown — grouped by category, ordered by coverage. Hover any header for details — click to sort. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.