AI War Tracker
298 models in catalog

AI models

Every model we track — frontier flagships, open-weights specialists, narrow benchmarks-only releases. Filter by lab, country, access, modality, or release window. Sorted by newest by default; head to the leaderboard for the ranked view.

Leaderboard →Labs →Benchmarks →

Updated May 30, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Spotted an error?

RankModelIndexGeneralReasonCodingAgentsMathMultiLong ctxGPQA DiamondDROPARC-AGI-2BIG-Bench HardSciCodeTerminal-BenchLiveCodeBenchSWE-bench VerifiedAider PolyglotHumanEvalAider Polyglot EditMBPPMultiPL-ESWE-bench ProAIME 2025MATH-500AIME 2024MATHGSM8KMGSMHMMT 2025FrontierMathτ²-benchTAU-bench RetailTAU-bench AirlineBFCLBrowseCompτ²-bench Airlineτ²-bench RetailMMMUMathVistaChartQADocVQAMMMU-ProAI2DHumanity’s Last ExamMMLU-ProMMLUIFEvalSimpleQAMulti-IFLiveBenchArena HardAA-LCRLongBench-v2ReleasedCountryTypeAccessParamsCutoffContextSpeedLatencyIn $/MOut $/M
OpenAI47.153.230.437.966.758.974.759.37835.812.967.94161.788.19774.396.497.189.35.562.670.85077.671.87.784.192476759.32024llmAPI only2023200K660.54$15.00$60.00
Allen Institute for AI4.8017.21.900.7028.83.704.10.705.528.202024llm$0.00$0.00
Amazon24.71925.213.541.242.881.51946.985.486.920.86.123.389778.676.694.81468.461.789.293.53.469.185.992.1192024multimodalAPI only300K1000.50$0.80$3.20
Amazon22.617.723.37.442.141.878.517.74280.282.413.90.816.785.4776.573.394.517.566.656.286.892.44.65980.589.717.72024multimodalAPI only300K1000.50$0.06$0.24
Amazon18.29.722.45.535.138.29.74079.379.59.41.51481.1670.369.392.31456.24.753.177.687.29.72024llmAPI only128K1000.50$0.03$0.14
Mistral AI25.810.327.129.236.536.981.710.350.529.226.12.371.436.56469.488.193.393.83.670.110.32024multimodalAPI only2024131K00.50$2.00$6.00
Anthropic27.123.322.624.637.872.123.341.683.127.42.331.440.62888.172.169.485.624.65122.83.56580.923.32024llmAPI only2024200K1040.30$0.80$4.00
NVIDIA17.4725.613.923.142.2746.523.34.516.91173.391.423.14.66980.272024llmOpen weights7000000000020232920.24$1.20$1.20
Meta12.811.719614.626.766.411.732.811.20.8111.751.651.968.914.650.751.583.488.43391.15.246.47311.72024multimodalOpen weights106000000002023128K1680.20$0.05$0.05
Meta11.82195.221.126.1232.85.28.33.348.94877.758.221.15.234.763.477.422024llmOpen weights2023131K1720.24$0.05$0.34
Meta4.6512.50.907519.61.701.901405.32052024llmOpen weights2023131K910.60$0.03$0.20
#287Allen Institute for AI4.1014.61.8000243.603.9005.137.102024llm$0.00$0.00
Index 4.1 = (0.0 + 14.6 + 1.8 + 0.0 / 4) — equal-weighted mean of 4 components.
General25%
0
  • SimpleQA
  • AA-LCR0
  • LongBench-v2
  • IFBench
Reasoning25%
14.6
  • GPQA Diamond24
  • Humanity’s Last Exam5.1
  • FrontierMath
  • ARC-AGI-2
Coding25%
1.8
  • SWE-bench Verified
  • Terminal-Bench0
  • Aider Polyglot
  • SciCode3.6
Tool use & agents25%
0
  • TAU-bench Retail
  • τ²-bench0
  • BFCL
  • BrowseComp
Alibaba24.320.326.615.634.549.920.34926.74.555.586.688.275.11485.883.195.834.54.271.184.152.381.220.32024llmOpen weights2024131K1000.37$0.36$0.40
Mistral AI20.65.326.317.73343.85.348.629.26.129.3921473.69333469.7845.32024llmOpen weights123B128K420.40$2.00$6.00
Meta3124.327.518.353.836.724.350.784.829.96.830.589370.373.896.81988.54.273.387.388.624.32024llmOpen weights405000000000128K1000.40$0.89$0.89
Meta23.66.323.214.95034.56.341.779.626.7323.280.5464.915.284.84.666.483.687.56.32024llmOpen weights2023131K12040.20$0.40$0.40
Meta21.715.717.8746.328.115.730.459.513.20.811.672.64.351.916.476.15.148.369.480.415.72024llmOpen weights2023131K20470.20$0.02$0.05
OpenAI38.845.637.727.244.642.777.75370.136.68.342.533.230.790.218.225.789.313.128.960.342.845.563.472.261.485.792.859.994.25.374.788.78138.260.9532024multimodalAPI only2023128K1320.50$2.50$10.00
Microsoft6.2218.24.5023231.99011.60.345.704.443.522024llm$0.00$0.00
Meta7.8021.29.9048.3037.918.90.819.848.304.457.402024llmOpen weights20238K450.70$0.51$0.74
Meta5.9017.46049.9029.611.909.649.905.140.502024llmOpen weights20238K810.51$0.04$0.04
Anthropic17.62118.69.721.139.42133.378.473.718.60.815.475.939.438.988.975.121.13.975.2212024multimodalAPI only2023200K1040.40$0.25$1.25
Mistral AI3.40112.4012.1017.72.44.612.104.324.502023llm900.39$0.20$0.20

Score columns under Index are the v1.2 weighted components (25% each) that feed it. Reference per-category averages (not in the index) follow. Every individual benchmark in our catalog is also shown — grouped by category, ordered by coverage. Hover any header for details — click to sort. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.