AI War Tracker
298 models in catalog

AI models

Every model we track: frontier flagships, open-weights specialists, and narrow benchmark-only releases. Filter by lab, country, access, modality, or release window. Sorted newest-first by default; head to the leaderboard for the ranked view.


Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter

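Rank · Model · Reason idx · BIG-Bench Hard · ARC-AGI-2 · DROP · GPQA Diamond · Released · Country · Type · Access · Params · Cutoff · Context · Speed · Latency · In $/M · Out $/M
#126 · Alibaba · 73.3 · 73.3 · 2025 · llm · 93 · 1.26 · $0.70 · $8.40
#127 · Amazon · 73.1 · 86.9 · 85.4 · 46.9 · 2024 · multimodal · API only · 300K · 100 · 0.50 · $0.80 · $3.20
#128 · Anthropic · 73 · 73 · 2025 · llm · API only · 2025 · 200K · 100 · 0.30 · $1.00 · $5.00
#129 · Anthropic · 72.9 · 58.3 · 87.5 · 2026 · llm · API only · 1M · 75 · 1.13 · $3.00 · $15.00
#130 · Alibaba · 72.9 · 72.9 · 2025 · llm · Open weights · 2025 · 262K · 161 · 1.14 · $0.09 · $1.10
#131 · OpenAI · 72.7 · 52.9 · 92.4 · 2025 · llm · API only · 400K · 73 · 0.69 · $1.75 · $14.00
#132 · xAI · 72.7 · 72.7 · 2025 · llm · $0.00 · $0.00
#133 · Nous Research · 72.7 · 72.7 · 2025 · llm · 34 · 0.74 · $1.00 · $3.00
#134 · ByteDance · 72.6 · 72.6 · 2025 · llm · 37 · 1.81 · $0.20 · $0.60
#135 · Alibaba · 72.6 · 72.6 · 2025 · llm · 102 · 1.05 · $0.30 · $1.00
#136 · InclusionAI · 72.5 · 72.5 · 2025 · llm · $0.10 · $0.60
#137 · Upstage · 72.4 · 72.4 · 2026 · llm · API only · 128K · $0.15 · $0.60
#138 · Korea Telecom · 72.2 · 72.2 · 2025 · llm · $0.00 · $0.00
#139 · Alibaba · 72 · 72 · 2025 · llm · 122 · 1.14 · $0.20 · $0.80
#140 · Zhipu AI · 71.9 · 71.9 · 2025 · multimodal · Open weights · 131K · 44 · 1.31 · $0.30 · $0.90
#141 · InclusionAI · 71.9 · 71.9 · 2025 · llm · $0.00 · $0.00
#142 · OpenAI · 71.5 · 71.5 · 2025 · llm · Open weights · 21B (3.6B active) · 2024 · 131K · 1000 · 0.38 · $0.03 · $0.14
#143 · DeepSeek · 71.5 · 71.5 · 2025 · llm · Open weights · 671B total / 37B active (MoE) · 128K · 189 · 0.07 · $0.55 · $2.19
#144 · OpenAI · 71.4 · 71.4 · 2025 · multimodal · API only · 128K · 5020.00 · $75.00 · $150.00
#145 · ServiceNow · 71.3 · 71.3 · 2025 · llm · $0.00 · $0.00
#146 · MBZUAI Institute of Foundation Models · 71.3 · 71.3 · 2025 · llm · $0.00 · $0.00
#147 · OpenAI · 71.2 · 71.2 · 2025 · llm · API only · 2024 · 400K · 500 · 0.30 · $0.05 · $0.40
#148 · Alibaba · 71.2 · 71.2 · 2025 · multimodal · Open weights · 2025 · 262K · 51 · 1.20 · $0.20 · $0.88
#149 · Alibaba · 70.7 · 70.7 · 2025 · llm · 151 · 1.18 · $0.30 · $1.90
#150 · OpenAI · 70.1 · 70.1 · 2024 · multimodal · API only · 2023 · 128K · 132 · 0.50 · $2.50 · $10.00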

Ranked on Reasoning. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.
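
As a rough illustration of how the In $/M and Out $/M columns might be used, the sketch below estimates the cost of a single request. It assumes "$/M" means US dollars per one million tokens, which the page does not spell out; the function name `request_cost_usd` and the token counts are hypothetical, and the prices are the $1.00 in / $5.00 out listed for row #128.

```python
def request_cost_usd(input_tokens: int, output_tokens: int,
                     in_price_per_m: float, out_price_per_m: float) -> float:
    """Estimate the cost of one request in USD.

    Assumption: prices are quoted in USD per 1,000,000 tokens,
    with input and output tokens billed at separate rates.
    """
    total = input_tokens * in_price_per_m + output_tokens * out_price_per_m
    return total / 1_000_000


# Hypothetical request: 10,000 input tokens and 2,000 output tokens,
# priced like row #128 ($1.00 in, $5.00 out).
cost = request_cost_usd(10_000, 2_000, in_price_per_m=1.00, out_price_per_m=5.00)
print(f"${cost:.4f}")  # prints "$0.0200"
```

Because output is usually priced well above input, a model's real cost depends heavily on how long its answers run; comparing rows on In $/M alone can be misleading.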