AI War Tracker
298 models in catalog

AI models

Every model we track: frontier flagships, open-weights specialists, narrow benchmark-only releases. Filter by lab, country, access, modality, or release window. Sorted newest first by default; head to the leaderboard for the ranked view.


Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter

Rank | Model | Reason idx | BIG-Bench Hard | ARC-AGI-2 | DROP | GPQA Diamond | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M
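#176 | Alibaba | 65.9 | 65.9 | 2025 | llm | 1221.25 | $0.20 | $0.40
#177 | Alibaba | 65.8 | 65.8 | 2025 | llm | Open weights | 2025 | 131K | 1220.66 | $0.09 | $0.45
#178 | Microsoft | 65.8 | 75.5 | 56.1 | 2025 | llm | Open weights | 2024 | 16K | 330.20 | $0.07 | $0.14
#179 | Upstage | 65.7 | 65.7 | 2025 | llm | $0.00 | $0.00
#180 | InclusionAI | 65.7 | 65.7 | 2025 | llm | 911.61 | $0.10 | $0.60
#181 | Alibaba | 65.2 | 65.2 | 2025 | llm | Open weights | 32500000000 | 2024 | 310.45 | $0.70 | $1.00
#182 | DeepSeek | 65.2 | 65.2 | 2025 | llm | Open weights | 70600000000 | 128K | 370.65 | $0.10 | $0.40
#183 | OpenAI | 65 | 65 | 2025 | multimodal | API only | 2024 | 1M | 1505.00 | $0.40 | $1.60
#184 | Google | 64.6 | 64.6 | 2025 | multimodal | API only | 2025 | 1M | 60.44 | $0.10 | $0.40
#185 | Mistral AI | 64.1 | 64.1 | 2025 | llm | $0.00 | $0.00
#186 | LongCat | 63.6 | 63.6 | 2026 | llm | 1105.59 | $0.00 | $0.00
#187 | Sarvam | 63.3 | 63.3 | 2026 | llm | 2141.17 | $0.00 | $0.00
#188 | Anthropic | 62.4 | 83.1 | 41.6 | 2024 | llm | API only | 2024 | 200K | 1040.30 | $0.80 | $4.00
#189 | Google | 62.1 | 62.1 | 2024 | multimodal | API only | 2024 | 1M | 1830.40 | $0.10 | $0.40
#190 | Alibaba | 62 | 62 | 2025 | llm | 1031.04 | $0.30 | $1.00
#191 | Alibaba | 61.8 | 61.8 | 2025 | llm | 691.68 | $0.30 | $1.80
#192 | Anthropic | 61.8 | 73.7 | 78.4 | 33.3 | 2024 | multimodal | API only | 2023 | 200K | 1040.40 | $0.25 | $1.25
#193 | Google | 61.5 | 31.1 | 91.9 | 2025 | multimodal | API only | 1M | 14127.49 | $2.00 | $12.00
#194 | Naver | 61.5 | 61.5 | 2025 | llm | $0.00 | $0.00
#195 | DeepSeek | 61.2 | 61.2 | 2025 | llm | $0.00 | $0.00
#196 | Allen Institute for AI | 61 | 61 | 2025 | llm | Open weights | 66K | $0.15 | $0.50
#197 | Meta | 60.7 | 79.6 | 41.7 | 2024 | llm | Open weights | 2023 | 131K | 12040.20 | $0.20 | $0.40
#198 | Alibaba | 60.4 | 60.4 | 2025 | llm | Open weights | 2025 | 132K | 621.01 | $0.10 | $0.24
#199 | Trillion Labs | 60.1 | 60.1 | 2026 | llm | $0.00 | $0.00
#200 | Mistral AI | 59.4 | 59.4 | 2025 | llm | Open weights | 262K | 510.64 | $0.40 | $2.00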

Ranked on Reasoning. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.
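
One way to picture the red → yellow → green shading is as a position within the column's range mapped onto a color ramp. The sketch below is only an illustration: the min-max normalization, the linear interpolation, and the function names `relative_standing` and `standing_to_rgb` are all assumptions, not the tracker's actual method.

```python
def relative_standing(value, column):
    """Position of value within the column's range, from 0.0 (lowest) to 1.0 (highest).

    Assumption: min-max normalization; the site may use ranks or percentiles instead.
    """
    lo, hi = min(column), max(column)
    if hi == lo:
        return 0.5  # every value is equal, so place it mid-scale
    return (value - lo) / (hi - lo)


def standing_to_rgb(t):
    """Linearly interpolate red (t=0) -> yellow (t=0.5) -> green (t=1)."""
    t = max(0.0, min(1.0, t))  # clamp to the valid range
    if t <= 0.5:
        return (255, round(255 * t * 2), 0)    # red toward yellow
    return (round(255 * (1 - t) * 2), 255, 0)  # yellow toward green


# Reason idx values for ranks #176, #188, and #200 in the table above.
reason_idx = [65.9, 62.4, 59.4]
colors = [standing_to_rgb(relative_standing(v, reason_idx)) for v in reason_idx]
```

Under these assumptions the highest value in a column comes out pure green and the lowest pure red. For columns where lower is better, such as latency or price, the standing would presumably be inverted (`1 - t`) before coloring.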