AI War Tracker
298 models in catalog

AI models

Every model we track: frontier flagships, open-weights specialists, and narrow benchmark-only releases. Filter by lab, country, access, modality, or release window. Models are sorted newest first by default; head to the leaderboard for the ranked view.

Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter

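| Rank | Model | General idx | Multi-IF | LiveBench | Arena Hard | Humanity’s Last Exam | IFEval | SimpleQA | MMLU-Pro | MMLU | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| #51 | InclusionAI | 82.2 |  |  |  | 7.2 |  |  | 82.2 |  | 2025 |  | llm |  |  |  |  |  |  | $0.00 | $0.00 |
| #52 | Prime Intellect | 82.2 |  |  |  | 12.1 |  |  | 82.2 |  | 2025 |  | llm | Open weights |  |  | 131K |  |  | $0.20 | $1.10 |
| #53 | MiniMax | 82 |  |  |  | 12.5 |  |  | 82 |  | 2025 |  | llm | Open weights | 230B (10B active) |  | 205K | 91 | 1.19 | $0.26 | $1.00 |
| #54 | OpenAI | 82 |  |  |  | 16.9 |  |  | 82 |  | 2025 |  | multimodal | API only |  |  | 400K | 175 | 9.50 | $0.25 | $2.00 |
| #55 | Amazon | 81.8 |  |  |  | 10.9 |  |  | 81.8 |  | 2025 |  | multimodal | API only |  |  | 1M | 229 | 0.89 | $0.30 | $2.50 |
| #56 | Alibaba | 81.8 |  |  |  | 9.6 |  |  | 81.8 |  | 2025 |  | llm |  |  |  |  | 93 | 1.26 | $0.70 | $8.40 |
| #57 | LG AI Research | 81.8 |  |  |  | 10.5 |  |  | 81.8 |  | 2025 |  | llm |  |  |  |  |  |  | $0.00 | $0.00 |
| #58 | MiniMax | 81.6 |  |  |  | 8.2 |  |  | 81.6 |  | 2025 |  | llm |  |  |  |  |  |  | $0.60 | $2.20 |
| #59 | Mistral AI | 81.5 |  |  |  | 9.6 |  |  | 81.5 |  | 2025 |  | llm |  |  |  |  | 42 | 0.50 | $2.00 | $5.00 |
| #60 | ByteDance | 81.5 |  |  |  | 9.1 |  |  | 81.5 |  | 2025 |  | llm |  |  |  |  | 37 | 1.81 | $0.20 | $0.60 |
| #61 | Zhipu AI | 81.4 |  |  |  | 10.6 |  |  | 81.4 |  | 2025 |  | llm | Open weights |  | 2024 | 131K | 63 | 1.68 | $0.13 | $0.85 |
| #62 | NVIDIA | 81.4 |  |  |  | 6.8 |  |  | 81.4 |  | 2025 |  | llm |  |  |  |  | 51 | 0.29 | $0.10 | $0.40 |
| #63 | Kuaishou | 81.3 |  |  |  | 33.4 |  |  | 81.3 |  | 2025 |  | llm |  |  |  |  | 108 | 2.19 | $0.30 | $1.20 |
| #64 | Alibaba | 81.3 |  | 75.8 |  | 7.3 | 87.6 |  | 80.6 |  | 2025 |  | llm | Open weights |  | 2025 | 262K | 161 | 1.14 | $0.09 | $1.10 |
| #65 | Korea Telecom | 81.3 |  |  |  | 8.8 |  |  | 81.3 |  | 2025 |  | llm |  |  |  |  |  |  | $0.00 | $0.00 |
| #66 | DeepSeek | 81.2 |  |  |  | 5.2 |  |  | 81.2 |  | 2025 |  | llm | Open weights | 671B |  | 164K |  |  | $0.28 | $1.14 |
| #67 | Nous Research | 81.1 |  |  |  | 7.9 |  |  | 81.1 |  | 2025 |  | llm |  |  |  |  | 60 | 0.67 | $0.10 | $0.40 |
| #68 | Amazon | 80.9 |  |  |  | 6.8 |  |  | 80.9 |  | 2025 |  | llm |  |  |  |  |  |  | $0.30 | $2.50 |
| #69 | Meta | 80.9 |  |  |  | 4.2 | 88.6 |  | 73.3 | 87.3 | 2024 |  | llm | Open weights | 405B |  | 128K | 100 | 0.40 | $0.89 | $0.89 |
| #70 | OpenAI | 80.8 |  |  |  | 19 |  |  | 80.8 | 90 | 2025 |  | llm | Open weights | 117B (5.1B active) | 2024 | 131K | 500 | 0.50 | $0.04 | $0.18 |
| #71 | MiniMax | 80.8 |  |  |  | 7.5 |  |  | 80.8 |  | 2025 |  | llm |  |  |  |  |  |  | $0.00 | $0.00 |
| #72 | Mistral AI | 80.7 |  |  |  | 4.1 |  |  | 80.7 |  | 2025 |  | llm | Open weights | 675B (41B active) |  | 262K | 54 | 0.64 | $0.50 | $1.50 |
| #73 | Alibaba | 80.7 |  |  |  | 8.7 |  |  | 80.7 |  | 2025 |  | llm |  |  |  |  | 122 | 1.14 | $0.20 | $0.80 |
| #74 | InclusionAI | 80.6 |  |  |  | 10.2 |  |  | 80.6 |  | 2025 |  | llm |  |  |  |  |  |  | $0.00 | $0.00 |
| #75 | Amazon | 80.6 |  |  |  | 3.4 | 92.1 |  | 69.1 | 85.9 | 2024 |  | multimodal | API only |  |  | 300K | 100 | 0.50 | $0.80 | $3.20 |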

Ranked on General. Cell colors show relative standing within each column (red → yellow → green). Scores are curated approximations — see each model for sources.
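
For readers who want to reproduce a similar coloring on their own copy of the data, here is a minimal sketch of one way to turn "relative standing within a column" into a red → yellow → green color. It assumes simple min-max scaling within the column; the page does not say how standing is actually computed, and the function name `standing_color` is hypothetical.

```python
def standing_color(value, column):
    """Return an (R, G, B) tuple for `value` relative to the other
    values in `column`, on a red -> yellow -> green ramp.

    Assumption: standing is min-max scaled within the column. The
    tracker's real method is not documented in this page.
    """
    lo, hi = min(column), max(column)
    # If every value in the column is equal, treat standing as neutral.
    t = 0.5 if hi == lo else (value - lo) / (hi - lo)
    if t < 0.5:
        # Lower half: red (255, 0, 0) fades toward yellow (255, 255, 0).
        return (255, round(510 * t), 0)
    # Upper half: yellow (255, 255, 0) fades toward green (0, 255, 0).
    return (round(510 * (1 - t)), 255, 0)


# Example with three made-up scores, not taken from the table.
scores = [0, 5, 10]
colors = [standing_color(s, scores) for s in scores]
```

With this scheme the lowest value in a column comes out pure red, the highest pure green, and the midpoint yellow. Because scaling is done per column, the same score can receive different colors in different columns.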