AI War Tracker
298 models in catalog

AI models

Every model we track: frontier flagships, open-weights specialists, and narrow benchmark-only releases. Filter by lab, country, access, modality, or release window. Sorted newest first by default; head to the leaderboard for the ranked view.


Updated May 29, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter

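| Rank | Model | Agents idx | τ²-bench | BFCL | τ²-bench Airline | τ²-bench Retail | BrowseComp | TAU-bench Airline | TAU-bench Retail | Released | Country | Type | Access | Params | Cutoff | Context | Speed | Latency | In $/M | Out $/M |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #76 | OpenAI | 83 | 83 | | | | | | | 2025 | | multimodal | API only | | | 400K | 188 | 4.16 | $1.25 | $10.00 |
| #77 | OpenBMB | 82.5 | 82.5 | | | | | | | 2026 | | llm | | | | | | | $0.00 | $0.00 |
| #78 | OpenAI | 81.9 | 81.9 | | | | | | | 2025 | | llm | API only | | | 400K | 115 | 0.77 | $1.25 | $10.00 |
| #79 | Alibaba | 81.6 | 81.6 | | | | | | | 2026 | | llm | | | | | 328 | 0.24 | $0.00 | $0.10 |
| #80 | Cohere | 80.7 | 80.7 | | | | | | | 2025 | | llm | Open weights | | 2024 | 256K | 203 | 0.17 | $2.50 | $10.00 |
| #81 | Google | 80.4 | 80.4 | | | | | | | 2025 | | multimodal | API only | | | 1M | 191 | 1.05 | $0.50 | $3.00 |
| #82 | Amazon | 80.4 | 80.4 | | | | | | | 2025 | | llm | | | | | | | $0.30 | $2.50 |
| #83 | Anthropic | 79.5 | 79.5 | | | | | | | 2026 | | llm | API only | | | 1M | 75 | 1.13 | $3.00 | $15.00 |
| #84 | Alibaba | 79.5 | 79.5 | | | | | | | 2026 | | llm | Open weights | | | 262K | 92 | 1.14 | $0.11 | $0.80 |
| #85 | LongCat | 79.5 | 79.5 | | | | | | | 2026 | | llm | | | | | 110 | 5.59 | $0.00 | $0.00 |
| #86 | Anthropic | 78.1 | 78.1 | | 70 | 86.2 | | | | 2025 | | llm | API only | | 2025 | 1M | 42 | 0.40 | $3.00 | $15.00 |
| #87 | LG AI Research | 78.1 | 78.1 | | | | | | | 2026 | | llm | | | | | | | $0.00 | $0.00 |
| #88 | OpenAI | 76 | 76 | | | | | | | 2026 | | llm | API only | | 2025 | 400K | 157 | 0.55 | $0.20 | $1.25 |
| #89 | Amazon | 75.7 | 75.7 | | | | | | | 2025 | | multimodal | API only | | | 1M | 229 | 0.89 | $0.30 | $2.50 |
| #90 | xAI | 75.7 | 75.7 | | | | | | | 2025 | | llm | | | | | | | $0.00 | $0.00 |
| #91 | xAI | 74.9 | 74.9 | | | | | | | 2025 | | llm | API only | | 2024 | 256K | 100 | 0.70 | $3.00 | $15.00 |
| #92 | LG AI Research | 74.3 | 74.3 | | | | | | | 2025 | | llm | | | | | | | $0.00 | $0.00 |
| #93 | Alibaba | 74.3 | 74.3 | | | | | | | 2025 | | llm | API only | | 2025 | 262K | 45 | 1.71 | $0.78 | $3.90 |
| #94 | Moonshot AI | 73.4 | 73.4 | | | | | | | 2025 | | llm | API only | 1000000000000 | | 262K | 16 | 1.94 | $0.60 | $2.50 |
| #95 | Anthropic | 71.5 | 73.4 | | | | | 59.6 | 81.4 | 2025 | | llm | API only | | 2025 | 200K | 120 | 0.40 | $15.00 | $75.00 |
| #96 | OpenAI | 71.3 | 86.5 | | 62.6 | 81.1 | 54.9 | | | 2025 | | llm | API only | | 2024 | 400K | 100 | 2.00 | $1.25 | $10.00 |
| #97 | OpenAI | 71.1 | 71.1 | | | | | | | 2025 | | llm | API only | | 2024 | 400K | 200 | 1.00 | $0.25 | $2.00 |
| #98 | Inception | 70.8 | 70.8 | | | | | | | 2026 | | llm | API only | | | 128K | 790 | 6.11 | $0.25 | $0.75 |
| #99 | Anthropic | 69.9 | 71.4 | | | | | 56 | 82.4 | 2025 | | llm | API only | | 2025 | 200K | 120 | 0.40 | $15.00 | $75.00 |
| #100 | ServiceNow | 69.3 | 69.3 | | | | | | | 2025 | | llm | | | | | | | $0.00 | $0.00 |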

Ranked by Agents idx. Scores are curated approximations; see each model for sources.