AI War Tracker
Long Context

AA-LCR

Artificial Analysis Long-Context Recall — how well a model retrieves specific information from long inputs across multiple positions.

301Models
75.7Top score
32.3Median

State of the art over time

Each point is a model at its release date; the line traces the best score to date.

8060402002023202420252026Llama 3 8B Instruct: 0 (2024-04-18)Llama 3 70B Instruct: 0 (2024-04-18)Phi-3 Mini Instruct 3.8B: 2 (2024-04-23)Llama 3.1 405B Instruct: 24.3 (2024-07-23)Llama 3.1 8B Instruct: 15.7 (2024-07-23)Llama 3.1 70B Instruct: 6.3 (2024-07-23)Mistral Large 2: 5.3 (2024-07-24)Qwen2.5 72B Instruct: 20.3 (2024-09-19)Llama 3.2 11B Instruct: 11.7 (2024-09-25)Llama 3.2 1B Instruct: 5 (2024-09-25)Llama 3.2 3B Instruct: 2 (2024-09-25)Molmo 7B-D: 0 (2024-09-25)Llama 3.1 Nemotron 70B Instruct: 7 (2024-10-01)Claude 3.5 Haiku: 23.3 (2024-11-04)Pixtral Large: 10.3 (2024-11-19)Nova Pro: 19 (2024-11-20)Nova Lite: 17.7 (2024-11-20)Nova Micro: 9.7 (2024-11-20)OLMo 2 7B: 0 (2024-11-26)Llama 3.3 70B Instruct: 15 (2024-12-06)Gemini 2.0 Flash: 28.3 (2024-12-11)DeepSeek-V3: 29 (2024-12-26)Phi 4: 0 (2025-01-10)DeepSeek-R1: 52.3 (2025-01-20)DeepSeek R1 Distill Llama 70B: 11 (2025-01-20)DeepSeek R1 Distill Qwen 32B: 9.7 (2025-01-20)DeepSeek R1 Distill Qwen 14B: 7 (2025-01-20)DeepSeek R1 Distill Qwen 1.5B: 0.3 (2025-01-20)DeepSeek R1 Distill Llama 8B: 0 (2025-01-20)Mistral Small 3: 0 (2025-01-30)o3-mini: 39.3 (2025-01-31)Grok-3: 54.7 (2025-02-17)Grok 3 mini Reasoning: 50.3 (2025-02-19)QwQ-32B: 25 (2025-03-05)Gemma 3 12B Instruct: 6.7 (2025-03-12)Gemma 3 4B Instruct: 5.7 (2025-03-12)Gemma 3 27B Instruct: 5.7 (2025-03-12)Reka Flash 3: 0 (2025-03-12)Command A: 46 (2025-03-13)OLMo 2 32B: 0 (2025-03-13)Gemma 3 1B Instruct: 0 (2025-03-13)Mistral Small 3.1: 19.7 (2025-03-17)Llama-3.3 Nemotron Super 49B v1: 17 (2025-03-18)DeepSeek-V3 0324: 41 (2025-03-25)Llama 4 Maverick: 46 (2025-04-05)Llama 4 Scout: 25.8 (2025-04-05)Llama 3.1 Nemotron Ultra 253B v1: 7.3 (2025-04-07)GPT-4.1: 61 (2025-04-14)GPT-4.1 Mini: 42.3 (2025-04-14)GPT-4.1 Nano: 17 (2025-04-14)o4-mini: 55 (2025-04-16)Granite 3.3 8B: 4.3 (2025-04-16)Gemini 2.5 Flash: 61.7 (2025-04-17)Qwen3 0.6B: 0 (2025-04-28)Qwen3 4B: 0 (2025-04-28)Qwen3 1.7B: 0 (2025-04-28)Qwen3 235B A22B: 0 (2025-04-28)Qwen3 32B: 0 (2025-04-28)Qwen3 14B: 0 (2025-04-28)Qwen3 8B: 0 (2025-04-28)Qwen3 30B A3B: 0 (2025-04-28)Nova Premier: 30 (2025-04-30)Mistral Medium 3: 28 (2025-05-07)Gemma 3n E4B Instruct: 0 (2025-05-20)Llama 3.1 Nemotron Nano 4B v1.1: 0 (2025-05-20)Solar Pro 2: 0 (2025-05-20)Devstral Small: 26.7 (2025-05-21)Claude Sonnet 4: 64.7 (2025-05-22)Claude Opus 4: 36 (2025-05-22)Sarvam M: 0 (2025-05-23)DeepSeek-R1-0528: 54.7 (2025-05-28)DeepSeek R1 0528 Qwen3 8B: 13 (2025-05-29)Magistral Small 1: 0 (2025-06-10)Magistral Medium 1: 0 (2025-06-10)MiniMax M1 80k: 54.3 (2025-06-17)MiniMax M1 40k: 51.7 (2025-06-17)Mistral Small 3.2: 17.3 (2025-06-20)Gemma 3n E2B Instruct: 0 (2025-06-26)ERNIE 4.5 300B A47B: 2.3 (2025-06-30)Jamba 1.7 Mini: 12.7 (2025-07-07)Grok 4: 68 (2025-07-09)Devstral Medium: 28.7 (2025-07-10)LFM2 1.2B: 0 (2025-07-10)Kimi K2: 51 (2025-07-11)EXAONE 4.0 32B: 14 (2025-07-15)Exaone 4.0 1.2B: 0 (2025-07-15)Gemini 2.5 Flash Lite: 51.3 (2025-07-22)Qwen3 Coder 480B A35B Instruct: 42.3 (2025-07-22)Qwen3-235B-A22B-Instruct-2507: 31.2 (2025-07-22)Qwen3 235B A22B 2507: 67 (2025-07-25)GLM 4.5 Air: 43.7 (2025-07-25)Llama Nemotron Super 49B v1.5: 34 (2025-07-25)GLM-4.5: 48.3 (2025-07-28)Qwen3 30B A3B 2507 Instruct: 22.7 (2025-07-29)Qwen3 30B A3B 2507: 59 (2025-07-30)Qwen3 Coder 30B A3B Instruct: 29 (2025-07-31)Claude Opus 4.1: 66.3 (2025-08-05)gpt-oss-120b: 50.7 (2025-08-05)gpt-oss-20b: 31 (2025-08-05)Qwen3 4B 2507: 37.7 (2025-08-06)Qwen3 4B 2507 Instruct: 7.3 (2025-08-06)GPT-5 mini: 68 (2025-08-07)GPT-5 nano: 41.7 (2025-08-07)Jamba Large 1.7: 17.3 (2025-08-08)GLM 4.5V: 0 (2025-08-11)Mistral Medium 3.1: 19.7 (2025-08-13)Gemma 3 270M: 0 (2025-08-14)NVIDIA Nemotron Nano 9B V2: 22.7 (2025-08-18)Seed-OSS-36B-Instruct: 57.7 (2025-08-20)DeepSeek-V3.1: 53.3 (2025-08-21)Hermes 4 - Llama-3.1 405B: 20.7 (2025-08-27)Hermes 4 - Llama-3.1 70B: 6.7 (2025-08-27)Grok Code Fast 1: 48.3 (2025-08-28)Apertus 70B Instruct: 0 (2025-09-02)Apertus 8B Instruct: 0 (2025-09-02)Kimi K2 0905: 52.3 (2025-09-05)Ling-mini-2.0: 6.7 (2025-09-09)Qwen3-Next-80B-A3B: 60.3 (2025-09-10)Qwen3 Next 80B A3B Instruct: 51.3 (2025-09-11)Magistral Small 1.2: 16.3 (2025-09-17)Ling-flash-2.0: 15 (2025-09-17)Magistral Medium 1.2: 51.3 (2025-09-18)Grok 4 Fast: 64.7 (2025-09-19)Ring-flash-2.0: 21 (2025-09-19)DeepSeek V3.1 Terminus: 65 (2025-09-22)Granite 4.0 H Small: 9 (2025-09-22)Qwen3 Omni 30B A3B Instruct: 0 (2025-09-22)Qwen3 Omni 30B A3B: 0 (2025-09-22)GPT-5 Codex: 69 (2025-09-23)Qwen3 VL 235B A22B: 58.7 (2025-09-23)Qwen3 Max: 46.7 (2025-09-23)Qwen3 VL 235B A22B Instruct: 31.7 (2025-09-23)LFM2 2.6B: 0 (2025-09-23)DeepSeek V3.2 Exp: 69 (2025-09-29)Claude Sonnet 4.5: 65.7 (2025-09-29)GLM-4.6: 54.3 (2025-09-30)Apriel-v1.5-15B-Thinker: 20 (2025-09-30)Qwen3 VL 30B A3B: 40.7 (2025-10-03)Qwen3 VL 30B A3B Instruct: 23.7 (2025-10-06)LFM2 8B A1B: 0 (2025-10-07)Ling-1T: 34.7 (2025-10-08)Jamba Reasoning 3B: 7 (2025-10-08)Ring-1T: 45.7 (2025-10-13)Qwen3 VL 8B: 31 (2025-10-14)Qwen3 VL 4B: 21.3 (2025-10-14)Qwen3 VL 8B Instruct: 15.3 (2025-10-14)Qwen3 VL 4B Instruct: 13 (2025-10-14)Claude Haiku 4.5: 70.3 (2025-10-15)Phi 4 Mini Instruct: 13.7 (2025-10-17)Granite 4.0 Micro: 4 (2025-10-20)Qwen3 VL 32B: 55.3 (2025-10-21)Qwen3 VL 32B Instruct: 31.3 (2025-10-23)MiniMax-M2: 61 (2025-10-27)NVIDIA Nemotron Nano 12B v2 VL: 40 (2025-10-28)Granite 4.0 H 1B: 6.3 (2025-10-28)Granite 4.0 1B: 4 (2025-10-28)Granite 4.0 350M: 0 (2025-10-28)Granite 4.0 H 350M: 0 (2025-10-28)Kimi Linear 48B A3B Instruct: 25.7 (2025-10-30)Kimi K2 Thinking: 66.3 (2025-11-06)KAT-Coder-Pro V1: 74 (2025-11-11)Doubao Seed Code: 65.3 (2025-11-11)GPT-5.1: 75 (2025-11-12)GPT-5.1-Codex: 67.3 (2025-11-13)GPT-5.1-Codex-Mini: 62.7 (2025-11-13)ERNIE 5.0 Thinking: 6.7 (2025-11-13)Gemini 3 Pro: 70.7 (2025-11-18)Cogito v2.1: 21.7 (2025-11-18)Grok 4.1 Fast: 68 (2025-11-19)Olmo 3 7B Think: 0 (2025-11-20)Olmo 3 7B Instruct: 0 (2025-11-20)Olmo 3 32B Think: 0 (2025-11-21)Claude Opus 4.5: 74 (2025-11-24)Apriel-v1.6-15B-Thinker: 50.3 (2025-11-25)Nova 2.0 Omni: 53.7 (2025-11-26)Nova 2.0 Pro: 61.7 (2025-11-27)INTELLECT-3: 32.3 (2025-11-27)DeepSeek-V3.2: 65 (2025-12-01)DeepSeek V3.2 Speciale: 59.3 (2025-12-01)Nova 2 Lite: 58.3 (2025-12-02)Mistral Large 3: 34.7 (2025-12-02)Ministral 3 8B: 24 (2025-12-02)Ministral 3 14B: 22 (2025-12-02)Ministral 3 3B: 11.7 (2025-12-02)Motif-2-12.7B-Reasoning: 13 (2025-12-04)K2-V2: 33.3 (2025-12-05)GLM 4.6V: 40.3 (2025-12-08)Devstral 2: 30 (2025-12-09)Devstral Small 2: 24 (2025-12-09)GPT-5.2: 72.7 (2025-12-11)Mi:dm K 2.5 Pro: 11 (2025-12-11)Molmo2-8B: 0 (2025-12-11)Olmo 3.1 32B Think: 0 (2025-12-12)MiMo-V2-Flash: 64.3 (2025-12-14)K2 Think V2: 52.7 (2025-12-15)NVIDIA Nemotron 3 Nano 30B A3B: 33.7 (2025-12-15)Gemini 3 Flash: 66.3 (2025-12-17)Solar Open 100B: 36 (2025-12-17)GLM 4.7: 64 (2025-12-22)MiniMax M2.1: 59 (2025-12-23)HyperCLOVA X SEED Think: 11.7 (2025-12-26)K-EXAONE: 55.7 (2025-12-31)Falcon-H1R-7B: 8.7 (2026-01-04)LFM2.5-VL-1.6B: 0 (2026-01-05)LFM2.5-1.2B-Instruct: 0 (2026-01-05)Olmo 3.1 32B Instruct: 0 (2026-01-13)GLM 4.7 Flash: 35 (2026-01-19)Step3 VL 10B: 0 (2026-01-20)LFM2.5-1.2B-Thinking: 0 (2026-01-20)Kimi K2.5: 65.3 (2026-01-27)Solar Pro 3: 27 (2026-01-27)LongCat Flash Lite: 25.7 (2026-01-28)Step 3.5 Flash: 43 (2026-01-29)Qwen3 Coder Next: 40 (2026-02-04)Claude Opus 4.6: 70.7 (2026-02-05)Qwen3 Max Thinking: 66 (2026-02-09)Tri-21B-Think: 14.7 (2026-02-10)GLM-5: 63.3 (2026-02-11)Nanbeige4.1-3B: 0 (2026-02-11)MiniMax M2.5: 66 (2026-02-12)Qwen3.5 397B A17B: 65.7 (2026-02-16)Claude Sonnet 4.6: 70.7 (2026-02-17)Tiny Aya Global: 0 (2026-02-17)Gemini 3.1 Pro: 72.7 (2026-02-19)GPT-5.3-Codex: 74 (2026-02-24)Qwen3.5-27B: 67.3 (2026-02-25)Qwen3.5-122B-A10B: 66.7 (2026-02-25)Qwen3.5-35B-A3B: 62.7 (2026-02-25)LFM2-24B-A2B: 0 (2026-02-25)Qwen3.5 4B: 55.7 (2026-03-02)Qwen3.5 2B: 23.7 (2026-03-02)Qwen3.5 0.8B: 6.7 (2026-03-02)Mercury 2: 36.3 (2026-03-04)GPT-5.4: 74 (2026-03-05)Sarvam 30B: 0 (2026-03-06)Sarvam 105B: 0 (2026-03-06)Grok 4.20 0309: 59 (2026-03-10)Qwen3.5-9B: 59 (2026-03-10)NVIDIA Nemotron 3 Super 120B A12B: 60 (2026-03-11)GLM 5 Turbo: 60.7 (2026-03-15)Mistral Small 4: 44.7 (2026-03-16)NVIDIA Nemotron 3 Nano 4B: 16.7 (2026-03-16)GPT-5.4 mini: 69.3 (2026-03-17)GPT-5.4 nano: 66 (2026-03-17)MiniMax M2.7: 68.7 (2026-03-18)MiMo-V2-Omni: 66.7 (2026-03-18)MiMo-V2-Pro: 60.7 (2026-03-18)Nemotron Cascade 2 30B A3B: 34 (2026-03-19)KAT-Coder-Pro V2: 66 (2026-03-27)MiMo-V2-Omni-0327: 63.7 (2026-03-27)Qwen3.5 Omni Plus: 52.7 (2026-03-30)Qwen3.5 Omni Flash: 44 (2026-03-30)GLM 5V Turbo: 61 (2026-04-01)Trinity Large Thinking: 33 (2026-04-01)Qwen3.6 Plus: 69.7 (2026-04-02)Gemma 4 31B: 62 (2026-04-02)Step 3.5 Flash 2603: 54.3 (2026-04-02)Gemma 4 E2B: 15 (2026-04-02)Gemma 4 26B A4B: 55.7 (2026-04-03)Gemma 4 E4B: 30.7 (2026-04-03)GLM 5.1: 62.3 (2026-04-07)Grok 4.20 0309 v2: 58 (2026-04-07)Muse Spark: 69.7 (2026-04-08)EXAONE 4.5 33B: 49.3 (2026-04-09)JT-MINI: 11.7 (2026-04-15)Claude Opus 4.7: 70.3 (2026-04-16)Kimi K2.6: 69.7 (2026-04-20)Ling-2.6-flash: 25 (2026-04-21)MiMo-V2.5-Pro: 73.3 (2026-04-22)MiMo-V2.5: 62.7 (2026-04-22)Hy3: 54.7 (2026-04-22)GPT-5.5: 74.3 (2026-04-23)Ling-2.6-1T: 34.7 (2026-04-23)DeepSeek-V4-Pro: 66.3 (2026-04-24)DeepSeek-V4-Flash: 63 (2026-04-24)Qwen3.6 Max: 69.7 (2026-04-27)Qwen3.6 27B: 68.7 (2026-04-27)Qwen3.6 35B A3B: 63.7 (2026-04-27)Nemotron 3 Nano Omni 30B A3B Reasoning: 35.7 (2026-04-29)Granite 4.1 30B: 18.7 (2026-04-29)Granite 4.1 3B: 3 (2026-04-29)Mistral Medium 3.5: 61 (2026-04-30)Granite 4.1 8B: 12 (2026-04-30)Grok 4.3: 65 (2026-05-06)Gemini 3.1 Flash Lite: 65.3 (2026-05-07)Ring-2.6-1T: 64.3 (2026-05-08)MiniCPM-V 4.6 1.3B: 6.3 (2026-05-11)JT-35B-Flash: 55.3 (2026-05-14)Gemini 3.5 Flash: 71 (2026-05-19)Qwen3.7 Max: 69 (2026-05-21)MiniCPM5-1B: 4.7 (2026-05-25)Claude Opus 4.8: 67.7 (2026-05-28)Mistral 7B Instruct: 0 (2023-09-27)Mistral 7B InstructClaude 3 Haiku: 21 (2024-03-13)Claude 3 HaikuGPT-4o: 53 (2024-05-13)o1: 59.3 (2024-12-05)o1Claude 3.7 Sonnet: 60.7 (2025-02-24)Gemini 2.5 Pro: 66 (2025-03-25)Gemini 2.5 Proo3: 69.3 (2025-04-16)GPT-5: 75.6 (2025-08-07)GPT-5GPT-5.2-Codex: 75.7 (2026-01-14)GPT-5.2-Codex

Ranking

#1GPT-5.2-Codex75.7
#2GPT-575.6
#3GPT-5.175
#4GPT-5.574.3
#5KAT-Coder-Pro V174
#6GPT-5.3-Codex74
#7Claude Opus 4.574
#8GPT-5.474
#9MiMo-V2.5-Pro73.3
#10Gemini 3.1 Pro72.7
#11GPT-5.272.7
#12Gemini 3.5 Flash71
#13Gemini 3 Pro70.7
#14Claude Sonnet 4.670.7
#15Claude Opus 4.670.7
#16Claude Opus 4.770.3
#17Claude Haiku 4.570.3
#18Muse Spark69.7
#19Qwen3.6 Plus69.7
#20Qwen3.6 Max69.7
#21Kimi K2.669.7
#22GPT-5.4 mini69.3
#23o369.3
#24Qwen3.7 Max69
#25GPT-5 Codex69
#26DeepSeek V3.2 Exp69
#27MiniMax M2.768.7
#28Qwen3.6 27B68.7
#29Grok 4.1 Fast68
#30Grok 468
#31GPT-5 mini68
#32Claude Opus 4.867.7
#33GPT-5.1-Codex67.3
#34Qwen3.5-27B67.3
#35Qwen3 235B A22B 250767
#36MiMo-V2-Omni66.7
#37Qwen3.5-122B-A10B66.7
#38Kimi K2 Thinking66.3
#39DeepSeek-V4-Pro66.3
#40Gemini 3 Flash66.3
#41Claude Opus 4.166.3
#42KAT-Coder-Pro V266
#43Qwen3 Max Thinking66
#44MiniMax M2.566
#45GPT-5.4 nano66
#46Gemini 2.5 Pro66
#47Qwen3.5 397B A17B65.7
#48Claude Sonnet 4.565.7
#49Doubao Seed Code65.3
#50Gemini 3.1 Flash Lite65.3
#51Kimi K2.565.3
#52DeepSeek V3.1 Terminus65
#53DeepSeek-V3.265
#54Grok 4.365
#55Grok 4 Fast64.7
#56Claude Sonnet 464.7
#57MiMo-V2-Flash64.3
#58Ring-2.6-1T64.3
#59GLM 4.764
#60MiMo-V2-Omni-032763.7
#61Qwen3.6 35B A3B63.7
#62GLM-563.3
#63DeepSeek-V4-Flash63
#64MiMo-V2.562.7
#65GPT-5.1-Codex-Mini62.7
#66Qwen3.5-35B-A3B62.7
#67GLM 5.162.3
#68Gemma 4 31B62
#69Nova 2.0 Pro61.7
#70Gemini 2.5 Flash61.7
#71GLM 5V Turbo61
#72Mistral Medium 3.561
#73MiniMax-M261
#74GPT-4.161
#75MiMo-V2-Pro60.7
#76GLM 5 Turbo60.7
#77Claude 3.7 Sonnet60.7
#78Qwen3-Next-80B-A3B60.3
#79NVIDIA Nemotron 3 Super 120B A12B60
#80DeepSeek V3.2 Speciale59.3
#81o159.3
#82Qwen3 30B A3B 250759
#83Grok 4.20 030959
#84MiniMax M2.159
#85Qwen3.5-9B59
#86Qwen3 VL 235B A22B58.7
#87Nova 2 Lite58.3
#88Grok 4.20 0309 v258
#89Seed-OSS-36B-Instruct57.7
#90Qwen3.5 4B55.7
#91K-EXAONE55.7
#92Gemma 4 26B A4B55.7
#93Qwen3 VL 32B55.3
#94JT-35B-Flash55.3
#95o4-mini55
#96DeepSeek-R1-052854.7
#97Hy354.7
#98Grok-354.7
#99MiniMax M1 80k54.3
#100Step 3.5 Flash 260354.3
#101GLM-4.654.3
#102Nova 2.0 Omni53.7
#103DeepSeek-V3.153.3
#104GPT-4o53
#105Qwen3.5 Omni Plus52.7
#106K2 Think V252.7
#107Kimi K2 090552.3
#108DeepSeek-R152.3
#109MiniMax M1 40k51.7
#110Magistral Medium 1.251.3
#111Gemini 2.5 Flash Lite51.3
#112Qwen3 Next 80B A3B Instruct51.3
#113Kimi K251
#114gpt-oss-120b50.7
#115Apriel-v1.6-15B-Thinker50.3
#116Grok 3 mini Reasoning50.3
#117EXAONE 4.5 33B49.3
#118Grok Code Fast 148.3
#119GLM-4.548.3
#120Qwen3 Max46.7
#121Command A46
#122Llama 4 Maverick46
#123Ring-1T45.7
#124Mistral Small 444.7
#125Qwen3.5 Omni Flash44
#126GLM 4.5 Air43.7
#127Step 3.5 Flash43
#128Qwen3 Coder 480B A35B Instruct42.3
#129GPT-4.1 Mini42.3
#130GPT-5 nano41.7
#131DeepSeek-V3 032441
#132Qwen3 VL 30B A3B40.7
#133GLM 4.6V40.3
#134NVIDIA Nemotron Nano 12B v2 VL40
#135Qwen3 Coder Next40
#136o3-mini39.3
#137Qwen3 4B 250737.7
#138Mercury 236.3
#139Solar Open 100B36
#140Claude Opus 436
#141Nemotron 3 Nano Omni 30B A3B Reasoning35.7
#142GLM 4.7 Flash35
#143Ling-1T34.7
#144Ling-2.6-1T34.7
#145Mistral Large 334.7
#146Nemotron Cascade 2 30B A3B34
#147Llama Nemotron Super 49B v1.534
#148NVIDIA Nemotron 3 Nano 30B A3B33.7
#149K2-V233.3
#150Trinity Large Thinking33
#151INTELLECT-332.3
#152Qwen3 VL 235B A22B Instruct31.7
#153Qwen3 VL 32B Instruct31.3
#154Qwen3-235B-A22B-Instruct-250731.2
#155Qwen3 VL 8B31
#156gpt-oss-20b31
#157Gemma 4 E4B30.7
#158Nova Premier30
#159Devstral 230
#160Qwen3 Coder 30B A3B Instruct29
#161DeepSeek-V329
#162Devstral Medium28.7
#163Gemini 2.0 Flash28.3
#164Mistral Medium 328
#165Solar Pro 327
#166Devstral Small26.7
#167Llama 4 Scout25.8
#168LongCat Flash Lite25.7
#169Kimi Linear 48B A3B Instruct25.7
#170QwQ-32B25
#171Ling-2.6-flash25
#172Llama 3.1 405B Instruct24.3
#173Devstral Small 224
#174Ministral 3 8B24
#175Qwen3.5 2B23.7
#176Qwen3 VL 30B A3B Instruct23.7
#177Claude 3.5 Haiku23.3
#178Qwen3 30B A3B 2507 Instruct22.7
#179NVIDIA Nemotron Nano 9B V222.7
#180Ministral 3 14B22
#181Cogito v2.121.7
#182Qwen3 VL 4B21.3
#183Ring-flash-2.021
#184Claude 3 Haiku21
#185Hermes 4 - Llama-3.1 405B20.7
#186Qwen2.5 72B Instruct20.3
#187Apriel-v1.5-15B-Thinker20
#188Mistral Small 3.119.7
#189Mistral Medium 3.119.7
#190Nova Pro19
#191Granite 4.1 30B18.7
#192Nova Lite17.7
#193Mistral Small 3.217.3
#194Jamba Large 1.717.3
#195Llama-3.3 Nemotron Super 49B v117
#196GPT-4.1 Nano17
#197NVIDIA Nemotron 3 Nano 4B16.7
#198Magistral Small 1.216.3
#199Llama 3.1 8B Instruct15.7
#200Qwen3 VL 8B Instruct15.3
#201Ling-flash-2.015
#202Gemma 4 E2B15
#203Llama 3.3 70B Instruct15
#204Tri-21B-Think14.7
#205EXAONE 4.0 32B14
#206Phi 4 Mini Instruct13.7
#207Qwen3 VL 4B Instruct13
#208DeepSeek R1 0528 Qwen3 8B13
#209Motif-2-12.7B-Reasoning13
#210Jamba 1.7 Mini12.7
#211Granite 4.1 8B12
#212JT-MINI11.7
#213HyperCLOVA X SEED Think11.7
#214Llama 3.2 11B Instruct11.7
#215Ministral 3 3B11.7
#216Mi:dm K 2.5 Pro11
#217DeepSeek R1 Distill Llama 70B11
#218Pixtral Large10.3
#219Nova Micro9.7
#220DeepSeek R1 Distill Qwen 32B9.7
#221Granite 4.0 H Small9
#222Falcon-H1R-7B8.7
#223Qwen3 4B 2507 Instruct7.3
#224Llama 3.1 Nemotron Ultra 253B v17.3
#225Jamba Reasoning 3B7
#226Llama 3.1 Nemotron 70B Instruct7
#227DeepSeek R1 Distill Qwen 14B7
#228Gemma 3 12B Instruct6.7
#229Ling-mini-2.06.7
#230Qwen3.5 0.8B6.7
#231ERNIE 5.0 Thinking6.7
#232Hermes 4 - Llama-3.1 70B6.7
#233MiniCPM-V 4.6 1.3B6.3
#234Granite 4.0 H 1B6.3
#235Llama 3.1 70B Instruct6.3
#236Gemma 3 4B Instruct5.7
#237Gemma 3 27B Instruct5.7
#238Mistral Large 25.3
#239Llama 3.2 1B Instruct5
#240MiniCPM5-1B4.7
#241Granite 3.3 8B4.3
#242Granite 4.0 1B4
#243Granite 4.0 Micro4
#244Granite 4.1 3B3
#245ERNIE 4.5 300B A47B2.3
#246Phi-3 Mini Instruct 3.8B2
#247Llama 3.2 3B Instruct2
#248DeepSeek R1 Distill Qwen 1.5B0.3
#249Qwen3 0.6B0
#250Qwen3 4B0
#251Qwen3 1.7B0
#252Sarvam M0
#253OLMo 2 7B0
#254OLMo 2 32B0
#255LFM2 1.2B0
#256Magistral Small 10
#257Magistral Medium 10
#258Mistral 7B Instruct0
#259Gemma 3n E2B Instruct0
#260Gemma 3n E4B Instruct0
#261Gemma 3 1B Instruct0
#262Qwen3 Omni 30B A3B Instruct0
#263Qwen3 Omni 30B A3B0
#264Tiny Aya Global0
#265Apertus 70B Instruct0
#266Apertus 8B Instruct0
#267Nanbeige4.1-3B0
#268Sarvam 30B0
#269Sarvam 105B0
#270Exaone 4.0 1.2B0
#271Granite 4.0 350M0
#272Granite 4.0 H 350M0
#273Olmo 3.1 32B Think0
#274Molmo2-8B0
#275Olmo 3.1 32B Instruct0
#276Olmo 3 7B Think0
#277Molmo 7B-D0
#278Olmo 3 7B Instruct0
#279Step3 VL 10B0
#280Llama 3.1 Nemotron Nano 4B v1.10
#281Solar Pro 20
#282LFM2.5-VL-1.6B0
#283LFM2.5-1.2B-Thinking0
#284LFM2 2.6B0
#285LFM2.5-1.2B-Instruct0
#286LFM2 8B A1B0
#287Gemma 3 270M0
#288Reka Flash 30
#289DeepSeek R1 Distill Llama 8B0
#290Llama 3 8B Instruct0
#291Llama 3 70B Instruct0
#292Phi 40
#293Mistral Small 30
#294Qwen3 235B A22B0
#295Qwen3 32B0
#296Qwen3 14B0
#297Qwen3 8B0
#298Qwen3 30B A3B0
#299GLM 4.5V0
#300Olmo 3 32B Think0
#301LFM2-24B-A2B0

Related Long Context benchmarks