Llama 3.1 Nemotron Ultra 253B v1 vs Qwen3.7 Max

NVIDIA vs Alibaba — benchmarks, pricing, and capabilities side by side.

•Qwen3.7 Max has the higher intelligence index (92.3 vs 78.3)
•Llama 3.1 Nemotron Ultra 253B v1 is cheaper ($0.60 vs $2.50 per 1M input)
•Qwen3.7 Max is faster

	Llama 3.1 Nemotron Ultra 253B v1	Qwen3.7 Max
Intelligence index	78.3	92.3
Developer	NVIDIA	Alibaba
Type	LLM	LLM
Access	Open weights	API only
Context window	—	1,000,000 tokens
Input price	$0.60 / 1M	$2.50 / 1M
Output price	$1.80 / 1M	$7.50 / 1M
Speed	42 tok/s	203 tok/s
Released	April 7, 2025	May 21, 2026
Parameters	253000000000	—
Input modalities	—	Text
Output modalities	—	Text

Shared benchmarks

Llama 3.1 Nemotron Ultra 253B v1

Qwen3.7 Max

GPQA Diamond

76

92.3

Humanity’s Last Exam

8.1

38.1

Llama 3.1 Nemotron Ultra 253B v1 details Qwen3.7 Max details