Llama 3.1 Nemotron Nano 4B v1.1 vs Qwen3.7 Max

NVIDIA vs Alibaba — benchmarks, pricing, and capabilities side by side.

• Qwen3.7 Max has the higher intelligence index (92.3 vs 54.5)
• Llama 3.1 Nemotron Nano 4B v1.1 is cheaper ($0.00 vs $2.50 per 1M input tokens)

	Llama 3.1 Nemotron Nano 4B v1.1	Qwen3.7 Max
Intelligence index	54.5	92.3
Developer	NVIDIA	Alibaba
Type	LLM	LLM
Access	—	API only
Context window	—	1,000,000 tokens
Input price	$0.00 / 1M	$2.50 / 1M
Output price	$0.00 / 1M	$7.50 / 1M
Speed	—	203 tok/s
Released	May 20, 2025	May 21, 2026
Parameters	—	—
Input modalities	—	Text
Output modalities	—	Text
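
The per-token prices and the speed figure above can be turned into a rough per-request estimate. The sketch below is only an illustration. It assumes the listed speed is output tokens per second, ignores time to first token, and ignores any caching or tiered pricing. The function names and the example workload (10,000 input tokens, 2,000 output tokens) are hypothetical and do not come from either provider.

```python
# Figures copied from the Qwen3.7 Max column of the table above.
QWEN_INPUT_USD_PER_M = 2.50    # USD per 1M input tokens
QWEN_OUTPUT_USD_PER_M = 7.50   # USD per 1M output tokens
QWEN_SPEED_TOK_PER_S = 203     # assumed to mean output tokens per second


def estimate_cost_usd(input_tokens, output_tokens,
                      input_usd_per_m=QWEN_INPUT_USD_PER_M,
                      output_usd_per_m=QWEN_OUTPUT_USD_PER_M):
    """Return the estimated cost in USD for one request."""
    total = input_tokens * input_usd_per_m + output_tokens * output_usd_per_m
    return total / 1_000_000


def estimate_generation_seconds(output_tokens,
                                tokens_per_second=QWEN_SPEED_TOK_PER_S):
    """Return the estimated time to stream the output, excluding latency."""
    return output_tokens / tokens_per_second


if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
    cost = estimate_cost_usd(10_000, 2_000)
    seconds = estimate_generation_seconds(2_000)
    print(f"Estimated cost: ${cost:.4f}")
    print(f"Estimated generation time: {seconds:.1f} s")
```

With the $0.00 prices listed for Llama 3.1 Nemotron Nano 4B v1.1, the same cost formula returns zero for any workload. No speed is listed for that model, so a time estimate is not possible from this table.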

Shared benchmarks

	Llama 3.1 Nemotron Nano 4B v1.1	Qwen3.7 Max
GPQA Diamond	40.8	92.3
Humanity’s Last Exam	5.1	38.1
