Llama 3.3 Nemotron Super 49B V1.5 vs Qwen3.7 Max

NVIDIA vs Alibaba — benchmarks, pricing, and capabilities side by side.

• Llama 3.3 Nemotron Super 49B V1.5 is cheaper ($0.10 vs $2.50 per 1M input tokens, $0.40 vs $7.50 per 1M output tokens)
• Qwen3.7 Max has a larger context window (1,000,000 vs 131,072 tokens)

Attribute	Llama 3.3 Nemotron Super 49B V1.5	Qwen3.7 Max
Intelligence index	—	92.3
Developer	NVIDIA	Alibaba
Type	LLM	LLM
Access	Open weights	API only
Context window	131,072 tokens	1,000,000 tokens
Input price	$0.10 / 1M tokens	$2.50 / 1M tokens
Output price	$0.40 / 1M tokens	$7.50 / 1M tokens
Speed	—	203 tok/s
Released	October 10, 2025	May 21, 2026
Parameters	—	—
Input modalities	Text	Text
Output modalities	Text	Text
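
To see what the per-token prices mean for a concrete workload, the listed rates can be plugged into a small cost estimate. This is a minimal sketch, not an official calculator: the `PRICES` dictionary, the `workload_cost` helper, and the example token counts are all hypothetical, and only the dollar figures come from the table above.

```python
# Prices in USD per 1M tokens, copied from the comparison table.
# The dictionary layout and function name are illustrative assumptions.
PRICES = {
    "Llama 3.3 Nemotron Super 49B V1.5": {"input": 0.10, "output": 0.40},
    "Qwen3.7 Max": {"input": 2.50, "output": 7.50},
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # listed prices are per 1M tokens


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500K output tokens.
    for name in PRICES:
        print(f"{name}: ${workload_cost(name, 2_000_000, 500_000):.2f}")
```

For this assumed workload the estimate comes to $0.40 for Llama 3.3 Nemotron Super 49B V1.5 and $8.75 for Qwen3.7 Max. The estimate covers listed token prices only; hosting costs for the open-weights model are not included.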
