Llama 3.1 Nemotron Ultra 253B v1 vs Qwen3.7 Max
NVIDIA vs Alibaba — benchmarks, pricing, and capabilities side by side.
- Qwen3.7 Max has the higher intelligence index (92.3 vs 78.3)
- Llama 3.1 Nemotron Ultra 253B v1 is cheaper ($0.60 vs $2.50 per 1M input tokens)
- Qwen3.7 Max is faster (203 vs 42 tok/s)
| | Llama 3.1 Nemotron Ultra 253B v1 | Qwen3.7 Max |
|---|---|---|
| Intelligence index | 78.3 | 92.3 |
| Developer | NVIDIA | Alibaba |
| Type | LLM | LLM |
| Access | Open weights | API only |
| Context window | — | 1,000,000 tokens |
| Input price | $0.60 / 1M | $2.50 / 1M |
| Output price | $1.80 / 1M | $7.50 / 1M |
| Speed | 42 tok/s | 203 tok/s |
| Released | April 7, 2025 | May 21, 2026 |
| Parameters | 253B (253,000,000,000) | — |
| Input modalities | — | Text |
| Output modalities | — | Text |
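The per-token prices and speeds in the table are easier to compare on a concrete workload. The sketch below is only an illustration: the `ModelSpec` class and the workload size (2M input tokens, 500K output tokens) are assumptions made for this example, and the time estimate treats the listed tok/s figure as a single sequential output stream, which real deployments may not match.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """Hypothetical container for the figures listed in the table above."""
    name: str
    input_usd_per_million: float   # USD per 1M input tokens
    output_usd_per_million: float  # USD per 1M output tokens
    tokens_per_second: float       # listed output speed

    def cost_usd(self, input_tokens: int, output_tokens: int) -> float:
        """Total cost in USD for a given number of input and output tokens."""
        total = (input_tokens * self.input_usd_per_million
                 + output_tokens * self.output_usd_per_million)
        return total / 1_000_000

    def generation_seconds(self, output_tokens: int) -> float:
        """Rough time to emit the output, assuming one sequential stream."""
        return output_tokens / self.tokens_per_second


# Values copied from the comparison table.
NEMOTRON = ModelSpec("Llama 3.1 Nemotron Ultra 253B v1", 0.60, 1.80, 42.0)
QWEN = ModelSpec("Qwen3.7 Max", 2.50, 7.50, 203.0)

# Assumed workload, chosen only for illustration.
INPUT_TOKENS = 2_000_000
OUTPUT_TOKENS = 500_000

if __name__ == "__main__":
    for model in (NEMOTRON, QWEN):
        cost = model.cost_usd(INPUT_TOKENS, OUTPUT_TOKENS)
        hours = model.generation_seconds(OUTPUT_TOKENS) / 3600
        print(f"{model.name}: ${cost:.2f}, about {hours:.2f} h of generation")
```

Under these assumptions the workload costs about $2.10 on Llama 3.1 Nemotron Ultra 253B v1 and about $8.75 on Qwen3.7 Max, so the cheaper model is also the slower one and the choice depends on whether cost or latency matters more.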
Shared benchmarks

| Benchmark | Llama 3.1 Nemotron Ultra 253B v1 | Qwen3.7 Max |
|---|---|---|
| GPQA Diamond | 76 | 92.3 |
| Humanity’s Last Exam | 8.1 | 38.1 |