AI Hub
← All models

Llama 3.1 Nemotron Ultra 253B v1 vs Qwen3.6 Max

NVIDIA vs Alibaba — benchmarks, pricing, and capabilities side by side.

  • Qwen3.6 Max has the higher intelligence index (88.8 vs 78.3)
  • Llama 3.1 Nemotron Ultra 253B v1 is cheaper ($0.60 vs $1.04 per 1M input)
  • Llama 3.1 Nemotron Ultra 253B v1 is faster
Llama 3.1 Nemotron Ultra 253B v1Qwen3.6 Max
Intelligence index78.388.8
DeveloperNVIDIAAlibaba
TypeLLMLLM
AccessOpen weightsAPI only
Context window262,144 tokens
Input price$0.60 / 1M$1.04 / 1M
Output price$1.80 / 1M$6.24 / 1M
Speed42 tok/s36 tok/s
ReleasedApril 7, 2025April 27, 2026
Parameters253000000000
Input modalitiesText
Output modalitiesText

Shared benchmarks

Llama 3.1 Nemotron Ultra 253B v1
Qwen3.6 Max
GPQA Diamond
76
88.8
Humanity’s Last Exam
8.1
28.9