
Llama 3.3 Nemotron Super 49B V1.5 vs Qwen3.7 Max

NVIDIA vs Alibaba — benchmarks, pricing, and capabilities side by side.

  • Llama 3.3 Nemotron Super 49B V1.5 is cheaper ($0.10 vs $2.50 per 1M input tokens, $0.40 vs $7.50 per 1M output tokens)
  • Qwen3.7 Max has a larger context window (1,000,000 vs 131,072 tokens)
Model: Llama 3.3 Nemotron Super 49B V1.5 | Qwen3.7 Max
Intelligence index: 92.3
Developer: NVIDIA | Alibaba
Type: LLM | LLM
Access: Open weights | API only
Context window: 131,072 tokens | 1,000,000 tokens
Input price: $0.10 / 1M tokens | $2.50 / 1M tokens
Output price: $0.40 / 1M tokens | $7.50 / 1M tokens
Speed: 203 tok/s
Released: October 10, 2025 | May 21, 2026
Parameters
Input modalities: Text | Text
Output modalities: Text | Text
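
To make the price gap concrete, the listed per-token prices and context windows can be turned into a rough per-request estimate. The following is a minimal sketch, not an official calculator: the function names `request_cost` and `fits_context` are hypothetical, and it assumes billing is strictly linear in tokens (no caching discounts, tiers, or minimum charges, none of which this page states).

```python
# Prices (USD per 1M tokens) and context windows copied from the comparison above.
MODELS = {
    "Llama 3.3 Nemotron Super 49B V1.5": {
        "input_per_1m": 0.10,
        "output_per_1m": 0.40,
        "context_window": 131_072,
    },
    "Qwen3.7 Max": {
        "input_per_1m": 2.50,
        "output_per_1m": 7.50,
        "context_window": 1_000_000,
    },
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request, assuming linear per-token billing."""
    spec = MODELS[model]
    total = (
        input_tokens * spec["input_per_1m"]
        + output_tokens * spec["output_per_1m"]
    )
    return total / 1_000_000


def fits_context(model: str, total_tokens: int) -> bool:
    """True if the token count is within the model's listed context window."""
    return total_tokens <= MODELS[model]["context_window"]


if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens and 1,000 output tokens.
    for name in MODELS:
        cost = request_cost(name, 10_000, 1_000)
        print(f"{name}: ${cost:.4f} per request, fits: {fits_context(name, 11_000)}")
```

For the hypothetical 10,000-in / 1,000-out request, this gives roughly $0.0014 for Llama 3.3 Nemotron Super 49B V1.5 and $0.0325 for Qwen3.7 Max. A 200,000-token prompt would exceed the former's 131,072-token window but fit within the latter's.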