AI Hub
← All models

Llama 3.1 Nemotron Nano 4B v1.1 vs Qwen3.7 Max

NVIDIA vs Alibaba — benchmarks, pricing, and capabilities side by side.

  • Qwen3.7 Max has the higher intelligence index (92.3 vs 54.5)
  • Llama 3.1 Nemotron Nano 4B v1.1 is cheaper ($0.00 vs $2.50 per 1M input)
Llama 3.1 Nemotron Nano 4B v1.1Qwen3.7 Max
Intelligence index54.592.3
DeveloperNVIDIAAlibaba
TypeLLMLLM
AccessAPI only
Context window1,000,000 tokens
Input price$0.00 / 1M$2.50 / 1M
Output price$0.00 / 1M$7.50 / 1M
Speed203 tok/s
ReleasedMay 20, 2025May 21, 2026
Parameters
Input modalitiesText
Output modalitiesText

Shared benchmarks

Llama 3.1 Nemotron Nano 4B v1.1
Qwen3.7 Max
GPQA Diamond
40.8
92.3
Humanity’s Last Exam
5.1
38.1