
Grok-4 Heavy vs Qwen3 VL 32B Instruct

xAI vs Alibaba — benchmarks, pricing, and capabilities side by side.

  • Grok-4 Heavy has the higher intelligence index (89.3 vs 66.5)
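| | Grok-4 Heavy | Qwen3 VL 32B Instruct |
|---|---|---|
| Intelligence index | 89.3 | 66.5 |
| Developer | xAI | Alibaba |
| Type | Multimodal | Multimodal |
| Access | API only | Open weights |
| Released | July 9, 2025 | October 23, 2025 |
| Parameters | | |

Fields listed with a single value (model not indicated):

Context window: 262,144 tokens
Input price: $0.10 / 1M tokens
Output price: $0.42 / 1M tokens
Speed: 76 tok/s
Input modalities: Text, Image
Output modalities: Text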
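As a rough illustration of what the listed prices and speed mean in practice, the sketch below estimates the cost and generation time of a single request. The rates are copied from this page, which does not say which model they belong to, so they are treated as plain inputs. The function names and the example workload are hypothetical, and the timing assumes the listed speed is sustained output throughput.

```python
# Rates as listed on this page (USD per 1M tokens, tokens per second).
# The page does not attribute them to a specific model.
INPUT_PRICE_PER_M = 0.10
OUTPUT_PRICE_PER_M = 0.42
SPEED_TOK_PER_S = 76


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed per-token rates."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def estimate_generation_seconds(output_tokens: int) -> float:
    """Estimated time to stream the output, assuming constant throughput."""
    return output_tokens / SPEED_TOK_PER_S


# Hypothetical workload: 200,000 prompt tokens, 50,000 completion tokens.
cost = estimate_cost(200_000, 50_000)
seconds = estimate_generation_seconds(50_000)
print(f"cost: ${cost:.4f}, generation time: {seconds:.0f} s")
```

At these rates the hypothetical request costs roughly four cents, and most of the wall-clock time comes from generating the output.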

Shared benchmarks

| Benchmark | Grok-4 Heavy | Qwen3 VL 32B Instruct |
|---|---|---|
| AIME 2025 | 100 | 68.3 |
| GPQA Diamond | 88.4 | 67.1 |
| Humanity’s Last Exam | 50.7 | 6.3 |
| LiveCodeBench | 79.4 | 51.4 |
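To make the size of each gap easier to read, the following sketch computes the point difference per shared benchmark from the scores above. The dictionary layout and variable names are illustrative choices, not something the page defines.

```python
# Scores copied from the shared benchmarks on this page:
# benchmark -> (Grok-4 Heavy, Qwen3 VL 32B Instruct)
scores = {
    "AIME 2025": (100.0, 68.3),
    "GPQA Diamond": (88.4, 67.1),
    "Humanity's Last Exam": (50.7, 6.3),
    "LiveCodeBench": (79.4, 51.4),
}

# Point gap in favor of Grok-4 Heavy, rounded to one decimal place
# to hide floating-point noise.
gaps = {name: round(grok - qwen, 1) for name, (grok, qwen) in scores.items()}

# Print benchmarks from the largest gap to the smallest.
for name, gap in sorted(gaps.items(), key=lambda item: item[1], reverse=True):
    print(f"{name}: +{gap} points")
```

By this measure the widest gap is on Humanity's Last Exam and the narrowest is on GPQA Diamond.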