Grok-4 Heavy vs Qwen3 VL 32B Instruct

xAI vs Alibaba — benchmarks, pricing, and capabilities side by side.

• Grok-4 Heavy has the higher intelligence index (89.3 vs 66.5).

	Grok-4 Heavy	Qwen3 VL 32B Instruct
Intelligence index	89.3	66.5
Developer	xAI	Alibaba
Type	Multimodal	Multimodal
Access	API only	Open weights
Context window	—	262,144 tokens
Input price	—	$0.10 / 1M
Output price	—	$0.42 / 1M
Speed	—	76 tok/s
Released	July 9, 2025	October 23, 2025
Parameters	—	—
Input modalities	—	Text, Image
Output modalities	—	Text
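
As a rough illustration of what the listed Qwen3 VL 32B Instruct prices and speed mean in practice, the sketch below estimates the cost and generation time of a single request. The helper names and the example token counts are hypothetical. The calculation assumes the prices apply uniformly per token and that the listed 76 tok/s is sustained output throughput, ignoring latency before the first token. Grok-4 Heavy is left out because the table lists no price or speed for it.

```python
# Figures taken from the comparison table for Qwen3 VL 32B Instruct.
INPUT_PRICE_PER_M = 0.10   # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 0.42  # USD per 1M output tokens
OUTPUT_SPEED_TPS = 76.0    # output tokens per second


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def estimate_generation_seconds(output_tokens: int) -> float:
    """Estimated time to stream the output, assuming constant throughput."""
    return output_tokens / OUTPUT_SPEED_TPS


if __name__ == "__main__":
    # Example workload (assumed): 200k input tokens, 50k output tokens.
    cost = estimate_cost(200_000, 50_000)
    seconds = estimate_generation_seconds(50_000)
    print(f"cost: ${cost:.3f}, generation time: {seconds:.0f} s")
```

With these assumed token counts the input side contributes about $0.02 and the output side about $0.021, so output tokens dominate the bill once responses reach roughly a quarter of the prompt length.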

Shared benchmarks

	Grok-4 Heavy	Qwen3 VL 32B Instruct
AIME 2025	100	68.3
GPQA Diamond	88.4	67.1
Humanity’s Last Exam	50.7	6.3
LiveCodeBench	79.4	51.4
