Grok-4 Heavy vs Qwen3 VL 8B Instruct

xAI vs Alibaba — benchmarks, pricing, and capabilities side by side.

•Grok-4 Heavy has the higher intelligence index (89.3 vs 43)

	Grok-4 Heavy	Qwen3 VL 8B Instruct
Intelligence index	89.3	43
Developer	xAI	Alibaba
Type	Multimodal	Multimodal
Access	API only	Open weights
Context window	—	256,000 tokens
Input price	—	$0.08 / 1M
Output price	—	$0.50 / 1M
Speed	—	145 tok/s
Released	July 9, 2025	October 14, 2025
Parameters	—	—
Input modalities	—	Image, Text
Output modalities	—	Text

Shared benchmarks

Grok-4 Heavy

Qwen3 VL 8B Instruct

AIME 2025

100

27.3

GPQA Diamond

88.4

42.7

Humanity’s Last Exam

50.7

2.9

LiveCodeBench

79.4

33.2

Grok-4 Heavy details Qwen3 VL 8B Instruct details