Grok-4 Heavy vs Qwen3 VL 8B Instruct

xAI vs Alibaba — benchmarks, pricing, and capabilities side by side.

  • Grok-4 Heavy has the higher intelligence index (89.3 vs 43)
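Intelligence index: 89.3 (Grok-4 Heavy), 43 (Qwen3 VL 8B Instruct)
Developer: xAI (Grok-4 Heavy), Alibaba (Qwen3 VL 8B Instruct)
Type: Multimodal (Grok-4 Heavy), Multimodal (Qwen3 VL 8B Instruct)
Access: API only (Grok-4 Heavy), Open weights (Qwen3 VL 8B Instruct)
Context window: 256,000 tokens
Input price: $0.08 / 1M tokens
Output price: $0.50 / 1M tokens
Speed: 145 tok/s
Released: July 9, 2025 (Grok-4 Heavy), October 14, 2025 (Qwen3 VL 8B Instruct)
Parameters: not listed
Input modalities: Image, Text
Output modalities: Text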
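
To make the listed rates concrete, here is a minimal sketch of how per-million-token prices and a tokens-per-second speed translate into the cost and generation time of a single request. The defaults are the figures listed above ($0.08 and $0.50 per 1M tokens, 145 tok/s). The function names and the example token counts are hypothetical, and the sketch assumes flat per-token billing with no caching or tiered pricing, which the page does not confirm.

```python
def request_cost_usd(input_tokens: int, output_tokens: int,
                     input_price_per_million: float = 0.08,
                     output_price_per_million: float = 0.50) -> float:
    """Estimate the cost of one request from per-1M-token prices (flat billing assumed)."""
    total = (input_tokens * input_price_per_million
             + output_tokens * output_price_per_million)
    return total / 1_000_000


def generation_seconds(output_tokens: int, tokens_per_second: float = 145.0) -> float:
    """Estimate how long the output takes to generate at a steady token rate."""
    return output_tokens / tokens_per_second


# Hypothetical request: a 200,000-token prompt with a 4,000-token reply.
cost = request_cost_usd(200_000, 4_000)
seconds = generation_seconds(4_000)
print(f"${cost:.4f}")      # $0.0180
print(f"{seconds:.1f} s")  # 27.6 s
```

At these rates the prompt dominates the bill (1.6 cents of the 1.8 cents) even though output tokens cost more than six times as much per token, because the prompt is 50 times longer than the reply.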

Shared benchmarks

Benchmark              Grok-4 Heavy   Qwen3 VL 8B Instruct
AIME 2025              100            27.3
GPQA Diamond           88.4           42.7
Humanity’s Last Exam   50.7           2.9
LiveCodeBench          79.4           33.2