AI Hub
← All models

Grok-4 Heavy vs Phi-4-multimodal-instruct

xAI vs Microsoft — benchmarks, pricing, and capabilities side by side.

  • Grok-4 Heavy has the higher intelligence index (89.3 vs 46.2)
Grok-4 HeavyPhi-4-multimodal-instruct
Intelligence index89.346.2
DeveloperxAIMicrosoft
TypeMultimodalMultimodal
AccessAPI onlyOpen weights
Context window128,000 tokens
Input price$0.05 / 1M
Output price$0.10 / 1M
Speed25 tok/s
ReleasedJuly 9, 2025February 1, 2025
Parameters5600000000
Input modalities
Output modalities

Shared benchmarks

Grok-4 Heavy
Phi-4-multimodal-instruct
GPQA Diamond
88.4
31.5
Humanity’s Last Exam
50.7
4.4
LiveCodeBench
79.4
13.1