Grok-4 Heavy vs Phi-4-multimodal-instruct

xAI vs Microsoft — benchmarks, pricing, and capabilities side by side.

•Grok-4 Heavy has the higher intelligence index (89.3 vs 46.2)

	Grok-4 Heavy	Phi-4-multimodal-instruct
Intelligence index	89.3	46.2
Developer	xAI	Microsoft
Type	Multimodal	Multimodal
Access	API only	Open weights
Context window	—	128,000 tokens
Input price	—	$0.05 / 1M
Output price	—	$0.10 / 1M
Speed	—	25 tok/s
Released	July 9, 2025	February 1, 2025
Parameters	—	5600000000
Input modalities	—	—
Output modalities	—	—

Shared benchmarks

Grok-4 Heavy

Phi-4-multimodal-instruct

GPQA Diamond

88.4

31.5

Humanity’s Last Exam

50.7

4.4

LiveCodeBench

79.4

13.1

Grok-4 Heavy details Phi-4-multimodal-instruct details