Grok-4 Heavy vs Phi-4-multimodal-instruct
xAI vs Microsoft — benchmarks, pricing, and capabilities side by side.
- •Grok-4 Heavy has the higher intelligence index (89.3 vs 46.2)
| Grok-4 Heavy | Phi-4-multimodal-instruct | |
|---|---|---|
| Intelligence index | 89.3 | 46.2 |
| Developer | xAI | Microsoft |
| Type | Multimodal | Multimodal |
| Access | API only | Open weights |
| Context window | — | 128,000 tokens |
| Input price | — | $0.05 / 1M |
| Output price | — | $0.10 / 1M |
| Speed | — | 25 tok/s |
| Released | July 9, 2025 | February 1, 2025 |
| Parameters | — | 5600000000 |
| Input modalities | — | — |
| Output modalities | — | — |
Shared benchmarks
Grok-4 Heavy
Phi-4-multimodal-instruct
GPQA Diamond
88.4
31.5
Humanity’s Last Exam
50.7
4.4
LiveCodeBench
79.4
13.1