GPT-5.1-Codex-Mini vs Grok-4 Heavy
OpenAI vs xAI — benchmarks, pricing, and capabilities side by side.
- Grok-4 Heavy has the higher intelligence index (89.3 vs. 84.7).

| | GPT-5.1-Codex-Mini | Grok-4 Heavy |
|---|---|---|
| Intelligence index | 84.7 | 89.3 |
| Developer | OpenAI | xAI |
| Type | Multimodal | Multimodal |
| Access | API only | API only |
| Context window | 400,000 tokens | — |
| Input price | $0.25 / 1M tokens | — |
| Output price | $2.00 / 1M tokens | — |
| Speed | 175 tok/s | — |
| Released | November 13, 2025 | July 9, 2025 |
| Parameters | — | — |
| Input modalities | Image, Text | — |
| Output modalities | Text | — |
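
The per-token prices listed for GPT-5.1-Codex-Mini can be turned into a rough per-request cost. The sketch below is illustrative only: the function name `estimate_cost` and the token counts are hypothetical, and it assumes flat per-token billing with no caching discounts or other adjustments. No pricing is listed for Grok-4 Heavy, so it is left out.

```python
# Prices for GPT-5.1-Codex-Mini as listed in the table (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.25
OUTPUT_PRICE_PER_M = 2.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming flat per-token billing."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: 100,000 input tokens and 10,000 output tokens.
cost = estimate_cost(100_000, 10_000)
print(f"${cost:.4f}")  # prints $0.0450
```

Because output tokens cost eight times as much as input tokens at these rates, output-heavy workloads dominate the bill even when the prompt is long.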

Shared benchmarks

| Benchmark | GPT-5.1-Codex-Mini | Grok-4 Heavy |
|---|---|---|
| AIME 2025 | 91.7 | 100 |
| GPQA Diamond | 81.3 | 88.4 |
| Humanity’s Last Exam | 16.9 | 50.7 |
| LiveCodeBench | 83.6 | 79.4 |