Gemini 2.0 Flash Thinking vs Grok-4 Heavy
Google vs xAI — benchmarks, pricing, and capabilities side by side.
- Grok-4 Heavy has the higher intelligence index (89.3 vs 69.1)
| | Gemini 2.0 Flash Thinking | Grok-4 Heavy |
|---|---|---|
| Intelligence index | 69.1 | 89.3 |
| Developer | Google | xAI |
| Type | Multimodal | Multimodal |
| Access | API only | API only |
| Context window | — | — |
| Input price | $0.00 / 1M | — |
| Output price | $0.00 / 1M | — |
| Speed | — | — |
| Released | January 21, 2025 | July 9, 2025 |
| Parameters | — | — |
| Input modalities | — | — |
| Output modalities | — | — |
Shared benchmarks
| | Gemini 2.0 Flash Thinking | Grok-4 Heavy |
|---|---|---|
| GPQA Diamond | 74.2 | 88.4 |
| Humanity’s Last Exam | 7.1 | 50.7 |
| LiveCodeBench | 32.1 | 79.4 |