Claude 3 Sonnet vs Grok-4 Heavy
Anthropic vs xAI: benchmarks, pricing, and capabilities side by side.
- Grok-4 Heavy has the higher intelligence index (89.3 vs 45.8)
| | Claude 3 Sonnet | Grok-4 Heavy |
|---|---|---|
| Intelligence index | 45.8 | 89.3 |
| Developer | Anthropic | xAI |
| Type | Multimodal | Multimodal |
| Access | API only | API only |
| Context window | 200,000 tokens | — |
| Input price | $3.00 / 1M | — |
| Output price | $15.00 / 1M | — |
| Speed | 120 tok/s | — |
| Released | February 29, 2024 | July 9, 2025 |
| Parameters | — | — |
| Input modalities | — | — |
| Output modalities | — | — |
Shared benchmarks

| | Claude 3 Sonnet | Grok-4 Heavy |
|---|---|---|
| GPQA Diamond | 40.4 | 88.4 |
| Humanity’s Last Exam | 3.8 | 50.7 |
| LiveCodeBench | 17.5 | 79.4 |