Grok-4 Heavy vs Llama 3.2 11B Instruct
xAI vs Meta — benchmarks, pricing, and capabilities side by side.
- •Grok-4 Heavy has the higher intelligence index (89.3 vs 36.7)
| Grok-4 Heavy | Llama 3.2 11B Instruct | |
|---|---|---|
| Intelligence index | 89.3 | 36.7 |
| Developer | xAI | Meta |
| Type | Multimodal | Multimodal |
| Access | API only | Open weights |
| Context window | — | 128,000 tokens |
| Input price | — | $0.05 / 1M |
| Output price | — | $0.05 / 1M |
| Speed | — | 168 tok/s |
| Released | July 9, 2025 | September 25, 2024 |
| Parameters | — | 10600000000 |
| Input modalities | — | — |
| Output modalities | — | — |
Shared benchmarks
Grok-4 Heavy
Llama 3.2 11B Instruct
AIME 2025
100
1.7
GPQA Diamond
88.4
32.8
Humanity’s Last Exam
50.7
5.2
LiveCodeBench
79.4
11