Claude 3 Sonnet vs Grok-4 Heavy

Anthropic vs xAI — benchmarks, pricing, and capabilities side by side.

•Grok-4 Heavy has the higher intelligence index (89.3 vs 45.8)

	Claude 3 Sonnet	Grok-4 Heavy
Intelligence index	45.8	89.3
Developer	Anthropic	xAI
Type	Multimodal	Multimodal
Access	API only	API only
Context window	200,000 tokens	—
Input price	$3.00 / 1M	—
Output price	$15.00 / 1M	—
Speed	120 tok/s	—
Released	February 29, 2024	July 9, 2025
Parameters	—	—
Input modalities	—	—
Output modalities	—	—

Shared benchmarks

Claude 3 Sonnet

Grok-4 Heavy

GPQA Diamond

40.4

88.4

Humanity’s Last Exam

3.8

50.7

LiveCodeBench

17.5

79.4

Claude 3 Sonnet details Grok-4 Heavy details