DeepHermes 3 - Llama-3.1 8B vs GPT-5.1
Nous Research vs OpenAI — benchmarks, pricing, and capabilities side by side.
- •GPT-5.1 has the higher intelligence index (89 vs 23.5)
- •DeepHermes 3 - Llama-3.1 8B is cheaper ($0.00 vs $1.25 per 1M input)
| DeepHermes 3 - Llama-3.1 8B | GPT-5.1 | |
|---|---|---|
| Intelligence index | 23.5 | 89 |
| Developer | Nous Research | OpenAI |
| Type | LLM | LLM |
| Access | — | API only |
| Context window | — | 400,000 tokens |
| Input price | $0.00 / 1M | $1.25 / 1M |
| Output price | $0.00 / 1M | $10.00 / 1M |
| Speed | — | 115 tok/s |
| Released | February 13, 2025 | November 12, 2025 |
| Parameters | — | — |
| Input modalities | — | Text, Image |
| Output modalities | — | Text |
Shared benchmarks
DeepHermes 3 - Llama-3.1 8B
GPT-5.1
GPQA Diamond
27
88.1
Humanity’s Last Exam
4.3
26.5
LiveCodeBench
8.5
86.8
MMLU-Pro
36.5
87