
GPT-5.1 vs Hermes 4 - Llama-3.1 405B

OpenAI vs Nous Research — benchmarks, pricing, and capabilities side by side.

  • GPT-5.1 has the higher intelligence index (89 vs 73.5)
  • Hermes 4 - Llama-3.1 405B is cheaper ($1.00 vs $1.25 per 1M input tokens, $3.00 vs $10.00 per 1M output tokens)
  • GPT-5.1 is faster (115 vs 34 tok/s)
| | GPT-5.1 | Hermes 4 - Llama-3.1 405B |
|---|---|---|
| Intelligence index | 89 | 73.5 |
| Developer | OpenAI | Nous Research |
| Type | LLM | LLM |
| Access | API only | |
| Context window | 400,000 tokens | |
| Input price | $1.25 / 1M tokens | $1.00 / 1M tokens |
| Output price | $10.00 / 1M tokens | $3.00 / 1M tokens |
| Speed | 115 tok/s | 34 tok/s |
| Released | November 12, 2025 | August 27, 2025 |
| Parameters | | |
| Input modalities | Text, Image | |
| Output modalities | Text | |
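
To make the price and speed figures concrete, here is a minimal Python sketch that estimates the cost and generation time of a single request from the listed numbers. The workload (10,000 input tokens, 2,000 output tokens) is hypothetical, the function names are illustrative, and the sketch assumes that the listed speed is output throughput and that time to first token can be ignored.

```python
# Prices in USD per 1M tokens and speeds in tokens/second, copied from the listing above.
MODELS = {
    "GPT-5.1": {"input": 1.25, "output": 10.00, "speed": 115},
    "Hermes 4 - Llama-3.1 405B": {"input": 1.00, "output": 3.00, "speed": 34},
}


def request_cost(model, input_tokens, output_tokens):
    """Return the USD cost of one request at the listed per-1M-token prices."""
    price = MODELS[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


def generation_seconds(model, output_tokens):
    """Rough time to stream the output (assumption: ignores time to first token)."""
    return output_tokens / MODELS[model]["speed"]


# Hypothetical workload: 10,000 input tokens and 2,000 output tokens per request.
for name in MODELS:
    cost = request_cost(name, 10_000, 2_000)
    secs = generation_seconds(name, 2_000)
    print(f"{name}: ${cost:.4f} per request, ~{secs:.0f} s to generate")
```

Under these assumptions the request costs about $0.0325 on GPT-5.1 and $0.0160 on Hermes 4 - Llama-3.1 405B, while generation takes roughly 17 s and 59 s respectively. The price gap widens as the share of output tokens grows, because the output prices differ far more than the input prices.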

Shared benchmarks

| Benchmark | GPT-5.1 | Hermes 4 - Llama-3.1 405B |
|---|---|---|
| AIME 2025 | 94 | 69.7 |
| GPQA Diamond | 88.1 | 72.7 |
| Humanity’s Last Exam | 26.5 | 10.3 |
| LiveCodeBench | 86.8 | 68.6 |
| MMLU-Pro | 87 | 82.9 |
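
As a quick way to see where the two models differ most, the following Python sketch computes the per-benchmark gap from the scores listed above. It assumes that all five benchmarks are reported on the same 0 to 100 scale, which the page does not state explicitly, and the variable names are illustrative.

```python
# (GPT-5.1, Hermes 4 - Llama-3.1 405B) scores, copied from the shared benchmarks above.
SCORES = {
    "AIME 2025": (94.0, 69.7),
    "GPQA Diamond": (88.1, 72.7),
    "Humanity's Last Exam": (26.5, 10.3),
    "LiveCodeBench": (86.8, 68.6),
    "MMLU-Pro": (87.0, 82.9),
}

# Gap in points (GPT-5.1 minus Hermes 4), rounded to hide floating-point noise.
gaps = {name: round(gpt - hermes, 1) for name, (gpt, hermes) in SCORES.items()}

# List the benchmarks from the widest gap to the narrowest.
for name, gap in sorted(gaps.items(), key=lambda item: item[1], reverse=True):
    print(f"{name}: +{gap:.1f} points for GPT-5.1")
```

On these numbers GPT-5.1 leads on every shared benchmark, by the widest margin on AIME 2025 (24.3 points) and the narrowest on MMLU-Pro (4.1 points).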