
Gemini 3.5 Flash vs Hermes 4 - Llama-3.1 405B

Google vs Nous Research — benchmarks, pricing, and capabilities side by side.

  • Gemini 3.5 Flash has the higher intelligence index (92.2 vs 73.5)
  • Hermes 4 - Llama-3.1 405B is cheaper ($1.00 vs $1.50 per 1M input tokens, $3.00 vs $9.00 per 1M output tokens)
  • Gemini 3.5 Flash is faster (221 tok/s vs 34 tok/s)

                    Gemini 3.5 Flash            Hermes 4 - Llama-3.1 405B
Intelligence index  92.2                        73.5
Developer           Google                      Nous Research
Type                Multimodal                  LLM
Access              API only                    -
Context window      1,048,576 tokens            -
Input price         $1.50 / 1M tokens           $1.00 / 1M tokens
Output price        $9.00 / 1M tokens           $3.00 / 1M tokens
Speed               221 tok/s                   34 tok/s
Released            May 19, 2026                August 27, 2025
Parameters          -                           -
Input modalities    Text, Image, Audio, Video   -
Output modalities   Text                        -
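
The per-token prices and speeds above can be turned into a rough per-workload estimate. The sketch below is illustrative only: the `estimate` helper and the workload size are hypothetical, and it assumes the listed speed is single-stream output throughput and ignores queueing and time to first token.

```python
# Prices in USD per 1M tokens and speeds in tokens/s, copied from the table above.
MODELS = {
    "Gemini 3.5 Flash": {"input": 1.50, "output": 9.00, "speed": 221},
    "Hermes 4 - Llama-3.1 405B": {"input": 1.00, "output": 3.00, "speed": 34},
}


def estimate(model, input_tokens, output_tokens):
    """Return (cost in USD, generation time in seconds) for one workload.

    Assumption: generation time is output_tokens divided by the listed speed.
    """
    m = MODELS[model]
    cost = (input_tokens * m["input"] + output_tokens * m["output"]) / 1_000_000
    seconds = output_tokens / m["speed"]
    return cost, seconds


# Hypothetical workload: 2M input tokens and 500k output tokens.
for name in MODELS:
    cost, seconds = estimate(name, 2_000_000, 500_000)
    print(f"{name}: ${cost:.2f}, about {seconds / 60:.0f} min of generation")
```

For this hypothetical workload the cost comes to $7.50 for Gemini 3.5 Flash and $3.50 for Hermes 4 - Llama-3.1 405B. The gap is wider than the input prices alone suggest because the output price differs by a factor of three.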

Shared benchmarks

                      Gemini 3.5 Flash   Hermes 4 - Llama-3.1 405B
GPQA Diamond          92.2               72.7
Humanity’s Last Exam  41                 10.3