Gemini 3.5 Flash vs Hermes 4 - Llama-3.1 405B
Google vs Nous Research — benchmarks, pricing, and capabilities side by side.
- •Gemini 3.5 Flash has the higher intelligence index (92.2 vs 73.5)
- •Hermes 4 - Llama-3.1 405B is cheaper ($1.00 vs $1.50 per 1M input)
- •Gemini 3.5 Flash is faster
| Gemini 3.5 Flash | Hermes 4 - Llama-3.1 405B | |
|---|---|---|
| Intelligence index | 92.2 | 73.5 |
| Developer | Nous Research | |
| Type | Multimodal | LLM |
| Access | API only | — |
| Context window | 1,048,576 tokens | — |
| Input price | $1.50 / 1M | $1.00 / 1M |
| Output price | $9.00 / 1M | $3.00 / 1M |
| Speed | 221 tok/s | 34 tok/s |
| Released | May 19, 2026 | August 27, 2025 |
| Parameters | — | — |
| Input modalities | Text, Image, Audio, Video | — |
| Output modalities | Text | — |
Shared benchmarks
Gemini 3.5 Flash
Hermes 4 - Llama-3.1 405B
GPQA Diamond
92.2
72.7
Humanity’s Last Exam
41
10.3