Gemini 3.5 Flash vs Llama 3 70B Instruct
Google vs Meta — benchmarks, pricing, and capabilities side by side.
- •Gemini 3.5 Flash has the higher intelligence index (92.2 vs 40.8)
- •Llama 3 70B Instruct is cheaper ($0.51 vs $1.50 per 1M input)
- •Gemini 3.5 Flash is faster
- •Gemini 3.5 Flash has a larger context window (1M)
| Gemini 3.5 Flash | Llama 3 70B Instruct | |
|---|---|---|
| Intelligence index | 92.2 | 40.8 |
| Developer | Meta | |
| Type | Multimodal | LLM |
| Access | API only | Open weights |
| Context window | 1,048,576 tokens | 8,192 tokens |
| Input price | $1.50 / 1M | $0.51 / 1M |
| Output price | $9.00 / 1M | $0.74 / 1M |
| Speed | 221 tok/s | 45 tok/s |
| Released | May 19, 2026 | April 18, 2024 |
| Parameters | — | — |
| Input modalities | Text, Image, Audio, Video | Text |
| Output modalities | Text | Text |
Shared benchmarks
Gemini 3.5 Flash
Llama 3 70B Instruct
GPQA Diamond
92.2
37.9
Humanity’s Last Exam
41
4.4