MultimodalAPI only
Gemini 2.0 Flash
Updated May 21, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Report an error
Superseded by Gemini 2.5 Flash
Specifications
- Type
- Multimodal
- Access
- API only
- Released
- December 11, 2024
- License
- Proprietary
- Context window
- 1,000,000 tokens
- Knowledge cutoff
- August 1, 2024
- Output speed
- 183 tok/s
- Latency (TTFT)
- 0.4s
- Input
- Text, Image, Audio, Video
- Output
- Text
- API pricing
- $0.10 in · $0.40 out / 1M tokens
Capabilities
Function callingStructured outputWeb search
Benchmarks
Reasoning
| GPQA Diamond | 62.1 |
Coding
| LiveCodeBench | 35.1 |
Multimodal
| MMMU | 70.7 |
General
| MMLU | 87 |
| MMLU-Pro | 76.4 |
| Humanity’s Last Exam | 5.3 |
Our take
Google’s aggressively cheap, fast multimodal model with native tool use and a 1M-token window — a strong default for high-volume, latency-sensitive workloads.