MultimodalAPI only
Gemini 2.5 Flash
Updated May 21, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Report an error
Superseded by Gemini 3 Flash
Specifications
- Type
- Multimodal
- Access
- API only
- Released
- April 17, 2025
- License
- proprietary
- Context window
- 1,048,576 tokens
- Knowledge cutoff
- January 31, 2025
- Output speed
- 85 tok/s
- Latency (TTFT)
- 0.7s
- Input
- Text, Image, Audio, Video
- Output
- Text
- API pricing
- $0.30 in · $2.50 out / 1M tokens
Capabilities
Function callingStructured outputWeb search
Benchmarks
Reasoning
| GPQA Diamond | 82.8 |
Coding
| SWE-bench Verified | 60.4 |
| Aider Polyglot | 61.9 |
| Aider Polyglot Edit | 56.7 |
| LiveCodeBench | 69.5 |
Multimodal
| MMMU | 79.7 |
General
| Humanity’s Last Exam | 11 |
| SimpleQA | 26.9 |
| MMLU-Pro | 83.2 |
Our take
A fast, low-cost reasoning model with a controllable thinking budget — the high-volume workhorse of the Gemini 2.5 line.