MultimodalAPI only

Gemini 2.0 Flash

Name: Gemini 2.0 Flash
Author: Google

Google

Updated May 21, 2026 · Benchmarks via Artificial Analysis, specs & pricing via OpenRouter · Methodology · Report an error

Superseded by Gemini 2.5 Flash

Specifications

Type: Multimodal
Access: API only
Released: December 11, 2024
License: Proprietary
Context window: 1,000,000 tokens
Knowledge cutoff: August 1, 2024
Output speed: 183 tok/s
Latency (TTFT): 0.4s
Input: Text, Image, Audio, Video
Output: Text
API pricing: $0.10 in · $0.40 out / 1M tokens

Capabilities

Function callingStructured outputWeb search

Benchmarks

Reasoning

GPQA Diamond

62.1

Coding

LiveCodeBench

35.1

Math

MATH	89.7
MATH-500	93
AIME 2025	21.7

Multimodal

MMMU

70.7

General

MMLU	87
MMLU-Pro	76.4
Humanity’s Last Exam	5.3

Our take

Google’s aggressively cheap, fast multimodal model with native tool use and a 1M-token window — a strong default for high-volume, latency-sensitive workloads.

Compare Gemini 2.0 Flash with

vs Sonar Reasoning Pro vs R1 1776 vs Qwen3.7 Max vs Gemini 3.5 Flash

See all Gemini 2.0 Flash alternatives →