Gemini 3 Flash vs GPT Audio
Google vs OpenAI — benchmarks, pricing, and capabilities side by side.
- •Gemini 3 Flash is cheaper ($0.50 vs $2.50 per 1M input)
- •Gemini 3 Flash has a larger context window (1M)
| Gemini 3 Flash | GPT Audio | |
|---|---|---|
| Intelligence index | 90.2 | — |
| Developer | OpenAI | |
| Type | Multimodal | Multimodal |
| Access | API only | API only |
| Context window | 1,048,576 tokens | 128,000 tokens |
| Input price | $0.50 / 1M | $2.50 / 1M |
| Output price | $3.00 / 1M | $10.00 / 1M |
| Speed | 191 tok/s | — |
| Released | December 17, 2025 | January 19, 2026 |
| Parameters | — | — |
| Input modalities | Text, Image, Audio, Video | Text, Audio |
| Output modalities | Text | Text, Audio |