Gemini 2.5 Flash
gemini-2.5-flash
Google · chat · api
Context: 1M tokens
Max output: 65.5K tokens
Input $/1M tokens: $0.30
Output $/1M tokens: $2.50
Modalities: text
Released: 17 Jun 2025
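The listed prices translate into a per-request cost with simple arithmetic. The following is a minimal sketch rather than an official calculator: the function name `estimate_cost_usd` is hypothetical, the limits "1M" and "65.5K" are read as 1,000,000 and 65,500 tokens (the exact values may differ), and any tier-specific pricing is ignored.

```python
# Prices as listed on this card, in USD per 1M tokens.
INPUT_USD_PER_M = 0.30
OUTPUT_USD_PER_M = 2.50

# Assumption: the rounded figures "1M" and "65.5K" are taken literally.
# The exact token limits may be slightly different.
CONTEXT_LIMIT_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 65_500


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from the listed per-token prices.

    Hypothetical helper for illustration; it only checks the input against
    the context figure and the output against the output figure.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens > CONTEXT_LIMIT_TOKENS:
        raise ValueError("input exceeds the listed context window")
    if output_tokens > MAX_OUTPUT_TOKENS:
        raise ValueError("output exceeds the listed output limit")

    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # 200K input tokens and 4K output tokens.
    print(f"${estimate_cost_usd(200_000, 4_000):.4f}")
```

With these figures, a request of 200,000 input tokens and 4,000 output tokens comes to about $0.07: $0.06 for input plus $0.01 for output.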
AI summary (machine-written)
Google releases Gemini 2.5 Flash for large-scale processing
Gemini 2.5 Flash is Google's price-performance model, with a 1M-token context window and text input and output. Google positions it for large-scale processing, low-latency tasks, and agentic use cases. It was released on June 17, 2025.
What's new
- 1M-token context window with a 65.5K-token output limit
- Supports thinking, code execution, and function calling
- Available through the Batch API, Flex inference, and Priority inference
- Knowledge cutoff: January 2025
Best for
- Large-scale processing and high-volume tasks
- Low-latency applications
- Agentic use cases
- Reasoning and coding tasks
Source: https://ai.google.dev/gemini-api/docs/models/gemini-2.5-flash