Skip to content

Gemini 2.5 Flash

gemini-2.5-flash
Google · chat · api · seen 8m ago
GA Alert me on changes
Context
1M
Max output
65.5K
Input $/1M
$0.30
Output $/1M
$2.50
Modalities
text
Released
17 Jun 2025
AI summary
● machine-written

Google releases Gemini 2.5 Flash for large-scale processing

Gemini 2.5 Flash is Google's price-performance model supporting a 1M token context window and offering text input with text output. The model is positioned for large-scale processing, low-latency tasks, and agentic use cases. It was released on June 17, 2025.

What's new
  • 1M token context window with 65.5K token output limit
  • Supports thinking, code execution, and function calling
  • Available through Batch API, Flex inference, and Priority inference
  • Knowledge cutoff January 2025
Best for
Large-scale processing and high-volume tasksLow-latency applicationsAgentic use casesReasoning and coding tasks
Sources

Source: https://ai.google.dev/gemini-api/docs/models/gemini-2.5-flash