Skip to content

Gemini 2.0 Flash 001

gemini-2.0-flash-001
Google · chat · api · seen 13m ago
GA Alert me on changes
Context
1M
Max output
8.2K
Input $/1M
$0.10
Output $/1M
$0.40
Modalities
text
Released
05 Feb 2025
AI summary
● machine-written

Google releases Gemini 2.0 Flash 001, 1M-token context model

Gemini 2.0 Flash 001 is a large language model by Google supporting text input with a 1 million token context window and up to 8.2K tokens of output. The model is available via API at $0.10 per million input tokens and $0.40 per million output tokens, positioned as a fast inference option for developers.

What's new
  • Released February 5, 2025
  • 1 million token context window
  • Supports function calling, web search, prompt caching, and structured output
  • Up to 8.2K token maximum output
  • General availability status
Best for
High token limit applications requiring extended contextCost-sensitive text generation tasksWeb search-augmented responsesStructured output generation
Sources

Source: https://ai.google.dev/gemini-api/docs/models/gemini-2.0-flash-001