Gemini 2.0 Flash 001
gemini-2.0-flash-001
Google · chat · api · seen 13m ago
Context
1M
Max output
8.2K
Input $/1M
$0.10
Output $/1M
$0.40
Modalities
text
Released
05 Feb 2025
AI summary
● machine-written
Google releases Gemini 2.0 Flash 001, 1M-token context model
Gemini 2.0 Flash 001 is a large language model by Google supporting text input with a 1 million token context window and up to 8.2K tokens of output. The model is available via API at $0.10 per million input tokens and $0.40 per million output tokens, positioned as a fast inference option for developers.
What's new
- Released February 5, 2025
- 1 million token context window
- Supports function calling, web search, prompt caching, and structured output
- Up to 8.2K token maximum output
- General availability status
Best for
High token limit applications requiring extended contextCost-sensitive text generation tasksWeb search-augmented responsesStructured output generation
Source: https://ai.google.dev/gemini-api/docs/models/gemini-2.0-flash-001