Gemini 3.1 Flash Lite Preview
gemini-3.1-flash-lite-preview
Google · chat · api
Preview
Context: 1M tokens
Max output: 65.5K tokens
Input $/1M tokens: $0.25
Output $/1M tokens: $1.50
Modalities: text
Released: 03 Mar 2026
AI summary
● machine-written
Google releases Gemini 3.1 Flash Lite Preview with 1M context window
Gemini 3.1 Flash Lite Preview is Google's cost-efficient model designed for high-volume use cases, offering frontier-class performance at reduced cost. It supports a 1M-token context window and up to 65.5K output tokens, and is available through Google's API. It ranks #89 overall in intelligence benchmarks and balances capability and affordability for production applications.
What's new
- 1M token context window supports larger input documents and conversations
- 65.5K token maximum output enables extended response generation
- Pricing of $0.25 (input) / $1.50 (output) per 1M tokens positions it below pro-tier models
- Optimized for high-volume, cost-sensitive production deployments
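The listed prices and limits can be turned into a quick per-request estimate. The following is a minimal sketch, not an official calculator: the function names are hypothetical, the token limits use the rounded figures shown in the listing (1M and 65.5K) rather than exact values, and it assumes input and output are billed at the flat listed rates with no caching or tiered pricing.

```python
# Rates as shown in the listing, in USD per 1M tokens.
INPUT_USD_PER_MTOK = 0.25
OUTPUT_USD_PER_MTOK = 1.50

# Assumption: rounded figures from the listing, not exact limits.
CONTEXT_LIMIT_TOKENS = 1_000_000  # "1M"
MAX_OUTPUT_TOKENS = 65_500        # "65.5K"


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the flat listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MTOK
             + output_tokens * OUTPUT_USD_PER_MTOK)
    return total / 1_000_000


def within_listed_limits(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the rounded limits above.

    Assumption: input is compared to the context window and output to the
    max-output figure independently; how the provider actually counts
    output against the context window is not stated in the listing.
    """
    return (0 <= input_tokens <= CONTEXT_LIMIT_TOKENS
            and 0 <= output_tokens <= MAX_OUTPUT_TOKENS)


if __name__ == "__main__":
    # Example: a 200K-token document summarized into 4K tokens.
    print(f"${estimate_cost_usd(200_000, 4_000):.4f}")
    print(within_listed_limits(200_000, 4_000))
```

At these rates output tokens cost six times as much as input tokens, so long generations tend to dominate the bill even when prompts are large.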
Best for
- High-volume API integrations requiring cost efficiency
- Text-based chat and conversational applications
- Batch processing with extended context requirements
- Production workloads prioritizing speed and affordability over maximum intelligence
Source: https://ai.google.dev/gemini-api/docs/models/gemini-3.1-flash-lite-preview