Gemini 3.1 Flash Lite

gemini-3.1-flash-lite
Google · chat · api
GA
Context: 1M
Max output: 65.5K
Input $/1M: $0.25
Output $/1M: $1.50
Modalities: text
Released: 03 Mar 2026
AI summary
● machine-written

Google releases Gemini 3.1 Flash Lite for cost-efficient API access

Gemini 3.1 Flash Lite is a text-based chat model from Google available via API with a 1M token context window. It is positioned as a cost-effective option within Google's Gemini lineup, priced at $0.25 per million input tokens and $1.50 per million output tokens.
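
To make the pricing concrete, here is a minimal sketch of a per-request cost estimate using the listed rates. The function name `estimate_cost_usd` and the example token counts are hypothetical; the sketch assumes flat per-token billing at the listed rates and ignores any caching discounts or other pricing tiers, which this listing does not describe.

```python
# Listed rates from this page, in USD per one million tokens.
INPUT_USD_PER_M = 0.25
OUTPUT_USD_PER_M = 1.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming flat per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: a 200,000-token prompt and a 2,000-token reply.
    print(f"${estimate_cost_usd(200_000, 2_000):.4f}")  # prints $0.0530
```

At these rates an output token costs six times as much as an input token, so long prompts with short replies stay cheap relative to generation-heavy workloads.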

What's new
  • 1M context window supports extended document processing
  • Max output of 65.5K tokens per response
  • Pricing: $0.25/M for input tokens (cache miss) and $1.50/M for output tokens
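
The limits above can be turned into a simple pre-flight check before sending a request. This is only a sketch: the constants read "1M" as 1,000,000 tokens and "65.5K" as 65,500 tokens, which are rounded display values and may not match the exact limits, and the helper `check_request` is hypothetical. Whether output tokens count against the context window is not stated here, so the check exposes that as a flag.

```python
# Rounded figures from this listing; exact limits may differ (assumption).
CONTEXT_WINDOW_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 65_500


def check_request(prompt_tokens: int, requested_output_tokens: int,
                  output_counts_toward_context: bool = True) -> list[str]:
    """Return a list of limit violations; an empty list means the request fits."""
    problems = []
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        problems.append("requested output exceeds max output")
    # Assumption: by default, prompt and output share the context window.
    used = prompt_tokens
    if output_counts_toward_context:
        used += requested_output_tokens
    if used > CONTEXT_WINDOW_TOKENS:
        problems.append("request exceeds context window")
    return problems


if __name__ == "__main__":
    print(check_request(950_000, 60_000))  # over the shared window
    print(check_request(500_000, 8_000))   # fits
```

Token counts would come from whatever tokenizer or counting endpoint the API provides; that step is left out because this listing does not describe it.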
Best for
  • Cost-sensitive production deployments
  • High-throughput text generation tasks
  • Applications requiring extended context handling
Sources

Source: https://ai.google.dev/gemini-api/docs/models/gemini-3.1-flash-lite