Gemini 3.1 Flash Lite Preview

Name: Gemini 3.1 Flash Lite Preview
Price: 0.25 USD per 1M input tokens
Author: Google

gemini-3.1-flash-lite-preview

Google · chat · api

Preview

Context: 1M tokens
Max output: 65.5K tokens
Input $/1M: $0.25
Output $/1M: $1.50
Modalities: text
Released: 03 Mar 2026

AI summary

● machine-written

Google releases Gemini 3.1 Flash Lite Preview with 1M context window

Gemini 3.1 Flash Lite Preview is Google's cost-efficient model for high-volume use cases, described as offering frontier-class performance at reduced cost. It supports a 1M-token context window and up to 65.5K output tokens, and is available through Google's API. It ranks #89 overall in intelligence benchmarks, trading peak capability for affordability in production applications.

What's new

1M token context window supports larger input documents and conversations
65.5K token maximum output enables extended response generation
Pricing at $0.25 input / $1.50 output per 1M tokens positions it below pro-tier models
Optimized for high-volume, cost-sensitive production deployments
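
To make the per-token pricing concrete, here is a minimal sketch of a cost estimate using the listed rates ($0.25 input, $1.50 output per 1M tokens). The function name `estimate_cost_usd` and the sample workload are hypothetical, chosen only for illustration; actual billing may include factors this page does not list.

```python
# Listed prices from this page, in USD per 1M tokens.
INPUT_USD_PER_M = 0.25
OUTPUT_USD_PER_M = 1.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate cost in USD for a given number of input and output tokens."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 10,000 requests, each with
    # 2,000 input tokens and 500 output tokens.
    requests = 10_000
    cost = estimate_cost_usd(requests * 2_000, requests * 500)
    print(f"${cost:.2f}")  # prints "$12.50"
```

In this example, output tokens account for $7.50 of the $12.50 total despite being a quarter of the input volume, because the output rate is six times the input rate.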

Best for

High-volume API integrations requiring cost efficiency
Text-based chat and conversational applications
Batch processing with extended context requirements
Production workloads prioritizing speed and affordability over maximum intelligence
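
For batch processing against the context limit, one possible approach is to pack documents greedily into requests that stay under a token budget. The sketch below rests on several assumptions: it uses the rounded figures from this page (1M context, 65.5K max output) rather than exact limits, it assumes reserved output tokens count against the context window, and it expects token counts measured beforehand. The name `pack_batches` is hypothetical.

```python
from typing import List

# Rounded figures as listed on this page; the exact limits are an assumption.
CONTEXT_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 65_500


def pack_batches(doc_token_counts: List[int],
                 reserved_output: int = MAX_OUTPUT_TOKENS) -> List[List[int]]:
    """Greedily group document indices so each batch fits the input budget.

    Assumes output tokens share the context window, so the input budget
    is the context size minus the tokens reserved for the response.
    """
    budget = CONTEXT_TOKENS - reserved_output
    batches: List[List[int]] = []
    current: List[int] = []
    used = 0
    for index, tokens in enumerate(doc_token_counts):
        if tokens > budget:
            raise ValueError(f"document {index} exceeds the input budget")
        if used + tokens > budget:
            # Current batch is full; start a new one.
            batches.append(current)
            current, used = [], 0
        current.append(index)
        used += tokens
    if current:
        batches.append(current)
    return batches


if __name__ == "__main__":
    # Three hypothetical documents of 400K tokens each.
    print(pack_batches([400_000, 400_000, 400_000]))  # prints [[0, 1], [2]]
```

Greedy packing in input order keeps the sketch simple and preserves document order; it does not minimize the number of batches.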

Sources

Source: https://ai.google.dev/gemini-api/docs/models/gemini-3.1-flash-lite-preview