ModelPriceLab
Changes/Gemini 3.1 Flash-Lite launched in preview ($0.25/$1.50 per MTok)
MajorGoogle AINew Launchgemini-3.1-flash-liteVerified

Gemini 3.1 Flash-Lite launched in preview ($0.25/$1.50 per MTok)

Google introduced Gemini 3.1 Flash-Lite in preview for low-latency, high-volume workloads. The official announcement positions it for summarization, translation, and classification, with pricing set at $0.25 per 1M input tokens and $1.50 per 1M output tokens in Google AI Studio and Vertex AI.

Mar 3, 2026Effective: Mar 3, 2026
At a glance
Decide impact, cost movement, and the next step before reading the full diff.
Affected
Google AI · gemini-3.1-flash-lite
New Launch
Cost impact
High · 4/5
Estimated from pricing and window changes
Migration pressure
Low · 0/5
No explicit deadline yet
Next step
Benchmark Gemini 3.1 Flash-Lite against your current low-cost model on s...
Do this first, then review field-level details

Impact summary

Use explicit levels instead of abstract charts to judge this change.

Cost
High
Score 4/580%
Quality
High
Score 4/580%
Migration
Low
Score 0/50%
Reliability
Low
Score 1/520%
Compliance
Low
Score 0/50%
What changed
  • New preview model: gemini-3.1-flash-lite
  • Available in Google AI Studio and Vertex AI
  • Pricing: $0.25 / MTok input, $1.50 / MTok output
  • Positioned for low-latency, high-volume tasks such as summarization, translation, and classification
Recommended actions
  • Benchmark latency and quality against your current low-cost routing target.
  • Because this is preview, gate production rollout behind evaluation and fallback rules.

Changes

Review changed fields first, then decide whether you need full before/after values.

5 fields changed
Pro
modelstatusavailabilityinput_per_1moutput_per_1m
Upgrade to Pro for full field-level diffs, before/after values, and migration guidance.
Upgrade

Recommended Actions

Only actionable items stay prominent. Incomplete actions fall back to a compact list.

Do this first
1
Validate

Benchmark Gemini 3.1 Flash-Lite against your current low-cost model on summarization, translation, classification, and latency-sensitive workloads.

2
Monitor

If you route production traffic to this model, keep a fallback target because the launch is preview-only.

Sources

Each source focuses on what it confirms, without repeating the whole article.

blog
Gemini 3.1 Flash-Lite

Google introduced Gemini 3.1 Flash-Lite in preview for low-latency, high-volume tasks, priced at $0.25 input and $1.50 output per 1M tokens.

Open

Comments

Loading comments...
Gemini 3.1 Flash-Lite launched in preview ($0.25/$1.50 per MTok) — ModelPriceLab