gemini-3.1-flash-lite-preview

Common Name: Gemini 3.1 Flash-Lite Preview

Google
-20%On SaleReleased on Mar 3 12:00 AMKnowledge Cutoff Jan 1, 2025 12:00 AMSupportedTool InvocationSupportedReasoning
CompareTry in Chat

Google's most cost-efficient Gemini 3 series model, optimized for high-volume agentic tasks, translation, and simple data processing with 2.5X faster time to first token than 2.5 Flash.

Specifications

Context
1000K
Maximum Output
64K
Inputtext, image, audio, video
Outputtext

Performance (7-day Average)

Collecting…
Collecting…
Collecting…

Pricing

Standard
Batch
Input/MTokens
$0.20
$0.10
Input Image/MTokens
$0.20
$0.10
Output/MTokens
$1.20
$0.60
Thinking Output/MTokens
$1.20
$0.60

Availability Trend (24h)

Performance Metrics (24h)

Similar Models

$0.24/$2.00/M-20%
ctx1.0Mmax66Kavailtps

Google's most efficient workhorse model designed for speed and low-cost. Improved across key benchmarks for reasoning, multimodality, code and long context while being 20-30% more efficient.

$0.40/$2.40/M-20%
ctx1.0Mmax64Kavailtps

Preview of Google's next-generation Gemini 3 Flash model, optimized for speed with frontier intelligence combined with superior search and grounding capabilities.

$0.40/$2.40/M-20%
ctx1.0Mmax64Kavailtps
InOutCap

Gemini 3.1 Flash Image generation model designed for speed and efficiency, effective for quick interactive image responses and high throughput.

$0.08/$0.32/M-20%
ctx1.0Mmax66Kavailtps
InOutCap

A lightweight version of Gemini 2.5 Flash optimized for speed and cost efficiency with 1M token context support.