gemini-2.5-flash

Common Name: Gemini 2.5 Flash

Provider: Google
On Sale: -20%
Released: Jun 18, 2025
Knowledge Cutoff: Apr 1, 2025
Supported: Tool Invocation

Google's most efficient workhorse model, designed for speed and low cost. Improved across key benchmarks for reasoning, multimodality, code, and long context while being 20-30% more efficient.

Specifications

Context: 1048.6K tokens
Maximum Output: 65.5K tokens
Input: text, image, audio, video
Output: text

Pricing

Input: $0.24/MTokens
Input Audio: $0.80/MTokens
Output: $2.00/MTokens
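
As a rough illustration of how these per-million-token rates translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and its parameters are hypothetical, and it assumes that audio input tokens are billed at the Input Audio rate while all other input tokens are billed at the standard Input rate; the listing itself does not spell out the billing rules.

```python
# Rates copied from the Pricing section (USD per million tokens).
PRICE_PER_MTOK = {"input": 0.24, "input_audio": 0.80, "output": 2.00}


def estimate_cost(input_tokens=0, audio_tokens=0, output_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: audio tokens use the Input Audio rate and every other
    input token uses the standard Input rate.
    """
    total = (
        input_tokens * PRICE_PER_MTOK["input"]
        + audio_tokens * PRICE_PER_MTOK["input_audio"]
        + output_tokens * PRICE_PER_MTOK["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 100K non-audio input tokens and 5K output tokens.
    cost = estimate_cost(input_tokens=100_000, output_tokens=5_000)
    print(f"${cost:.4f}")
```

With 100K non-audio input tokens and 5K output tokens, the estimate comes to about $0.034, most of it from input despite the higher output rate.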

Similar Models

Price: $0.40 input / $2.40 output per MTokens (-20%)
Context: 1.0M, Maximum Output: 64K

Preview of Google's next-generation Gemini 3 Flash model, optimized for speed and combining frontier intelligence with superior search and grounding capabilities.

Price: $0.20 input / $1.20 output per MTokens (-20%)
Context: 1.0M, Maximum Output: 64K

Google's most cost-efficient Gemini 3 series model, optimized for high-volume agentic tasks, translation, and simple data processing with 2.5X faster time to first token than 2.5 Flash.

Price: $0.40 input / $2.40 output per MTokens (-20%)
Context: 1.0M, Maximum Output: 64K

Gemini 3.1 Flash Image generation model designed for speed and efficiency, effective for quick interactive image responses and high throughput.

Price: $0.08 input / $0.32 output per MTokens (-20%)
Context: 1.0M, Maximum Output: 66K

A lightweight version of Gemini 2.5 Flash optimized for speed and cost efficiency with 1M token context support.