o4-mini-2025-04-16 is the recommended small reasoning model in the o-series, offering improved performance, faster responses, and a range of reasoning modes.
Specifications
Context200,000
Max Output100,000
Inputtext, image
Outputtext, json
Performance (7-day Average)
Uptime
TPS
RURT
API Paths
/v1/chat/completions
/v1/responses
Pricing
Input$1.10× 1.1/ MTokens
Output$4.40× 1.1/ MTokens
Cached Input$0.28× 1.1/ MTokens