o3-mini is the recommended small reasoning model in the o-series, offering improved performance, faster responses, and a range of reasoning modes.
Specifications
Context200,000
Max Output100,000
Inputtext
Outputtext, json
Performance (7-day Average)
Uptime
TPS
RURT
API Paths
/v1/chat/completions
/v1/responses
Pricing
Input$1.10× 1.1/ MTokens
Output$4.40× 1.1/ MTokens
Cached Input$0.55× 1.1/ MTokens
Batch Input$0.55× 1.1/ MTokens
Batch Output$2.20× 1.1/ MTokens