GPT-4o (“o” for “omni”) is our versatile, high-intelligence flagship model. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). Learn how to use GPT-4o in our text generation guide.
Specifications
Context128,000
Max Output16,384
Inputtext, image
Outputtext, json
Performance (7-day Average)
Uptime
TPS
RURT
API Paths
/v1/chat/completions
/v1/responses
Pricing
Input$2.50× 1.1/ MTokens
Output$10.00× 1.1/ MTokens
Cached Input$1.25× 1.1/ MTokens
Batch Input$1.25× 1.1/ MTokens
Batch Output$5.00× 1.1/ MTokens