Fastest, most cost-effective GPT 4.1 model.
Specifications
Context1,047,576
Max Output32,768
Inputtext, image
Outputtext, json
Performance (7-day Average)
Uptime
TPS
RURT
API Paths
/v1/chat/completions
/v1/responses
Pricing
Input$0.10× 1.1/ MTokens
Output$0.40× 1.1/ MTokens
Cached Input$0.03× 1.1/ MTokens