a speedy and economical reasoning model that excels at agentic coding.
Specifications
Context256,000
Inputtext
Outputtext, json
Performance (7-day Average)
Uptime
TPS
RURT
API Paths
/v1/chat/completions
Pricing
Input$0.20× 1.1/ MTokens
Output$1.50× 1.1/ MTokens
Cached Input$0.02× 1.1/ MTokens