fireworks/models/deepseek-v3-0324

Display Name: Deepseek V3 03-24
Fireworks
text-only
Released on Oct 16 12:00 AM

A strong Mixture-of-Experts (MoE) language model from DeepSeek with 671B total parameters, of which 37B are activated for each token. This is an updated checkpoint.

Specifications

Context: 160,000
Input: text
Output: text, json

Performance (7-day Average)

Uptime
TPS
RURT

Pricing

Input: $0.90 × 1.1 / MTokens
Output: $0.90 × 1.1 / MTokens
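
As a rough illustration of the pricing arithmetic, the sketch below estimates the cost of a single request. It assumes that "×1.1" is a multiplier applied to the listed base price and that "MTokens" means one million tokens; neither is spelled out on this page. The function name `estimate_cost` and the constants are hypothetical, introduced only for this example.

```python
from decimal import Decimal

# Listed base prices in USD per million tokens (from the Pricing section).
INPUT_PRICE_PER_MTOK = Decimal("0.90")
OUTPUT_PRICE_PER_MTOK = Decimal("0.90")

# Assumption: the "×1.1" shown next to each price scales the base price.
MULTIPLIER = Decimal("1.1")

# Assumption: "MTokens" means one million tokens.
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request under the assumptions above."""
    input_cost = INPUT_PRICE_PER_MTOK * MULTIPLIER * Decimal(input_tokens) / TOKENS_PER_MTOK
    output_cost = OUTPUT_PRICE_PER_MTOK * MULTIPLIER * Decimal(output_tokens) / TOKENS_PER_MTOK
    return input_cost + output_cost
```

Under these assumptions, the effective rate is $0.99 per million tokens in either direction, so `estimate_cost(1_000_000, 0)` comes to 0.99 USD. `Decimal` is used here to avoid binary floating-point rounding in currency values.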
