Resources
Pricing
The CentML Platform operates on a credit-based billing system, where 1 CentML credit equals 1 USD. You can buy credits through the Platform by going to Account -> Wallet.
Serverless endpoint
Serverless endpoint usage is billed according to the total number of tokens generated and processed.
Model Size | Credits per 1M Tokens |
---|---|
DeepSeek-R1 MoE (671B) | $3.99 |
Llama-3.3 (70B) | $0.50 |
Qwen2.5 (32B)* | $0.80 |
Qwen2.5-VL (7B)* | $0.15 |
Llama-3.1 (8B)* | $0.10 |
Llama-3.2 (3B)* | $0.06 |
Phi-3.5* | $0.12 |
*Coming Soon…
Dedicated deployments
Dedicated deployments are charged based on the type and duration of hardware used, following a per-minute billing system.
GPU | Credits per GPU per hour |
---|---|
Nvidia L4 24GB | 0.30 |
Nvidia A10G 24GB | 0.30 |
Nvidia A100 40GB | 1.10 |
Nvidia H100 80GB | 2.50 |