The CentML Platform operates on a credit-based billing system, where 1 CentML credit equals 1 USD. You can buy credits through the Platform by going to Account -> Wallet.

Serverless endpoint

Serverless endpoint usage is billed according to the total number of tokens generated and processed.

Model SizeCredits per 1M Tokens
Small (1-4B)$0.04
Medium (7-11B)$0.08
Large (70-90B)$0.50
X-Large (405B)$2.50

Dedicated deployments

Dedicated deployments are charged based on the type and duration of hardware used, following a per-minute billing system.

GPUCredits per GPU per hour
Nvidia L4 24GB0.30
Nvidia A10G 24GB0.30
Nvidia A100 40GB1.10
Nvidia H100 80GB2.50