The CentML Platform operates on a credit-based billing system, where 1 CentML credit equals 1 USD. You can buy credits through the Platform by going to Account -> Wallet.

Serverless endpoint

Serverless endpoint usage is billed according to the total number of tokens generated and processed.

ModelCredits per million tokens
meta-llama/Llama-3.1-405B-Instruct-FP82.50

Dedicated deployments

Dedicated deployments are charged based on the type and duration of hardware used, following a per-minute billing system.

GPUCredits per GPU per hour
Nvidia L4 24GB0.30
Nvidia A10G 24GB0.30
Nvidia A100 40GB1.10
Nvidia H100 80GB2.50