How to get started?

Introduction

CentML Platform

Quickstart

Plan, optimize, and deploy LLMs effortlessly

LLM Serving

General Inference

Launch an on-demand GPU instance and start building

Compute Instance

Turnkey deployment for Retrieval-Augmented Generation

RAG Application

Client Setup

Python SDK Reference

How to build and deploy your own containerized inference engine

Deploying custom models

How to create private inference endpoints

Private Inference Endpoints

How to build AI agents using structured outputs such as JSON and function/tool calls with CentML

Agent Support

Pricing

Build your own solutions using CentML Platform and start with an example from Codex

Deployments

Clients

Resources

Examples

CentML Platform

Serverless endpoints

Turnkey GenAI deployments

Deploy any model

Deploy anywhere

A Quickstart Guide
