Navigate to General Inference
From the Home page, go to General Inference to set up the deployment.
Fill in Deployment Details
Provide the necessary details, including:
Docker image name
Copy
Ask AI
username/base-api # replace with your image
Tag (e.g., v1.0)
Copy
Ask AI
v1.0
Port number
Copy
Ask AI
8000
Health check path
Copy
Ask AI
/docs
Entry point arguments
Copy
Ask AI
python main.py
Your HF_TOKEN (Hugging Face API token) for authentication
Example:
Select Resource Size
Choose the resource size based on your image generation speed requirements:
Small
Medium
Large
Click Deploy
After deployment, you will receive an endpoint URL.Interacting with the EndpointOnce deployed, you can interact with the endpoint via:
curl commands
The included apps and examples
Use the provided URL to send requests and generate images with FLUX.This version adds clarity and a more polished flow. Let me know if you’d like to include more details or examples for interacting with the endpoint!