Quick Start
One Click Deploy quick start
Check all instances
When you first access the One-Click Deploy service, you'll be presented with a table displaying all your instances, both active and inactive.
Every account is allocated a quota of 4 GPU cards, with no restrictions on the type of GPU. To check your current quota, simply click on the "Quota" button or navigate to the service quota settings.
Learn more about service quota here
Add new instance
to start new instance:
Paste Hugging Face model repository and token (if required), then click "Next"
Review model name and input instance name.
Configure instance, select region and GPU type.
Review pricing and instance summary.
Click "Start Deploy", wait for "Start instance successfully" notification
Redirected to the instance's deployment section.
Quickly test API
After successful deployment, test your model using:
Instance Chat Playground
After successfully deploying your model, you can easily test its performance using the Instance Chat Playground. This convenient testing tool is readily accessible from your instance overview, allowing you to immediately interact with your deployed model.
Default settings: temperature 0.5, max tokens 512
Customize parameters, system prompt, and text message via GUI
cURL
Or use following command:
Find copyable API formats (including OpenAI and LangChain) in the API tab.
Last updated