Instance Detail
Your instance detail
Last updated
Your instance detail
Last updated
When you initiate deployment, the instance detail page becomes available. This page is divided into five sections, each providing crucial information about your deployed instance.
The Overview section contains essential instance information:
Model details (batch size, max input length, number of tokens)
Instance configuration (cloud provider, region, GPU type)
Endpoint and API key (visible after successful deployment)
This section displays real-time usage and cost information:
Each row represents usage and cost for a specific pricing period
New rows are added when pricing changes
Example: If L4 GPU costs 1.00/hour
in September and increases to 1.20/hour
in October, you'll see separate rows for September and October usage.
This section displays the deployment status with the following possible states:
Initial: Checking model and limitations
Allocate: Allocating resources
Running: Deploy successful, ready to use
Terminated: Instance shut down (will still show 3 checked statuses)
If deployment fails, system will automatically terminate the instance
You can regenerate the API key for security purposes.
A playground is also provided for quick model testing.
For a comprehensive view of all instance costs, visit the .
The Activity section shows monthly instance activity.
Currently, the Setting section offers the option to terminate the instance.