Service Quota
Estimate your current quota usage
Last updated
Estimate your current quota usage
Last updated
Service Quota in Float16.cloud refers to the limits on creating, using, or accessing our services for each account. These limits vary depending on the specific service.
We provide a service quota for our LLM as a service, specifically for API key creation:
Maximum API keys per account: 20
You can monitor overall usage on the Service Quota page
Manage your API keys within each workspace's services section
To learn more about managing your API Key, please refer to our detailed guide: Learn More About API Keys
We implement a service quota for our One-Click Deploy service based on GPU card usage:
Maximum GPU card usage: 4 cards per account
This limit applies regardless of GPU card type
Track your overall usage on the Service Quota page
If you need to create a new instance but lack sufficient quota. You have to terminate an existing instance to reclaim the quota, quota is released immediately upon instance termination. Then proceed with creating your new instance.
For detailed instructions on creating and terminating instances, please refer to our comprehensive guide: Learn More About One Click Deploy
If you need additional quota, you can contact us to request an increase.
To check your current quota usage:
Go to the Settings section
Navigate to the Service Quota page
View your usage across all workspaces in your account
This page provides an overview of your quota utilization, helping you manage your resources effectively across your entire account.
You can also check your current quota usage and remaining quota directly from the service's credential menu.