Service Quota

Estimate your current quota usage

What is Service Quota?

Service Quota in Float16.cloud refers to the limits on creating, using, or accessing our services for each account. These limits vary depending on the specific service.

Your service quota

LLM as a service

We provide a service quota for our LLM as a service, specifically for API key creation:

  • Maximum API keys per account: 20

  • You can monitor overall usage on the Service Quota page

  • Manage your API keys within each workspace's services section

To learn more about managing your API Key, please refer to our detailed guide: Learn More About API Keys

One Click Deploy

We implement a service quota for our One-Click Deploy service based on GPU card usage:

  • Maximum GPU card usage: 4 cards per account

  • This limit applies regardless of GPU card type

  • Track your overall usage on the Service Quota page

If you need to create a new instance but lack sufficient quota. You have to terminate an existing instance to reclaim the quota, quota is released immediately upon instance termination. Then proceed with creating your new instance.

For detailed instructions on creating and terminating instances, please refer to our comprehensive guide: Learn More About One Click Deploy

Requesting More Quota

If you need additional quota, you can contact us to request an increase.

Monitoring Your Quota

To check your current quota usage:

  1. Go to the Settings section

  2. Navigate to the Service Quota page

  3. View your usage across all workspaces in your account

This page provides an overview of your quota utilization, helping you manage your resources effectively across your entire account.

You can also check your current quota usage and remaining quota directly from the service's credential menu.

Last updated