# Service Quota

{% hint style="danger" %}
Service Quota is under maintenance
{% endhint %}

## What is Service Quota?

Service Quota in Float16.cloud refers to the limits on creating, using, or accessing our services for each account. These limits vary depending on the specific service.

## Your service quota

### LLM as a service

<figure><img src="https://3153350778-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F06IkMLrrm1ptUT8449HF%2Fuploads%2F5Q1fFSJ8H7tEie5ONDJp%2Fimage.png?alt=media&#x26;token=1ef7eb4c-5f0c-45ef-b69f-fbd9f60fae97" alt=""><figcaption><p>Quota remaining</p></figcaption></figure>

We provide a service quota for our LLM as a service, specifically for API key creation:

* Maximum API keys per account: 20
* You can monitor overall usage on the Service Quota page
* Manage your API keys within each workspace's services section

To learn more about managing your API Key, please refer to our detailed guide: [Learn More About API Keys](https://docs.float16.cloud/getting-started/llm-as-a-service/quick-start/set-the-credentials)

### One Click Deploy

We implement a service quota for our One-Click Deploy service based on GPU card usage:

* Maximum GPU card usage: 4 cards per account
* This limit applies regardless of GPU card type
* Track your overall usage on the Service Quota page

If you need to create a new instance but lack sufficient quota. You have to terminate an existing instance to reclaim the quota, quota is released immediately upon instance termination. Then proceed with creating your new instance.

For detailed instructions on creating and terminating instances, please refer to our comprehensive guide: [Learn More About One Click Deploy](https://docs.float16.cloud/getting-started/one-click-deploy)

{% hint style="info" %}

#### Requesting More Quota

If you need additional quota, you can [contact us](https://email:support@float16.cloud) to request an increase.
{% endhint %}

## Monitoring Your Quota

<figure><img src="https://3153350778-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F06IkMLrrm1ptUT8449HF%2Fuploads%2FpWzpUVysg4Are3dIK0AP%2Fimage.png?alt=media&#x26;token=7fb6f1af-b450-4f82-8fd9-fc2d470a224b" alt=""><figcaption><p>Service Quota Setting</p></figcaption></figure>

To check your current quota usage:

1. Go to the Settings section
2. Navigate to the Service Quota page
3. View your usage across all workspaces in your account

This page provides an overview of your quota utilization, helping you manage your resources effectively across your entire account.

You can also check your current quota usage and remaining quota directly from the service's credential menu.<br>


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.float16.cloud/getting-started/account/service-quota.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
