Merged
Changes from 3 commits
8 changes: 5 additions & 3 deletions docs/api-inference/rate-limits.md
@@ -4,8 +4,10 @@ The Inference API has rate limits based on the number of requests. These rate li

Serverless API is not meant to be used for heavy production applications. If you need higher rate limits, consider [Inference Endpoints](https://huggingface.co/docs/inference-endpoints) to have dedicated resources.

+Only registered users can use the Inference API.
+
+
| User Tier | Rate Limit |
|---------------------|---------------------------|
-| Unregistered Users | 1 request per hour |
-| Signed-up Users | 50 requests per hour |
-| PRO and Enterprise Users | 500 requests per hour |
+| Signed-up Users | 1.000 requests per day |
+| PRO and Enterprise Users | 20.000 requests per day |
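
To put the hourly and daily figures in the rows above on a common basis, a little arithmetic helps. The sketch below assumes requests are spread evenly over a 24-hour day and reads "1.000" and "20.000" as one thousand and twenty thousand. The function names are hypothetical and not part of any Hugging Face library.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def average_hourly_rate(requests_per_day: int) -> float:
    """Average requests per hour if a daily quota is spread evenly."""
    return requests_per_day / 24


def min_seconds_between_requests(requests_per_day: int) -> float:
    """Spacing between requests that uses up exactly the daily quota."""
    return SECONDS_PER_DAY / requests_per_day


# Daily quotas as read from the table (assumed to mean 1,000 and 20,000).
for tier, per_day in [("Signed-up Users", 1_000),
                      ("PRO and Enterprise Users", 20_000)]:
    print(
        f"{tier}: about {average_hourly_rate(per_day):.1f} requests/hour, "
        f"one request every {min_seconds_between_requests(per_day):.2f} s"
    )
```

Under that even-spacing assumption, a signed-up user averages slightly fewer requests per hour than the earlier 50-per-hour figure, while a daily quota allows short bursts that an hourly limit would not.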
Member

This might be confusing to US readers: they could read it as 1 and 20 requests per day written with too many decimal places. Maybe use "1 thousand" or "1K" instead?

Member

yes, either 1k/20k or 1,000/20,000 otherwise
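
The two spellings suggested in this thread can be produced with Python's built-in format specifiers. This is only an illustrative sketch: `grouped` and `compact` are hypothetical helper names, and `compact` handles only exact multiples of one thousand.

```python
def grouped(n: int) -> str:
    """Format with a comma as the thousands separator, e.g. 1,000."""
    return f"{n:,}"


def compact(n: int) -> str:
    """Format exact multiples of one thousand with a 'k' suffix, e.g. 1k."""
    if n >= 1000 and n % 1000 == 0:
        return f"{n // 1000}k"
    return str(n)


for limit in (1000, 20000):
    print(grouped(limit), compact(limit))
```

Either form avoids the period, which some locales read as a decimal point and others as a thousands separator.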
