Skip to content
Merged
Changes from 4 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 5 additions & 3 deletions docs/api-inference/rate-limits.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,10 @@ The Inference API has rate limits based on the number of requests. These rate li

Serverless API is not meant to be used for heavy production applications. If you need higher rate limits, consider [Inference Endpoints](https://huggingface.co/docs/inference-endpoints) to have dedicated resources.

Only registered users can use the Inference API.


| User Tier | Rate Limit |
|---------------------|---------------------------|
| Unregistered Users | 1 request per hour |
| Signed-up Users | 50 requests per hour |
| PRO and Enterprise Users | 500 requests per hour |
| Signed-up Users | 1,000 requests per day |
| PRO and Enterprise Users | 20,000 requests per day |
Loading