diff --git a/docs/api-inference/rate-limits.md b/docs/api-inference/rate-limits.md index 147909a35..f3f819d9b 100644 --- a/docs/api-inference/rate-limits.md +++ b/docs/api-inference/rate-limits.md @@ -4,8 +4,10 @@ The Inference API has rate limits based on the number of requests. These rate li Serverless API is not meant to be used for heavy production applications. If you need higher rate limits, consider [Inference Endpoints](https://huggingface.co/docs/inference-endpoints) to have dedicated resources. +You need to be authenticated (passing a token or through your browser) to use the Inference API. + + | User Tier | Rate Limit | |---------------------|---------------------------| -| Unregistered Users | 1 request per hour | -| Signed-up Users | 50 requests per hour | -| PRO and Enterprise Users | 500 requests per hour | \ No newline at end of file +| Signed-up Users | 1,000 requests per day | +| PRO and Enterprise Users | 20,000 requests per day | \ No newline at end of file