add rate limits to capabilities guide

MathieuBsqt · MathieuBsqt · commit 93c2eb0b5304 · 2025-04-22T12:03:12.000+02:00
diff --git a/pages/public_cloud/ai_machine_learning/endpoints_guide_02_capabilities/guide.en-gb.md b/pages/public_cloud/ai_machine_learning/endpoints_guide_02_capabilities/guide.en-gb.md
@@ -28,7 +28,7 @@ This page provides the technical features, capabilities and limitations of [AI E
 | Large Selection of Models                        | AI Endpoints offers a diverse range of pre-trained AI models, covering categories such as Assistant (LLMs, Code Assistants), Audio Analysis, Embeddings, Natural Language Processing, Translation, Image Generation, and Computer Vision. For a full list of models, please visit the [AI Endpoints Catalog Page](https://endpoints.ai.cloud.ovh.net/catalog).                                                                          |
 | Model Metrics | Users can access various metrics in the [OVHcloud Control Panel](/links/manager), such as the number of calls made per model, input and output tokens for large language models (LLMs), and other usage data. These insights can help you manage costs and gain a better understanding of how your applications are using AI capabilities.                                                                                              |
 | Data Privacy and Sovereignty | OVHcloud prioritizes data privacy and sovereignty, ensuring that AI models accessed via AI Endpoints are fully compliant with strict European regulations. Our infrastructure, located in Gravelines, France, adheres to European data protection regulations. Data is not stored or shared during or after model use, providing users with peace of mind that their data is secure and protected.                                                                          |
-| Access with Personalized Tokens | To ensure secure and authenticated access to model APIs, users need to provide a token for each request. Access tokens can easily be created through the [AI Endpoints](https://endpoints.ai.cloud.ovh.net) page, providing the flexibility to manage multiple tokens for various projects or teams. Additionally, each token comes with adjustable validity periods, allowing users to tailor their access to specific needs. 
+| Access with Personalized Access Keys | To ensure secure and authenticated access to model APIs, users need to provide an API access key in each request. Access keys can be easily created by following the instructions in the [AI Endpoints - Getting Started](/pages/public_cloud/ai_machine_learning/endpoints_guide_01_getting_started) guide. API keys are linked to a Public Cloud project. We provide the flexibility to manage multiple keys for various projects or teams. Additionally, each access key comes with adjustable validity periods, allowing users to tailor their access to specific needs.
 
 ### Planned features
 
@@ -44,17 +44,28 @@ This page provides the technical features, capabilities and limitations of [AI E
 
 AI Endpoints is designed to be compatible with the OpenAI API, making it easy to integrate with existing applications and workflows. This compatibility means that you can take advantage of AI capabilities without having to make major changes to your existing technology stack, allowing you to get up and running quickly and easily.
 
-### Flexibile Usage
+### Flexibile usage
 
 AI Endpoints' APIs are language-agnostic. It enables developers to use any programming language or technology of their choice when working with our APIs, providing them with the freedom to build and integrate AI capabilities according to their requirements and preferences.
 
-## Limitations for the beta phase
+## Limitations
 
-### No Token Limit
+### Model rate limit
 
-As of now, the AI Endpoints platform does not impose a token limit for API requests.
+When using AI Endpoints, the **following rate limits apply**:
 
-In the future, we plan to introduce a token limit feature. This feature will allow you to set a limit on the number of tokens used for each API request, providing better control and management over token consumption.
+- **Anonymous**: 2 requests per minute, per by IP and per model.
+- **Authenticated with an API access key**: 400 requests per minute, per PCI project and per model.
+
+If you exceed this limit, a **429 error code** will be returned.
+
+If you require higher usage, please **[get in touch with us](https://help.ovhcloud.com/csm?id=csm_get_help)** to discuss increasing your rate limits.
+
+### No usage limit
+
+As of now, the AI Endpoints platform does not impose any usage limits for API requests, apart from the rate limiting.
+
+However, we are considering introducing a usage limit feature in the future. This feature will allow you to set a limit on the number of tokens, characters, seconds of audio consumed, depending on your usage, providing better control and management over AI Endpoints consumption.
 
 ## Going further