Commit f023695

[APIM] Rate limits in multi-region deployment
1 parent 0d62afc commit f023695

3 files changed, +9 -4 lines changed

articles/api-management/rate-limit-by-key-policy.md

Lines changed: 4 additions & 2 deletions
@@ -6,7 +6,7 @@ author: dlepow
 
 ms.service: azure-api-management
 ms.topic: reference
-ms.date: 07/23/2024
+ms.date: 03/31/2025
 ms.author: danlep
 ---
 
@@ -63,7 +63,9 @@ To understand the difference between rate limits and quotas, [see Rate limits an
 
 * [!INCLUDE [api-management-rate-limit-key-scope](../../includes/api-management-rate-limit-key-scope.md)]
 * [!INCLUDE [api-management-self-hosted-gateway-rate-limit](../../includes/api-management-self-hosted-gateway-rate-limit.md)] [Learn more](how-to-self-hosted-gateway-on-kubernetes-in-production.md#request-throttling)
-* When `increment-condition` or `increment-count` are defined using expressions, evaluation and increment of rate limit counter are postponed to end of outbound pipeline to allow for policy expressions based on the reponse. Limit exceeded condition is not evaluated at the same time in this case and will be evaluated on next incoming call. This leads to cases where `429 Too Many Requests` status code is returned 1 call later than usual.
+* When `increment-condition` or `increment-count` are defined using expressions, evaluation and increment of the rate limit counter are postponed to the end of outbound pipeline to allow for policy expressions based on the response. Limit exceeded condition is not evaluated at the same time in this case and will be evaluated on next incoming call. This leads to cases where `429 Too Many Requests` status code is returned 1 call later than usual.
+* In a [multi-region](api-management-howto-deploy-multi-region.md) deployment, this policy counts calls separately at each regional gateway in the deployment. The policy doesn't aggregate all call data for the instance.
+
 
 
 ## Example
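
The `increment-condition` note in this hunk describes a one-call delay before `429 Too Many Requests` is returned. The Python sketch below is a toy model of that behavior, not the API Management implementation: the `ToyRateLimiter` class, its `deferred` flag, and the single never-renewing window are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class ToyRateLimiter:
    """Toy model of one rate-limit window (hypothetical, for illustration only)."""

    calls: int              # allowed calls in the window
    deferred: bool = False  # True models an expression-based increment-condition
    counter: int = 0

    def handle(self) -> int:
        """Process one request and return the HTTP status it would receive."""
        if self.deferred:
            # Inbound: only the count left by earlier calls is checked.
            if self.counter > self.calls:
                return 429
            # Outbound: the increment happens after the response is available,
            # so the limit is not re-checked until the next incoming call.
            self.counter += 1
            return 200
        # Default: increment and check in the same step.
        self.counter += 1
        return 429 if self.counter > self.calls else 200


eager = ToyRateLimiter(calls=3)
deferred = ToyRateLimiter(calls=3, deferred=True)
print([eager.handle() for _ in range(5)])     # [200, 200, 200, 429, 429]
print([deferred.handle() for _ in range(5)])  # [200, 200, 200, 200, 429]
```

With a limit of 3, the default path rejects the fourth call, while the deferred path lets the fourth call through and rejects the fifth, which matches the "1 call later than usual" wording in the changed line.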

articles/api-management/rate-limit-policy.md

Lines changed: 2 additions & 1 deletion
@@ -6,7 +6,7 @@ author: dlepow
 
 ms.service: azure-api-management
 ms.topic: reference
-ms.date: 07/23/2024
+ms.date: 03/31/2025
 ms.author: danlep
 ---
 
@@ -87,6 +87,7 @@ To understand the difference between rate limits and quotas, [see Rate limits an
 * This policy can be used only once per policy definition.
 * This policy is only applied when an API is accessed using a subscription key.
 * [!INCLUDE [api-management-self-hosted-gateway-rate-limit](../../includes/api-management-self-hosted-gateway-rate-limit.md)] [Learn more](how-to-self-hosted-gateway-on-kubernetes-in-production.md#request-throttling)
+* In a [multi-region](api-management-howto-deploy-multi-region.md) deployment, this policy counts calls separately at each regional gateway in the deployment. The policy doesn't aggregate all call data for the instance.
 
 
 ## Example

includes/api-management-multi-region-concepts.md

Lines changed: 3 additions & 1 deletion
@@ -2,7 +2,7 @@
 author: dlepow
 ms.service: azure-api-management
 ms.topic: include
-ms.date: 02/12/2024
+ms.date: 03/31/2025
 ms.author: danlep
 ---
 
@@ -20,4 +20,6 @@ ms.author: danlep
 * If a region goes offline, API requests are automatically routed around the failed region to the next closest gateway.
 
 * If the primary region goes offline, the API Management management plane and developer portal become unavailable, but secondary regions continue to serve API requests using the most recent gateway configuration.
+
+* If configured, the [rate-limit](/azure/api-management/rate-limit-policy) and [rate-limit-by-key](/azure/api-management/rate-limit-by-key-policy) policies count calls separately at each regional gateway in the deployment. The policies don't aggregate all call data for the instance.
 
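
The multi-region bullet added across these three files implies that a configured limit applies per regional gateway rather than per instance. The Python sketch below models that under stated assumptions: the `RegionalGateways` class, the region names, and the single never-renewing window are hypothetical and are not taken from the API Management implementation.

```python
from collections import defaultdict


class RegionalGateways:
    """Toy model: each regional gateway keeps its own independent counter."""

    def __init__(self, calls: int) -> None:
        self.calls = calls                # limit configured once in the policy
        self.counters = defaultdict(int)  # region name -> calls counted there

    def handle(self, region: str) -> int:
        """Count one call at the given region and return its HTTP status."""
        self.counters[region] += 1
        return 429 if self.counters[region] > self.calls else 200


gateways = RegionalGateways(calls=10)
regions = ["westus", "eastus"]  # hypothetical region names

# Send 10 calls to each region: no single region exceeds its own limit.
statuses = [gateways.handle(r) for r in regions for _ in range(10)]
print(statuses.count(200))        # 20 calls accepted across the instance
print(gateways.handle("westus"))  # 429: westus has now exceeded its own limit
```

In this model a limit of 10 calls admits up to 20 calls across two regions in one window, so a limit intended as an instance-wide cap would need to account for the number of regional gateways.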
