You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/postgresql/flexible-server/concepts-scaling-resources.md
+9-3Lines changed: 9 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,7 +6,7 @@ ms.author: varundhawan
6
6
ms.service: postgresql
7
7
ms.subservice: flexible-server
8
8
ms.topic: conceptual
9
-
ms.date: 12/12/2023
9
+
ms.date: 1/4/2024
10
10
---
11
11
12
12
# Scaling Resources in Azure Database for PostgreSQL - Flexible Server
@@ -34,18 +34,24 @@ Near-zero downtime scaling is a feature designed to minimize downtime when modif
34
34
35
35
### How it works
36
36
37
-
When updating your Flexible server in scaling scenarios, we create a new copy of your server (VM) with the updated configuration, synchronize it with your current one, briefly switch to the new copy with a 30-second interruption, and retire the old server, all at no extra cost to you. This process allows for seamless updates while minimizing downtime and ensuring cost-efficiency. This scaling process is triggered when changes are made to the storage and compute tiers, and the experience remains consistent for both, HA and non-HA servers. This feature is enabled in all Azure regions* and there's **no customer action required** to use this capability.
37
+
When updating your Flexible server in scaling scenarios, we create a new copy of your server (VM) with the updated configuration, synchronize it with your current one, briefly switch to the new copy with a 30-second interruption, and retire the old server, all at no extra cost to you. This process allows for seamless updates while minimizing downtime and ensuring cost-efficiency. This scaling process is triggered when changes are made to the storage and compute tiers, and the experience remains consistent for both, HA and non-HA servers. This feature is enabled in all Azure regions^ and there's **no customer action required** to use this capability.
38
38
39
39
> [!NOTE]
40
40
> Near-zero downtime scaling process is the _default_ operation. However, in cases where the following limitations are encountered, the system switches to regular scaling, which involves more downtime compared to the near-zero downtime scaling.
41
41
42
+
### Precise Downtime Expectations
43
+
44
+
***Downtime Duration**: In most cases, the downtime ranges from 10 to 30 seconds.
45
+
***Additional Considerations**: After a scaling event, there's an inherent DNS `Time-To-Live` (TTL) period of approximately 30 seconds. This period isn't directly controlled by the scaling process but is a standard part of DNS behavior. So, from a application perspective, the total downtime experienced during scaling could be in the range of **40 to 60 seconds**.
46
+
42
47
#### Limitations
43
48
44
49
- In order for near-zero downtime scaling to work, you should enable all [inbound/outbound connections between the IPs in the delegated subnet when using VNET integrated networking](../flexible-server/concepts-networking-private.md#virtual-network-concepts). If these aren't enabled near zero downtime scaling process will not work and scaling will occur through the standard scaling workflow.
45
50
- Near-zero downtime scaling won't work if there are regional capacity constraints or quota limits on customer subscriptions.
46
51
- Near-zero downtime scaling doesn't work for replica server, as it is only supported on the primary server. For replica server it will automatically go through regular scaling process.
47
52
- Near-zero downtime scaling won't work if a [virtual network injected server with delegated subnet](../flexible-server/concepts-networking-private.md#virtual-network-concepts) doesn't have sufficient usable IP addresses. If you have a standalone server, one extra IP address is necessary, and for a HA-enabled server, two extra IP addresses are required.
48
-
-*For HA enabled servers, near-zero downtime scaling is currently enabled for a limited set of regions. We will be enabling this to more regions in a phased manner based upon the regional capacity.
53
+
- Replication Slots - Be aware that logical replication slots aren't preserved during near-zero downtime failover event. To maintain logical replication slots and ensure data consistency after a scale operation, it is recommended to use the [pg_failover_slot](https://github.com/EnterpriseDB/pg_failover_slots) extension. For more details, refer [Enabling extension in Flexible Server](../flexible-server/concepts-extensions.md#pg_failover_slots-preview).
54
+
- ^ For HA enabled servers, near-zero downtime scaling is currently enabled for a limited set of regions. We will be enabling this to more regions in a phased manner based upon the regional capacity.
0 commit comments