Commit 70cfd64

Merge pull request #8729 from mosbahmajed/workitem-86528
AB#5509: Update cannot-scale-cluster-autoscaler-enabled-node-pool.md
2 parents eebe9d3 + a41215e commit 70cfd64

1 file changed: +47 -17 lines changed

support/azure/azure-kubernetes/create-upgrade-delete/cannot-scale-cluster-autoscaler-enabled-node-pool.md

Lines changed: 47 additions & 17 deletions
@@ -3,14 +3,14 @@ title: Cluster autoscaler fails to scale with cannot scale cluster autoscaler en
 description: Learn how to troubleshoot the cannot scale cluster autoscaler enabled node pool error when your autoscaler isn't scaling up or down.
 author: sgeannina
 ms.author: ninasegares
-ms.date: 10/17/2023
-ms.reviewer: aritraghosh, chiragpa
+ms.date: 04/17/2025
+ms.reviewer: aritraghosh, chiragpa.momajed
 ms.service: azure-kubernetes-service
 ms.custom: sap:Create, Upgrade, Scale and Delete operations (cluster or nodepool)
 ---
 # Cluster autoscaler fails to scale with "cannot scale cluster autoscaler enabled node pool" error

-This article discusses how to resolve the "cannot scale cluster autoscaler enabled node pool" error that appears when scaling a cluster with an autoscaler enabled node pool.
+This article discusses how to resolve the "cannot scale cluster autoscaler enabled node pool" error that occurs when you scale a cluster that has an autoscaler-enabled node pool.

 ## Symptoms

@@ -22,45 +22,75 @@ You receive an error message that resembles the following message:

 ## Troubleshooting checklist

-Azure Kubernetes Service (AKS) uses virtual machine scale sets-based agent pools, which contain cluster nodes and [cluster autoscaling capabilities](/azure/aks/cluster-autoscaler) if enabled.
+Azure Kubernetes Service (AKS) uses Azure Virtual Machine Scale Sets-based agent pools. These pools contain cluster nodes and [cluster autoscaling capabilities](/azure/aks/cluster-autoscaler), if they're enabled.

 ### Check that the cluster virtual machine scale set exists

-1. Sign in to [Azure portal](https://portal.azure.com).
-1. Find the node resource group by searching the following names:
-
-- The default name `MC_{AksResourceGroupName}_{YourAksClusterName}_{AksResourceLocation}`.
-- The custom name (if it was provided at creation).
+1. Sign in to the [Azure portal](https://portal.azure.com).
+1. Find the node resource group by searching for the following names:

+- The default name `MC_{AksResourceGroupName}_{YourAksClusterName}_{AksResourceLocation}`
+- The custom name (if it was provided at creation)
+>
 > [!NOTE]
-> When you create a new cluster, AKS automatically creates a second resource group to store the AKS resources. For more information, see [Why are two resource groups created with AKS?](/azure/aks/faq#why-are-two-resource-groups-created-with-aks)
+> When you create a cluster, AKS automatically creates a second resource group to store the AKS resources. For more information, see [Why are two resource groups created with AKS?](/azure/aks/faq#why-are-two-resource-groups-created-with-aks)

-1. Check the list of resources and make sure that there's a virtual machine scale set.
+1. Check the list of resources to make sure that a virtual machine scale set exists.
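
The portal check described in these steps can likely be approximated from the Azure CLI as well. The following is a minimal sketch and is not part of the commit: it assumes the Azure CLI is installed and signed in, and the function names (`default_node_rg`, `node_rg_for`, `list_cluster_vmss`) are hypothetical helpers introduced only for illustration.

```bash
# Hypothetical helpers (not from the article); they only wrap existing az commands.

# Build the default node resource group name:
# MC_{AksResourceGroupName}_{YourAksClusterName}_{AksResourceLocation}
default_node_rg() {
  printf 'MC_%s_%s_%s\n' "$1" "$2" "$3"
}

# Ask AKS for the node resource group it actually uses.
# This also covers a custom name that was provided at creation.
node_rg_for() {
  az aks show --resource-group "$1" --name "$2" \
    --query nodeResourceGroup --output tsv
}

# List the scale set names in the node resource group.
# Empty output suggests that the scale set is missing.
list_cluster_vmss() {
  az vmss list --resource-group "$1" --query "[].name" --output tsv
}
```

For example, `list_cluster_vmss "$(node_rg_for <resource-group-name> <aks-cluster-name>)"` should print one scale set name per line when the scale sets are still present.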

 ## Cause 1: The cluster virtual machine scale set was deleted

-Deleting the virtual machine scale set attached to the cluster causes the cluster autoscaler to fail. It also causes issues when provisioning resources such as nodes and pods.
+If you delete the virtual machine scale set that's attached to the cluster, this action causes the cluster autoscaler to fail. It also causes issues when you provision resources such as nodes and pods.

 > [!NOTE]
-> Modifying any resource under the node resource group in the AKS cluster is an unsupported action and will cause cluster operation failures. You can prevent changes from being made to the node resource group by [blocking users from modifying resources](/azure/aks/cluster-configuration#fully-managed-resource-group-preview) managed by the AKS cluster.
+> Modifying any resource under the node resource group in the AKS cluster is an unsupported action and causes cluster operation failures. You can prevent changes from being made to the node resource group by [blocking users from modifying resources](/azure/aks/cluster-configuration#fully-managed-resource-group-preview) that are managed by the AKS cluster.
+
+### Reconcile node pool
+
+If the cluster virtual machine scale set is accidentally deleted, you can reconcile the node pool by using `az aks nodepool update`:
+
+```bash
+# Update Node Pool Configuration
+az aks nodepool update --resource-group <resource-group-name> --cluster-name <cluster-name> --name <nodepool-name> --tags <tags> --node-taints <taints> --labels <labels>
+
+# Verify the Update
+az aks nodepool show --resource-group <resource-group-name> --cluster-name <cluster-name> --name <nodepool-name>
+```
+Monitor the node pool to make sure that it's functioning as expected and that all nodes are operational.

 ## Cause 2: Tags or any other properties were modified from the node resource group

-You may receive scaling errors if you modify or delete Azure-created tags and other resource properties in the node resource group. For more information, see [Can I modify tags and other properties of the AKS resources in the node resource group?](/azure/aks/faq#can-i-modify-tags-and-other-properties-of-the-aks-resources-in-the-node-resource-group)
+You may experience scaling errors if you modify or delete Azure-created tags and other resource properties in the node resource group. For more information, see [Can I modify tags and other properties of the AKS resources in the node resource group?](/azure/aks/faq#can-i-modify-tags-and-other-properties-of-the-aks-resources-in-the-node-resource-group)
+
+### Reconcile node resource group tags
+
+Use the Azure CLI to make sure that the node resource group has the correct tags for AKS name and the AKS group name:
+
+```bash
+# Add or update tags for AKS name and AKS group name
+az group update --name <node-resource-group-name> --set tags.AKS-Managed-Cluster-Name=<aks-managed-cluster-name> tags.AKS-Managed-Cluster-RG=<aks-managed-cluster-rg>
+
+# Verify the tags
+az group show --name <node-resource-group-name> --query "tags"
+```
+Monitor the resource group to make sure that the tags are correctly applied and that the resource group is functioning as expected.

 ## Cause 3: The cluster node resource group was deleted

-Deleting the cluster node resource group causes issues when provisioning the infrastructure resources required by the cluster, which causes the cluster autoscaler to fail.
+Deleting the cluster node resource group causes issues when you provision the infrastructure resources that are required by the cluster. This action causes the cluster autoscaler to fail.
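
To tell this cause apart from the others, it may help to confirm whether the node resource group still exists. The sketch below is an illustration rather than part of the commit: it assumes a signed-in Azure CLI, and `node_rg_exists` is a hypothetical helper name.

```bash
# Hypothetical helper (not from the article).
# `az group exists` prints "true" or "false"; turn that into an exit status.
node_rg_exists() {
  [ "$(az group exists --name "$1")" = "true" ]
}
```

A possible use is `node_rg_exists <node-resource-group-name> || echo "node resource group is missing"`.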

 ## Solution: Update the cluster to the goal state without changing the configuration

-To resolve this issue, you can run the following command to recover the deleted virtual machine scale set or any tags (missing or modified):
+To resolve this issue, run the following command to recover the deleted virtual machine scale set or any tags (missing or modified).

 > [!NOTE]
-> It might take a few minutes until the operation completes.
+> It might take a few minutes until the operation finishes.

 ```azurecli
 az aks update --resource-group <resource-group-name> --name <aks-cluster-name>
 ```
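
Because the `az aks update` operation can take a few minutes, one way to confirm that it finished is to poll the cluster's provisioning state. This is a hedged sketch that is not part of the commit: the helper names `cluster_state` and `wait_until_succeeded`, the `POLL_SECONDS` variable, and the default of 30 attempts are assumptions made for illustration.

```bash
# Hypothetical helpers (not from the article).

# Print the cluster's current provisioning state (for example, Succeeded).
cluster_state() {
  az aks show --resource-group "$1" --name "$2" \
    --query provisioningState --output tsv
}

# Poll until the state is Succeeded, or give up after a number of attempts.
# $3 is the attempt count (default 30); POLL_SECONDS is the delay (default 20).
wait_until_succeeded() {
  attempts="${3:-30}"
  while [ "$attempts" -gt 0 ]; do
    if [ "$(cluster_state "$1" "$2")" = "Succeeded" ]; then
      return 0
    fi
    attempts=$((attempts - 1))
    sleep "${POLL_SECONDS:-20}"
  done
  return 1
}
```

For example, `wait_until_succeeded <resource-group-name> <aks-cluster-name> && echo "cluster reconciled"`.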

+### Additional troubleshooting tips
+
+- Check the Azure Activity Log for any recent changes or deletions.
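
The Activity Log check can probably be done from the Azure CLI too. The following sketch is not part of the commit: it assumes a signed-in Azure CLI, `recent_deletes` is a hypothetical helper name, and the seven-day default window and the `delete` substring filter are illustrative choices.

```bash
# Hypothetical helper (not from the article).
# List recent delete operations recorded for a resource group.
# $1 is the resource group name; $2 is the look-back window (default 7d).
recent_deletes() {
  az monitor activity-log list \
    --resource-group "$1" \
    --offset "${2:-7d}" \
    --query "[?contains(operationName.value, 'delete')].{time:eventTimestamp, operation:operationName.value, caller:caller, status:status.value}" \
    --output table
}
```

For example, `recent_deletes <node-resource-group-name> 2d` limits the search to the last two days. The command returns a capped number of events by default, so a busy resource group might need a narrower window.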
+
 [!INCLUDE [Azure Help Support](../../../includes/azure-help-support.md)]
