Skip to content

Commit 630f71f

Browse files
committed
Incorp of changes
1 parent 6116d66 commit 630f71f

File tree

3 files changed

+46
-14
lines changed

3 files changed

+46
-14
lines changed

AKS-Arc/TOC.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -194,7 +194,7 @@
194194
- name: BGP with FRR not working
195195
href: connectivity-troubleshoot.md
196196
- name: Cluster status stuck during upgrade
197-
href: tsg-upgrade-issues.md
197+
href: cluster-upgrade-status.md
198198
- name: Reference
199199
items:
200200
- name: Azure CLI

AKS-Arc/aks-troubleshoot.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ title: Troubleshoot common issues in AKS enabled by Azure Arc
33
description: Learn about common issues and workarounds in AKS enabled by Arc.
44
ms.topic: how-to
55
author: sethmanheim
6-
ms.date: 06/18/2025
6+
ms.date: 06/27/2025
77
ms.author: sethm
88
ms.lastreviewed: 04/01/2025
99
ms.reviewer: abha
@@ -28,6 +28,7 @@ The following sections describe known issues for AKS enabled by Azure Arc:
2828
| AKS steady state | [AKS Arc telemetry pod consumes too much memory and CPU](telemetry-pod-resources.md) | Active |
2929
| AKS steady state | [Disk space exhaustion on control plane VMs due to accumulation of kube-apiserver audit logs](kube-apiserver-log-overflow.md) | Active |
3030
| AKS cluster delete | [Deleted AKS Arc cluster still visible on Azure portal](deleted-cluster-visible.md) | Active |
31+
| AKS cluster upgrade | [AKS Arc cluster stuck in "Upgrading" state](cluster-upgrade-status.md) | Fixed in 2505 release |
3132
| AKS cluster delete | [Can't fully delete AKS Arc cluster with PodDisruptionBudget (PDB) resources](delete-cluster-pdb.md) | Fixed in 2503 release |
3233
| Azure portal | [Can't see VM SKUs on Azure portal](check-vm-sku.md) | Fixed in 2411 release |
3334
| MetalLB Arc extension | [Connectivity issues with MetalLB](load-balancer-issues.md) | Fixed in 2411 release |

AKS-Arc/tsg-upgrade-issues.md renamed to AKS-Arc/cluster-upgrade-status.md

Lines changed: 43 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,10 @@
11
---
2-
title: Troubleshoot the issue where the cluster is stuck in Upgrading state
2+
title: Troubleshoot issue in which the cluster is stuck in Upgrading state
33
description: Learn how to troubleshoot and mitigate the issue when an AKS enabled by Arc cluster is stuck in 'Upgrading' state.
44
ms.topic: troubleshooting
55
author: rcheeran
66
ms.author: rcheeran
7-
ms.date: 06/26/2025
7+
ms.date: 06/27/2025
88
ms.reviewer: abha
99

1010
---
@@ -34,14 +34,14 @@ Upgrading the AKSArc cluster. This operation might take a while...
3434
"type": "CustomLocation"
3535
},
3636
"id": "/subscriptions/fbaf508b-cb61-4383-9cda-a42bfa0c7bc9/resourceGroups/Bellevue/providers/Microsoft.Kubernetes/ConnectedClusters/Bel-cluster/providers/Microsoft.HybridContainerService/ProvisionedClusterInstances/default",
37-
"name": "default",
38-
"properties": {
39-
"kubernetesVersion": "1.30.4",
40-
"provisioningState": "Succeeded",
41-
"currentState": "Upgrading",
42-
"errorMessage": null,
43-
"operationStatus": null
44-
"agentPoolProfiles": [
37+
"name": "default",
38+
"properties": {
39+
"kubernetesVersion": "1.30.4",
40+
"provisioningState": "Succeeded",
41+
"currentState": "Upgrading",
42+
"errorMessage": null,
43+
"operationStatus": null
44+
"agentPoolProfiles": [
4545
{
4646
...
4747
```
@@ -58,9 +58,28 @@ Upgrading the AKSArc cluster. This operation might take a while...
5858
az k8s-extension show -g $res.HybridaksExtension.resourceGroup -c $res.ResourceBridge.name --cluster-type appliances --name hybridaksextension
5959
```
6060

61+
```output
62+
{
63+
"aksAssignedIdentity": null,
64+
"autoUpgradeMinorVersion": false,
65+
"configurationProtectedSettings": {},
66+
"currentVersion": "2.1.211",
67+
"customLocationSettings": null,
68+
"errorInfo": null,
69+
"extensionType": "microsoft.hybridaksoperator",
70+
...
71+
}
72+
```
73+
6174
## Mitigation
6275

63-
You can resolve this issue by running the AKS Arc `update` command. The `update` command restarts the upgrade flow. You can run the `aksarc update` command with placeholder parameters, which do not impact the state of the cluster. So in this case, you can run the `update` command to enable NFS or SMB drivers if those features aren't already enabled. First, check if any of the storage drivers are already enabled:
76+
This issue was fixed in AKS on [Azure Local, version 2505](/azure/azure-local/whats-new?view=azloc-2505&preserve-view=true#features-and-improvements-in-2505). Upgrade your Azure Local deployment to the 2505 build. After you update, [verify](#verification) that the Kubernetes version was upgraded and the `currentState` property of the cluster shows as **Succeeded**.
77+
78+
### Workaround for Azure Linux versions 2503 or 2504
79+
80+
This issue only affects clusters in Azure Local version 2503 or 2504, and on AKS Arc extension versions 2.1.211 or 2.1.223. The mitigation described here is applicable only when you are unable to upgrade to 2505.
81+
82+
You can resolve the issue by running the AKS Arc `update` command. The `update` command restarts the upgrade flow. You can run the `aksarc update` command with placeholder parameters, which do not impact the state of the cluster. So in this case, you can run the `update` command to enable NFS or SMB drivers if those features aren't already enabled. First, check if any of the storage drivers are already enabled:
6483

6584
```azurecli
6685
az login --use-device-code --tenant <Azure tenant ID>
@@ -100,12 +119,24 @@ If both drivers are already enabled on your cluster, you can disable the one tha
100119

101120
## Verification
102121

103-
To confirm the K8s version upgrade is complete, run the following command and check that the `currentState` property in the JSON output is set to `Succeeded`.
122+
To confirm the K8s version upgrade is complete, run the following command and check that the `currentState` property in the JSON output is set to **Succeeded**.
104123

105124
```azurecli
106125
az aksarc show -g <resource_group> -n <cluster_name>
107126
```
108127

128+
```output
129+
...
130+
...
131+
"provisioningState": "Succeeded",
132+
"status": {
133+
"currentState": "Succeeded",
134+
"errorMessage": null,
135+
"operationStatus": null
136+
"controlPlaneStatus": { ...
137+
...
138+
```
139+
109140
## Contact Microsoft Support
110141

111142
If the problem persists, collect the [AKS cluster logs](get-on-demand-logs.md) before you [create a support request](aks-troubleshoot.md#open-a-support-request).

0 commit comments

Comments
 (0)