Skip to content

Commit a9c9490

Browse files
authored
Update troubleshoot-container-storage.md
Add TSG for node pools with taints
1 parent e1e2d03 commit a9c9490

File tree

1 file changed

+39
-1
lines changed

1 file changed

+39
-1
lines changed

articles/storage/container-storage/troubleshoot-container-storage.md

Lines changed: 39 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -50,9 +50,47 @@ To add the `acstor` namespace to the exclusion list, follow these steps:
5050
1. Create a policy that you suspect is blocking the installation of Azure Container Storage.
5151
1. Attempt to install Azure Container Storage in the AKS cluster.
5252
1. Check the logs for the gatekeeper-controller pod to confirm any policy violations.
53-
1. Add the `acstor` namespace to the exclusion list of the policy.
53+
1. Add the `acstor` namespace and `azure-extensions-usage-system` namespace to the exclusion list of the policy.
5454
1. Attempt to install Azure Container Storage in the AKS cluster again.
5555

56+
### Can't install and enable Azure Container Storage in node pools with taints
57+
58+
You may have configured [node taints](https://learn.microsoft.com/en-us/azure/aks/use-node-taints) on the node pools to retrict pods from being scheduled on these node pools. When you try to install and enable Azure Container Storage on these noode pools, it will be blocked because the required pods can't be created in these node pools. This applies to both the system node pool when installing and the user node pools when enabling.
59+
60+
You can check the node taints with the following example:
61+
62+
```bash
63+
$ az aks nodepool list -g $resourceGroup --cluster-name $clusterName --query "[].{PoolName:name, nodeTaints:nodeTaints}"
64+
[
65+
...
66+
{
67+
"PoolName": "nodepoolx",
68+
"nodeTaints": [
69+
"sku=gpu:NoSchedule"
70+
]
71+
}
72+
]
73+
74+
```
75+
76+
You can remove these taints temporarily to unblock and configure them back after you install and enable successfully. You can go to Azure Portal > AKS cluster > Node pools, click your node pool, remove the taints in "Taints and labels
77+
" section. Or you can use the following command to remove taints and confirm the change.
78+
79+
```bash
80+
$ az aks nodepool update -g $resourceGroup --cluster-name $clusterName --name $nodePoolName --node-taints ""
81+
$ az aks nodepool list -g $resourceGroup --cluster-name $clusterName --query "[].{PoolName:name, nodeTaints:nodeTaints}"
82+
[
83+
...
84+
{
85+
"PoolName": "nodepoolx",
86+
"nodeTaints": null
87+
}
88+
]
89+
90+
```
91+
92+
Retry the installing or enabling after you remove node taints successfully. After it's commpleted successfully, you can configure these node taints back to resume the pod scheduling restaints.
93+
5694
### Can't set storage pool type to NVMe
5795

5896
If you try to install Azure Container Storage with Ephemeral Disk, specifically with local NVMe on a cluster where the virtual machine (VM) SKU doesn't have NVMe drives, you get the following error message: *Cannot set --storage-pool-option as NVMe as none of the node pools can support ephemeral NVMe disk*.

0 commit comments

Comments
 (0)