Commit 60cf256

More changes
1 parent 13008ef commit 60cf256

1 file changed

+33
-29
lines changed

azure-stack/hci/manage/troubleshoot-deployment.md

Lines changed: 33 additions & 29 deletions
@@ -10,11 +10,11 @@ ms.date: 08/21/2024
1010

1111
# Troubleshoot Azure portal deployment validation issues for Azure Stack HCI, version 23H2
1212

13-
[!INCLUDE [hci-applies-to-23h2](../../includes/hci-applies-to-23h2.md)]
13+
> Applies to: Azure Stack HCI, version 23H2 running 2405 or later
1414
1515
This article provides guidance on how to troubleshoot deployment validation issues experienced during the deployment of your Azure Stack HCI cluster via the Azure portal.
1616

17-
## Error - deployment vaildation failure
17+
## Error - deployment validation failure
1818

1919
When deploying Azure Stack HCI, version 23H2 via the Azure portal, you might encounter a deployment validation failure.
2020
The "Azure Stack HCI Network - Check network requirements" validation task fails with the following error:
@@ -30,17 +30,17 @@ Example: 192.168.0.1 or 192.168.0.1","Target":null,"Details":null}].
3030
If you go to the **Networking** tab in the Azure portal deployment, within the **Network Intent** configuration, you might see the following error:
3131

3232
```
33-
The selected physical network adapter is not binded to the management virtual Switch.
33+
The selected physical network adapter is not binded to the management virtual switch.
3434
```
3535

3636
## Cause
3737

38-
This issue occurs on deployments triggered after August 6th. The issue only happens if the deployment validation was triggered on the cluster and the validation result was a failure, with subsequent validation retries.
38+
This issue occurs on deployments triggered after August 6. The issue only happens if the deployment validation was triggered on the cluster and the validation result was a failure, with subsequent validation retries.
3939

4040
The issue occurs for the following reason:
4141

4242
- Validation on the device creates a VM switch for network-related tests and deletes it at the end of the tests.
43-
- `DeviceManagementExtension` extension is not detecting the deletion of the VM switch.
43+
- The `DeviceManagementExtension` extension doesn't detect the deletion of the VM switch.
4444

4545
## Recommended resolution
4646

@@ -53,25 +53,25 @@ The multi-step resolution process includes the following steps:
5353
- [Redo the Azure portal](#redo-the-azure-portal)
5454
- [Recreate the lock on the seed node resource](#recreate-the-lock-on-the-seed-node-resource)
5555

56-
### Remove the lock from the seed node
57-
5856
> [!NOTE]
59-
> All the steps in this section need to be performed on the seed node.
57+
> All the steps in this article need to be performed on the seed node.
58+
59+
### Remove the lock from the seed node
6060

61-
Follow this multi-step process to mitigate this validation error:
61+
Follow these steps to remove the lock from the seed node:
6262

6363
1. To remove the lock, in the Azure portal, go to the object via the resource group or within Machines - Azure Arc.
6464
1. In the left pane, go to **Settings > Locks**. You should see a lock named **DoNotDelete**. This is the automatic resource lock that is created when the node is onboarded.
6565
1. Select **Delete** against the lock.
6666

67-
If you attmept the steps in the next section without removing the lock, the **Delete** command fails with the following error:
67+
If you attempt the steps in the next section without removing the lock, the **Delete** command fails with the following error:
6868

6969
```
7070
Some resources failed to be deleted (run with `--verbose` for more information):
7171
/subscriptions/<subid>/resourceGroups/<rgname>/providers/Microsoft.HybridCompute/machines/<machinename>/providers/Microsoft.AzureStackHCI/edgeDevices/default
7272
```
7373

74-
Here is the example output when run with the `--verbose` switch:
74+
Here's the example output when run with the `--verbose` switch:
7575

7676
```Output
7777
(ScopeLocked) The scope '/subscriptions/<subid>/resourceGroups/<rgname>/providers/Microsoft.HybridCompute/machines/<machinename>/providers/Microsoft.AzureStackHCI/edgeDevices/default' cannot perform delete operation because following scope(s) are locked: '/subscriptions/<subid>/resourceGroups/<rgname>/providers/Microsoft.HybridCompute/machines/<machinename>'. Please remove the lock and try again.
@@ -85,7 +85,7 @@ With the lock removed, follow these steps to remove the validation error.
8585

8686
1. Connect to the seed node. Run the following PowerShell command:
8787

88-
```powershell
88+
```PowerShell
8989
Get-VMSwitch
9090
```
9191
@@ -97,7 +97,7 @@ With the lock removed, follow these steps to remove the validation error.
9797
Remove-VMSwitch -Name "<VM Switch Name>" -Force
9898
```
9999
100-
Make sure to use the VM switch name from the `Get-VMSwitch` command. If you didn't intentionally create a VM switch, the `Get-VMSwitch` command will have no results. The failure occurs because the Network Validation Step cleaned up the VM switch, but the `DeviceManagementExtension` didn't detect the cleanup.
100+
Make sure to use the VM switch name from the `Get-VMSwitch` command. If you didn't intentionally create a VM switch, the `Get-VMSwitch` command has no results. The failure occurs because the Network Validation Step cleaned up the VM switch, but the `DeviceManagementExtension` didn't detect the cleanup.
101101
102102
1. Confirm the device cleanup and wait for the cleanup to complete.
103103
@@ -107,7 +107,7 @@ After the VM switch on the device is removed, clean up the Edge Device ARM resou
107107
108108
1. On a client that can access Azure, verify that Azure CLI is installed, or install it: [Install Azure CLI on Windows](https://learn.microsoft.com/en-us/cli/azure/install-azure-cli-windows?tabs=azure-cli)
109109
- You can verify the installation by running: `az`
110-
- If installed, this will output a `"Welcome to Azure CLI!"` message with available commands.
110+
- If installed, this outputs a `"Welcome to Azure CLI!"` message with available commands.
111111
112112
1. Sign in to Azure with Azure CLI. Run the following command:
113113
@@ -119,48 +119,52 @@ After the VM switch on the device is removed, clean up the Edge Device ARM resou
119119
120120
1. To set a specific subscription, run the following command:
121121
122-
```azurecli
122+
```AzureCLI
123123
az account set --subscription "<subGUID>"
124124
```
125125
126126
Replace the value in the above example command with the appropriate value for `<subGUID>`.
127-
1. Output the data stored within the `edgeDevices` resource that has the incorrectly stored VM Switch information with the following command:
127+
1. Output the data stored within the `edgeDevices` resource that has the incorrectly stored VM Switch information. Run the following command:
128128
129-
```azurecli
129+
```AzureCLI
130130
az resource show --ids "/subscriptions/<subGUID>/resourceGroups/<resourceGROUPNAME>/providers/Microsoft.HybridCompute/machines/<machineNAME>/providers/Microsoft.AzureStackHCI/edgeDevices/default"
131131
```
132132
133133
Replace the values in the above example command with the appropriate values for `<subGUID>`, `<resourceGROUPNAME>`, and `<machineNAME>`.
134134
135-
Here is an example output:
135+
Here's an example output:
136136
137137
```output
138138
az resource show --ids "/subscriptions/<Subscription ID>/resourceGroups/<Resource Group Name>/providers/Microsoft.HybridCompute/machines/ASRR1N26R15U33/providers/Microsoft.AzureStackHCI/edgeDevices/default"
139139
```
140140
141-
The output of this command will show quite a bit of detail about the <machineNAME> used in the command. Near the bottom of the output, you will see a section for `"switchDetails"`, which will more than likely show the following (which is the Validation VM Switch that was created and cleaned up on the device, but was not detected by the DeviceManagementExtension and updated cloud-side):
141+
The output of this command shows quite a bit of detail about the `<machineNAME>` used in the command. Near the bottom of the output, there's a section for `"switchDetails"`, which will more than likely show the following values. This is the validation VM switch that was created and cleaned up on the device, but the cleanup wasn't detected by the `DeviceManagementExtension` and wasn't updated cloud-side:
142142
`"switchName": "ConvergedSwitch(managementcompute)",`
143143
`"switchType": "External"`
144144
145145
1. After confirming the `show` command worked by outputting the `edgeDevices` data, and likely confirming the `"switchDetails"`, it is time to `delete` the resource from ARM so it can be refreshed appropriately from the device.
146146
147-
1. Delete the `edgeDevices` resource which has the incorrectly stored VM Switch information with the following command:
148-
`az resource delete --ids "/subscriptions/<subGUID>/resourceGroups/<resourceGROUPNAME>/providers/Microsoft.HybridCompute/machines/<machineNAME>/providers/Microsoft.AzureStackHCI/edgeDevices/default"`
147+
1. Delete the `edgeDevices` resource, which has the incorrectly stored VM switch information. Run the following command:
148+
149+
```AzureCLI
150+
az resource delete --ids "/subscriptions/<subGUID>/resourceGroups/<resourceGROUPNAME>/providers/Microsoft.HybridCompute/machines/<machineNAME>/providers/Microsoft.AzureStackHCI/edgeDevices/default"
151+
```
149152
150-
Replace the values (remember to remove the "<>" characters as well) in the above example command with the appropriate values for:
153+
Replace the values (remember to remove the \<\> characters as well) with the appropriate values for:
151154
`<subGUID>`
152155
`<resourceGROUPNAME>`
153156
`<machineNAME>`
154157
155-
This is the same resource `--ids` from the `show`, so you can just leverage that same string. In fact, you could just "up arrow" in the console and replace `show` with `delete`.
158+
This is the same resource `--ids` from the `show`, so you can just use that same string. In fact, you could just "up arrow" in the console and replace `show` with `delete`.
156159
157-
Here is an example output:
160+
Here's an example output:
158161
159162
```Output
160163
az resource delete --ids "/subscriptions/d41eb627-825d-4419-a14d-c6ad485f4110/resourceGroups/EDGECI-REGISTRATION-rr1n26r1512-kXOKQuGV/providers/Microsoft.HybridCompute/machines/ASRR1N26R15U33/providers/Microsoft.AzureStackHCI/edgeDevices/default"
161164
```
162-
- **Note:** Run just like this, there will be no output from this command, it will either work and return the command prompt, or present an error. It should not present an error, if it does, that will require additional troubleshooting.
163-
1. Verify the deletion of the resource by running the `show` command again. Here is an example output:
165+
When run successfully, this command produces no output. It either completes and returns to the command prompt or presents an error. It shouldn't present an error; if it does, more troubleshooting is required.
166+
167+
1. Verify the deletion of the resource by running the `show` command again. Here's an example output:
164168
165169
```Output
166170
(ResourceNotFound) The resource 'Microsoft.HybridCompute/machines/<machineNAME>/providers/Microsoft.AzureStackHCI/edgeDevices/default' could not be found.
@@ -176,7 +180,7 @@ Follow these steps to refresh the cloud data:
176180
177181
1. Restart the `DeviceManagementService` on the seed node. Run the following PowerShell command:
178182
179-
```powershell
183+
```PowerShell
180184
Restart-Service DeviceManagementService
181185
```
182186
@@ -187,7 +191,7 @@ Follow these steps to refresh the cloud data:
187191
188192
### Redo the Azure portal
189193
190-
With device and cloud data now back in sync, you can go to the Azure portal and provide the deployment inputs. The previous step prevents any cached information from previous attempts.
194+
With device and cloud data now back in sync, you can go to the Azure portal and provide the deployment inputs. The previous step prevents any cached information from previous attempts from being reused.
191195
192196
Follow these steps in the Azure portal:
193197
@@ -199,7 +203,7 @@ Follow these steps in the Azure portal:
199203
200204
1. Confirm the information on the subsequent pages. You should see the following changes:
201205
- On the **Networking** page, you should no longer see the `The selected physical network adapter is not binded to the management virtual Switch` error that might have been seen previously.
202-
- On the **Validation** page at the end, if you are past the original issue, the `deploymentdata.physicalnodes[0].ipv4address is not a valid IPv4 address` error won't be displayed.
206+
- On the **Validation** page at the end, if you're past the original issue, the `deploymentdata.physicalnodes[0].ipv4address is not a valid IPv4 address` error won't be displayed.
203207
204208
1. If no other validation issues occur, start the deployment.
205209
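
The `show`, `delete`, and verification commands in this article all take the same `--ids` string, so it can help to build that string once and reuse it. The following is a minimal sketch, not part of the documented procedure: `edge_device_id` is a hypothetical helper function (it isn't an Azure CLI command), and the placeholder values must be replaced with your own subscription ID, resource group name, and machine name.

```shell
# Sketch only: edge_device_id is a hypothetical helper, not part of Azure CLI.
# It assembles the edgeDevices resource ID used by the show and delete commands.
# Arguments: $1 = subscription ID, $2 = resource group name, $3 = machine name.
edge_device_id() {
  printf '/subscriptions/%s/resourceGroups/%s/providers/Microsoft.HybridCompute/machines/%s/providers/Microsoft.AzureStackHCI/edgeDevices/default\n' "$1" "$2" "$3"
}

# Build the ID once. The values below are placeholders, not real identifiers.
id="$(edge_device_id "<subGUID>" "<resourceGROUPNAME>" "<machineNAME>")"
echo "$id"

# With real values substituted, the same string can be passed to each step:
#   az resource show   --ids "$id"   # inspect the stored switchDetails
#   az resource delete --ids "$id"   # remove the stale edgeDevices resource
#   az resource show   --ids "$id"   # rerun show to verify the deletion
```

Building the ID in one place avoids retyping the long path and reduces the chance that the `show` and `delete` commands target different resources.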
