Skip to content

Commit 33ca9a4

Browse files
committed
Edits + Acrolinx.
1 parent 7b466a0 commit 33ca9a4

File tree

1 file changed

+23
-23
lines changed

1 file changed

+23
-23
lines changed

articles/databox-online/azure-stack-edge-gpu-troubleshoot-virtual-machine-provisioning.md

Lines changed: 23 additions & 23 deletions
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ ms.author: alkohli
1616

1717
This article describes how to troubleshoot common errors when deploying virtual machines on an Azure Stack Edge Pro GPU device. The article provides guidance for investigating the most common issues that cause VM provisioning timeouts and issues during network interface and VM creation.
1818

19-
To diagnose any VM provisioning failure, you'll review guest logs for the failed virtual machine. For steps to collect VM guest logs and include them in a Support package, see [Collect guest logs for VMs on Azure Stack Edge Pro](azure-stack-edge-gpu-collect-virtual-machine-guest-logs.md).
19+
To diagnose any VM provisioning failure, review guest logs for the failed virtual machine. For steps to collect VM guest logs and include them in a Support package, see [Collect guest logs for VMs on Azure Stack Edge Pro](azure-stack-edge-gpu-collect-virtual-machine-guest-logs.md).
2020

2121
For guidance on issues that prevent successful upload of a VM image before your VM deployment, see [Troubleshoot virtual machine image uploads in Azure Stack Edge Pro GPU](azure-stack-edge-gpu-troubleshoot-virtual-machine-image-upload.md).
2222

@@ -34,14 +34,14 @@ The following issues are the top causes of VM provisioning timeouts:
3434
- The VM image that you used to deploy the VM wasn't prepared correctly. [Learn more](#vm-image-not-prepared-correctly)
3535
- The default gateway and DNS server couldn't be reached from the guest VM. [Learn more](#gateway-dns-server-couldnt-be-reached-from-guest-vm)
3636
- During a `cloud init` installation, `cloud init` either didn't run or there were issues while it was running. (Linux VMs only) [Learn more](#cloud-init-issues-linux-vms)
37-
- For a Linux VM deployed using a custom VM image, the Provisioning flags in the /etc/waagent.conf file are not correct. (Linux VMs only) [Learn more](#provisioning-flags-set-incorrectly-linux-vms)
37+
- For a Linux VM deployed using a custom VM image, the Provisioning flags in the /etc/waagent.conf file aren't correct. (Linux VMs only) [Learn more](#provisioning-flags-set-incorrectly-linux-vms)
3838
- Primary network interface attached to a SRIOV enabled virtual switch [Learn more](#primary-network-interface-attached-to-a-sriov-enabled-virtual-switch)
3939

4040
### IP assigned to the VM is already in use
4141

4242
**Error description:** The VM was assigned a static IP address that is already in use, and VM provisioning failed. This error happens when the IP address is in use in the subnet on which the VM is deployed. When you deploy a VM via the Azure portal, the process checks for an existing IP address within your device but can't check IP addresses of other services or virtual machines that might also be on your subnet.
4343

44-
**Suggested solution:** Use a static IP address that is not in use, or use a dynamic IP address provided by the DHCP server.
44+
**Suggested solution:** Use a static IP address that isn't in use, or use a dynamic IP address provided by the DHCP server.
4545

4646
To check for a duplicate IP address:
4747

@@ -71,7 +71,7 @@ For an overview of requirements, see [Create custom VM images for an Azure Stack
7171

7272
### Gateway, DNS server couldn't be reached from guest VM
7373

74-
**Error description:** If the default gateway and DNS server can't be reached during VM deployment, VM provisioning will time out, and the VM deployment will fail.
74+
**Error description:** If the default gateway and DNS server can't be reached during VM deployment, VM provisioning times out and the VM deployment fails.
7575

7676
**Suggested solution:** Verify that the default gateway and DNS server can be reached from the VM. Then repeat VM deployment.
7777

@@ -91,7 +91,7 @@ To verify that the default gateway and DNS server can be reached from the VM, do
9191

9292
### `cloud init` issues (Linux VMs)
9393

94-
**Error description:** `cloud init` did not run, or there were issues while `cloud init` was running. `cloud-init` is used to customize a Linux VM when the VM boots for the first time. For more information, see [cloud-init support for virtual machines in Azure](../virtual-machines/linux/using-cloud-init.md).
94+
**Error description:** `cloud init` didn't run, or there were issues while `cloud init` was running. `cloud-init` is used to customize a Linux VM when the VM boots for the first time. For more information, see [cloud-init support for virtual machines in Azure](../virtual-machines/linux/using-cloud-init.md).
9595

9696
**Suggested solutions:** To find issues that occurred when `cloud init` was run:
9797
1. [Connect to the VM](azure-stack-edge-gpu-deploy-virtual-machine-portal.md#connect-to-a-vm).
@@ -107,7 +107,7 @@ To check for some of the most common issues that prevent `cloud init` from runni
107107

108108
`cloud-init --version`
109109

110-
The command should return the cloud init version number. If the image is not `cloud init`-based, the command won't return version information.
110+
The command should return the cloud init version number. If the image isn't `cloud init`-based, the command won't return version information.
111111

112112
To get help with `cloud init` options, run the following command:
113113

@@ -119,7 +119,7 @@ To check for some of the most common issues that prevent `cloud init` from runni
119119

120120
![Illustration of a cloud-init log entry for a VM image with the Data Source set to Azure. The identifying text is highlighted.](./media/azure-stack-edge-gpu-troubleshoot-virtual-machine-provisioning/cloud-init-log-entry-01.png)
121121

122-
If the data source is not set to Azure, you may need to revise your `cloud init` script. For more information, see [Diving deeper into cloud-init](../virtual-machines/linux/cloud-init-deep-dive.md).
122+
If the data source isn't set to Azure, you may need to revise your `cloud init` script. For more information, see [Diving deeper into cloud-init](../virtual-machines/linux/cloud-init-deep-dive.md).
123123

124124

125125
### Provisioning flags set incorrectly (Linux VMs)
@@ -135,15 +135,15 @@ To check for some of the most common issues that prevent `cloud init` from runni
135135

136136
### Primary network interface attached to a SRIOV enabled virtual switch
137137

138-
**Error description:** The primary network interface attached to a single root I/O virtualization (SRIOV) interface-enabled virtual switch caused network traffic to bypass the hyper-v, so the host could not receive DHCP requests from the VM, resulting in a provisioning timeout.
138+
**Error description:** The primary network interface attached to a single root I/O virtualization (SRIOV) interface-enabled virtual switch caused network traffic to bypass the Hyper-V, so the host couldn't receive DHCP requests from the VM, resulting in a provisioning timeout.
139139

140140
**Suggested solutions:**
141141

142142
- Connect the VM primary network interface to a virtual switch without enabling accelerated networking.
143143

144-
- On an Azure Stack Edge Pro 1 device, virtual switches created on Port 1 to Port 4 do not enable accelerated networking. On Port 5 or Port 6, virtual switches will enable accelerated networking by default.
144+
- On an Azure Stack Edge Pro 1 device, virtual switches created on Port 1 to Port 4 don't enable accelerated networking. On Port 5 or Port 6, virtual switches enable accelerated networking by default.
145145

146-
- On an Azure Stack Edge Pro 2 device, virtual switches created on Port 1 or Port 2 do not enable accelerated networking. On Port 3 or Port 4, virtual switches will enable accelerated networking by default.
146+
- On an Azure Stack Edge Pro 2 device, virtual switches created on Port 1 or Port 2 don't enable accelerated networking. On Port 3 or Port 4, virtual switches enable accelerated networking by default.
147147

148148
## Network interface creation issues
149149

@@ -158,7 +158,7 @@ To verify whether the network interface was created successfully, do these steps
158158

159159
1. In the Azure portal, go to the Azure Stack Edge resource for your device (go to **Edge Services** > **Virtual machines**). Then select **Deployments**, and navigate to the VM deployment.
160160

161-
1. If a network interface was not created successfully, you'll see the following error.
161+
1. If a network interface wasn't created successfully, you see the following error.
162162

163163
![Screenshot of the error displayed in the Azure portal when network interface creation fails during VM deployment on an Azure Stack Edge device.](./media/azure-stack-edge-gpu-troubleshoot-virtual-machine-provisioning/nic-creation-failed-01.png)
164164

@@ -175,7 +175,7 @@ This section covers common issues that occur during VM creation.
175175

176176
**Suggested solution:** Use the following steps to delete the existing Marketplace image, and then create a new Marketplace image from Azure portal.
177177

178-
1. From Azure portal, delete the existin Marketplace image.
178+
1. From Azure portal, delete the existing Marketplace image.
179179

180180
1. List the ingestion and the BlobDownload ingestion job for the Marketplace image. Use these steps to [Connect to ARM](azure-stack-edge-gpu-connect-resource-manager.md?tabs=Az). Run the following script to list ingestion jobs:
181181

@@ -207,21 +207,21 @@ This section covers common issues that occur during VM creation.
207207
208208
![Screenshot of example syntax to find ingestion job name.](./media/azure-stack-edge-gpu-troubleshoot-virtual-machine-provisioning/ingestion-job-name.png)
209209
210-
1. If the ingestion job is found in Step 1, use the following steps to delete the ingestion job.
210+
1. If the ingestion job is found in Step 1, use the following steps to delete the ingestion job and delete the image. For example, the ingestion job name in the example above is `ubuntu-18-04`. Additionally, `Subscription ID` and `Resource group` name can be found in the example.
211211
212-
1. Run the following commands to delete image. For example, the ingestion job name in the example above is `ubuntu-18-04`. Additionally, `Subscription ID` and `Resource group` name can be found in the example.
213-
214-
```powershell
215-
$uri2 = "https://management.<appliance name>.<DNS domain>/subscriptions/sid/resourceGroups/rgname/providers/Microsoft.AzureBridge/locations/dbelocal/ingestionJobs/<ingestion job name>?api-version=2018-06-01"
212+
```powershell
213+
$uri2 = "https://management.<appliance name>.<DNS domain>/subscriptions/sid/resourceGroups/rgname/providers/Microsoft.AzureBridge/locations/dbelocal/ingestionJobs/<ingestion job name>?api-version=2018-06-01"
214+
```
216215

217-
Invoke-RestMethod -Method DELETE -Uri $uri2 -Headers $headers
218-
```
216+
```powershell
217+
Invoke-RestMethod -Method DELETE -Uri $uri2 -Headers $headers
218+
```
219219

220-
1. Create a new Marketplace image from Azure portal.
220+
1. Follow steps to [Create a new VM image from Azure Marketplace](azure-stack-edge-create-a-vm-from-azure-marketplace.md).
221221

222222
### Not enough memory to create the VM
223223

224-
**Error description:** When VM creation fails because of insufficient memory, you'll see the following error.
224+
**Error description:** When VM creation fails because of insufficient memory, you see the following error.
225225

226226
![Screenshot of the error displayed in the Azure portal when VM creation fails on an Azure Stack Edge device.](./media/azure-stack-edge-gpu-troubleshoot-virtual-machine-provisioning/vm-creation-failed-01.png)
227227

@@ -243,12 +243,12 @@ The memory available for the deployment of a VM is constrained by several factor
243243

244244
### Insufficient number of GPUs to create GPU VM
245245

246-
If you try to deploy a VM on a GPU device that already has Kubernetes enabled, no GPUs will be available, and VM provisioning will fail with the following error:
246+
If you try to deploy a VM on a GPU device that already has Kubernetes enabled, no GPUs are available, and VM provisioning fails with the following error:
247247

248248
![Screenshot of the error displayed in the Azure portal when creation of a GPU VM fails because of no available GPUs on an Azure Stack Edge device.](./media/azure-stack-edge-gpu-troubleshoot-virtual-machine-provisioning/gpu-vm-creation-failed-01.png)
249249

250250
**Possible causes:**
251-
If Kubernetes is enabled before the VM is created, Kubernetes will use all the available GPUs, and you won’t be able to create any GPU-size VMs. You can create as many GPU-size VMs as the number of available GPUs. Your Azure Stack Edge device can be equipped with 1 or 2 GPUs.
251+
If Kubernetes is enabled before the VM is created, Kubernetes uses all the available GPUs, and you won’t be able to create any GPU-size VMs. You can create as many GPU-size VMs as the number of available GPUs. Your Azure Stack Edge device can be equipped with 1 or 2 GPUs.
252252

253253
**Suggested solution:** For VM deployment options on a 1-GPU or 2-GPU device with Kubernetes configured, see [GPU VMs and Kubernetes](azure-stack-edge-gpu-overview-gpu-virtual-machines.md#gpu-vms-and-kubernetes).
254254

0 commit comments

Comments
 (0)