Skip to content

Commit 740f1aa

Browse files
authored
PR review edit, Microsoft style
1 parent be4a4f6 commit 740f1aa

File tree

1 file changed

+34
-31
lines changed

1 file changed

+34
-31
lines changed

articles/machine-learning/how-to-troubleshoot-online-endpoints.md

Lines changed: 34 additions & 31 deletions
Original file line numberDiff line numberDiff line change
@@ -92,17 +92,17 @@ Add `--help` and/or `--debug` to commands to see more information. Include the `
9292

9393
Below is a list of common deployment errors that are reported as part of the deployment operation status.
9494

95-
* [OutOfQuota](#err-outofquota)
96-
* [OutOfCapacity](#err-outofcapacity)
97-
* [BadArgument](#err-badargument)
98-
* [ResourceNotReady](#err-resourcenotready)
99-
* [ResourceNotFound](#err-resourcenotfound)
100-
* [OperationCancelled](#err-operationcancelled)
101-
* [InternalServerError](#err-internalservererror)
95+
* [OutOfQuota](#error-outofquota)
96+
* [OutOfCapacity](#error-outofcapacity)
97+
* [BadArgument](#error-badargument)
98+
* [ResourceNotReady](#error-resourcenotready)
99+
* [ResourceNotFound](#error-resourcenotfound)
100+
* [OperationCancelled](#error-operationcancelled)
101+
* [InternalServerError](#error-internalservererror)
102102

103103
### ERROR: OutOfQuota
104104

105-
Below is a list of common resources that may run out of quota when using Azure services.
105+
Below is a list of common resources that might run out of quota when using Azure services:
106106

107107
* [CPU](#cpu-quota)
108108
* [Role assignments](#role-assignment-quota)
@@ -114,7 +114,7 @@ Below is a list of common resources that may run out of quota when using Azure s
114114

115115
Before deploying a model, you need to have enough compute quota. This quota defines how much virtual cores are available per subscription, per workspace, per SKU, and per region. Each deployment subtracts from available quota and adds it back after deletion, based on type of the SKU.
116116

117-
A possible mitigation is to check if there are unused deployments that can be deleted. Or you can submit a [request for a quota increase](./how-to-manage-quotas.md).
117+
A possible mitigation is to check if there are unused deployments that can be deleted. Or you can submit a [request for a quota increase](./how-to-manage-quotas.md#request-quota-increases).
118118

119119
#### Role assignment quota
120120

@@ -132,7 +132,7 @@ The requested CPU or memory couldn't be satisfied. Please adjust your request or
132132

133133
To run the `score.py` provided as part of the deployment, Azure creates a container that includes all the resources that the `score.py` needs, and runs the scoring script on that container.
134134

135-
If your container could not start, this means scoring could not happen. It could be that the container is requesting more resources than what `instance_type` could support. If so, consider updating the `instance_type` of the online deployment.
135+
If your container could not start, this means scoring could not happen. It might be that the container is requesting more resources than what `instance_type` can support. If so, consider updating the `instance_type` of the online deployment.
136136

137137
To get the exact reason for an error, run:
138138

@@ -146,7 +146,7 @@ The specified VM Size failed to provision due to a lack of Azure Machine Learnin
146146

147147
### ERROR: BadArgument
148148

149-
Below is a list of reasons you may run into this error:
149+
Below is a list of reasons you might run into this error:
150150

151151
* [Resource request was greater than limits](#resource-requests-greater-than-limits)
152152
* [Unable to download resources](#unable-to-download-resources)
@@ -167,10 +167,10 @@ To pull blobs, Azure uses [managed identities](../active-directory/managed-ident
167167

168168
- If you created the associated endpoint with UserAssigned, the user's managed identity must have Storage blob data reader permission on the workspace storage account.
169169

170-
During this process, you can run into a few different issues depending on which stage the operation failed at.
170+
During this process, you can run into a few different issues depending on which stage the operation failed at:
171171

172172
* [Unable to download user container image](#unable-to-download-user-container-image)
173-
* [Unable to download user model/code artifacts](#unable-to-download-user-modelcode-artifacts)
173+
* [Unable to download user model or code artifacts](#unable-to-download-user-model-or-code-artifacts)
174174

175175
To get more details about these errors, run:
176176

@@ -183,51 +183,54 @@ az ml online-deployment get-logs -n <endpoint-name> --deployment <deployment-nam
183183
It is possible that the user container could not be found.
184184

185185
Make sure container image is available in workspace ACR.
186-
- For example, if image is `testacr.azurecr.io/azureml/azureml_92a029f831ce58d2ed011c3c42d35acb:latest` check the repository with
186+
187+
For example, if image is `testacr.azurecr.io/azureml/azureml_92a029f831ce58d2ed011c3c42d35acb:latest` check the repository with
187188
`az acr repository show-tags -n testacr --repository azureml/azureml_92a029f831ce58d2ed011c3c42d35acb --orderby time_desc --output table`.
188189

189-
#### Unable to download user model/code artifacts
190+
#### Unable to download user model or code artifacts
190191

191-
It is possible that the user model/code artifacts could not be found.
192+
It is possible that the user model or code artifacts can't be found.
192193

193194
Make sure model and code artifacts are registered to the same workspace as the deployment. Use the `show` command to show details for a model or code artifact in a workspace.
195+
194196
- For example:
195197

196-
```azurecli
197-
az ml model show --name <model-name>
198-
az ml code show --name <code-name> --version <version>
199-
```
200-
You can also check if the blobs are present in the workspace storage account.
198+
```azurecli
199+
az ml model show --name <model-name>
200+
az ml code show --name <code-name> --version <version>
201+
```
202+
203+
You can also check if the blobs are present in the workspace storage account.
201204

202-
- For example, if the blob is `https://foobar.blob.core.windows.net/210212154504-1517266419/WebUpload/210212154504-1517266419/GaussianNB.pkl` you can use this command to check if it exists:
205+
- For example, if the blob is `https://foobar.blob.core.windows.net/210212154504-1517266419/WebUpload/210212154504-1517266419/GaussianNB.pkl`, you can use this command to check if it exists:
203206

204-
`az storage blob exists --account-name foobar --container-name 210212154504-1517266419 --name WebUpload/210212154504-1517266419/GaussianNB.pkl --subscription <sub-name>`
207+
`az storage blob exists --account-name foobar --container-name 210212154504-1517266419 --name WebUpload/210212154504-1517266419/GaussianNB.pkl --subscription <sub-name>`
205208

206209
### ERROR: ResourceNotReady
207210

208-
To run the `score.py` provided as part of the deployment, Azure creates a container that includes all the resources that the `score.py` needs, and runs the scoring script on that container. The error in this scenario is that this container is crashing when running, which means scoring couldn't happen. This error happens when:
211+
To run the `score.py` provided as part of the deployment, Azure creates a container that includes all the resources that the `score.py` needs, and runs the scoring script on that container. The error in this scenario is that this container is crashing when running, which means scoring can't happen. This error happens when:
209212

210213
- There's an error in `score.py`. Use `get-logs` to help diagnose common problems:
211-
- A package that was imported but is not in the conda environment
212-
- A syntax error
213-
- A failure in the `init()` method
214+
- A package that was imported but is not in the conda environment.
215+
- A syntax error.
216+
- A failure in the `init()` method.
214217
- If `get-logs` isn't producing any logs, it usually means that the container has failed to start. To debug this issue, try [deploying locally](https://github.com/MicrosoftDocs/azure-docs/blob/master/articles/machine-learning/how-to-troubleshoot-online-endpoints.md#deploy-locally) instead.
215218
- Readiness or liveness probes are not set up correctly.
216219
- There's an error in the environment setup of the container, such as a missing dependency.
217220

218221
### ERROR: ResourceNotFound
219222

220-
This error occurs when Azure Resource Manager (ARM) can't find a required resource. For example, you will receive this error if a storage account was referred to but cannot be found at the path on which it was specified. Be sure to double check resources which may have been supplied by exact path or the spelling of their names.
223+
This error occurs when Azure Resource Manager can't find a required resource. For example, you will receive this error if a storage account was referred to but cannot be found at the path on which it was specified. Be sure to double check resources which might have been supplied by exact path or the spelling of their names.
221224

222-
More details on this error can be found [here](https://aka.ms/ARMResourceNotFoundFix).
225+
For more information, see [Resolve resource not found errors](../azure-resource-manager/troubleshooting/error-not-found).
223226

224227
### ERROR: OperationCancelled
225228

226-
Azure operations have a certain priority level and are executed from highest to lowest. This error happens when your operation happened to be overridden by another operation which has a higher priority. Retrying the operation may allow it to be performed without cancellation.
229+
Azure operations have a certain priority level and are executed from highest to lowest. This error happens when your operation happened to be overridden by another operation that has a higher priority. Retrying the operation might allow it to be performed without cancellation.
227230

228231
### ERROR: InternalServerError
229232

230-
While we do our best to provide a stable and reliable service, sometimes things don't go according to plan. If you get this error, it means something isn't right on our side and we need to fix it. Submit a [customer support ticket](https://portal.azure.com/#blade/Microsoft_Azure_Support/HelpAndSupportBlade/newsupportrequest) with all related information and we'll address the issue.
233+
Although we do our best to provide a stable and reliable service, sometimes things don't go according to plan. If you get this error, it means that something isn't right on our side, and we need to fix it. Submit a [customer support ticket](https://portal.azure.com/#blade/Microsoft_Azure_Support/HelpAndSupportBlade/newsupportrequest) with all related information and we'll address the issue.
231234

232235
## Autoscaling issues
233236

0 commit comments

Comments
 (0)