You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-troubleshoot-online-endpoints.md
+7-15Lines changed: 7 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -198,13 +198,16 @@ The specified VM Size failed to provision due to a lack of Azure Machine Learnin
198
198
Below is a list of reasons you might run into this error:
199
199
200
200
*[Resource request was greater than limits](#resource-requests-greater-than-limits)
201
-
*[Unable to download resources](#unable-to-download-resources)
201
+
*[Startup task failed due to authorization error](#authorization-error)
202
+
*[Startup task failed due to incorrect role assignments on resource](#authorization-error)
203
+
*[Unable to download user container image](#unable-to-download-user-container-image)
204
+
*[Unable to download user model or code artifacts](#unable-to-download-user-model-or-code-artifacts)
202
205
203
206
#### Resource requests greater than limits
204
207
205
208
Requests for resources must be less than or equal to limits. If you don't set limits, we set default values when you attach your compute to an Azure Machine Learning workspace. You can check limits in the Azure portal or by using the `az ml compute show` command.
206
209
207
-
#### Unable to download resources
210
+
#### Authorization error
208
211
209
212
After provisioning the compute resource, during deployment creation, Azure tries to pull the user container image from the workspace private Azure Container Registry (ACR) and mount the user model and code artifacts into the user container from the workspace storage account.
- If you created the associated endpoint with UserAssigned, the user's managed identity must have Storage blob data reader permission on the workspace storage account.
218
221
219
-
During this process, you can run into a few different issues depending on which stage the operation failed at:
220
-
221
-
*[Unable to download user container image](#unable-to-download-user-container-image)
222
-
*[Unable to download user model or code artifacts](#unable-to-download-user-model-or-code-artifacts)
223
-
224
-
To get more details about these errors, run:
225
-
226
-
```azurecli
227
-
az ml online-deployment get-logs -n <endpoint-name> --deployment <deployment-name> --l 100
228
-
```
229
-
230
222
#### Unable to download user container image
231
223
232
-
It is possible that the user container could not be found.
224
+
It is possible that the user container could not be found. Check [container logs](#get-container-logs) to get more details.
233
225
234
226
Make sure container image is available in workspace ACR.
235
227
@@ -238,7 +230,7 @@ For example, if image is `testacr.azurecr.io/azureml/azureml_92a029f831ce58d2ed0
238
230
239
231
#### Unable to download user model or code artifacts
240
232
241
-
It is possible that the user model or code artifacts can't be found.
233
+
It is possible that the user model or code artifacts can't be found. Check [container logs](#get-container-logs) to get more details.
242
234
243
235
Make sure model and code artifacts are registered to the same workspace as the deployment. Use the `show` command to show details for a model or code artifact in a workspace.
0 commit comments