Skip to content

Commit 21f4fd8

Browse files
Merge pull request #223420 from edebar01/edebar01-update-troubleshooting-doc
More updates to how-to-troubleshoot-environments.md
2 parents 9c2571c + df11c44 commit 21f4fd8

File tree

1 file changed

+77
-6
lines changed

1 file changed

+77
-6
lines changed

articles/machine-learning/how-to-troubleshoot-environments.md

Lines changed: 77 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -871,16 +871,87 @@ Ensure that you're using a compatible python version
871871
* [mpi4py installation](https://aka.ms/azureml/environment/install-mpi4py)
872872

873873
### Interactive auth was attempted
874-
- Failed to create or update the conda environment because pip attempted interactive authentication
875-
- Instead, provide authentication via [workspace connection](https://aka.ms/azureml/environment/set-connection-v1)
874+
<!--issueDescription-->
875+
This issue can happen when pip attempts interactive authentication during package installation.
876+
877+
**Potential causes:**
878+
* You've listed a package that requires authentication, but you haven't provided credentials
879+
* During the image build, pip tried to prompt you to authenticate which failed the build
880+
because you can't provide interactive authentication during a build
881+
882+
**Affected areas (symptoms):**
883+
* Failure in building environments from UI, SDK, and CLI.
884+
* Failure in running jobs because it will implicitly build the environment in the first step.
885+
<!--/issueDescription-->
886+
887+
**Troubleshooting steps**
888+
889+
Provide authentication via workspace connections
890+
891+
*Applies to: Python SDK azureml V1*
892+
893+
```
894+
from azureml.core import Workspace
895+
ws = Workspace.from_config()
896+
ws.set_connection("connection1", "PythonFeed", "<URL>", "Basic", "{'Username': '<username>', 'Password': '<password>'}")
897+
```
898+
899+
*Applies to: Azure CLI extensions V1 & V2*
900+
901+
Create a workspace connection from a YAML specification file
902+
903+
```
904+
az ml connection create --file connection.yml --resource-group my-resource-group --workspace-name my-workspace
905+
```
906+
907+
**Resources**
908+
* [Python SDK AzureML v1 workspace connections](https://aka.ms/azureml/environment/set-connection-v1)
909+
* [Python SDK AzureML v2 workspace connections](/python/api/azure-ai-ml/azure.ai.ml.entities.workspaceconnection)
910+
* [Azure CLI workspace connections](/cli/azure/ml/connection)
876911

877912
### Forbidden blob
878-
- Failed to create or update the conda environment because a blob contained in the associated storage account was inaccessible
879-
- Either open up permissions on the blob or add/replace the SAS token in the URL
913+
<!--issueDescription-->
914+
This issue can happen when an attempt to access a blob in a storage account is rejected.
915+
916+
**Potential causes:**
917+
* The authorization method you're using to access the storage account is invalid
918+
* You're attempting to authorize via shared access signature (SAS), but the SAS token is expired or invalid
919+
920+
**Affected areas (symptoms):**
921+
* Failure in building environments from UI, SDK, and CLI.
922+
* Failure in running jobs because it will implicitly build the environment in the first step.
923+
<!--/issueDescription-->
924+
925+
**Troubleshooting steps**
926+
927+
Read the following to understand [how to authorize access to blob data in the Azure portal](../storage/blobs/authorize-data-operations-portal.md)
928+
929+
Read the following to understand [how to authorize access to data in Azure storage](../storage/common/authorize-data-access.md)
930+
931+
Read the following if you're interested in [using SAS to access Azure storage resources](../storage/common/storage-sas-overview.md)
880932

881933
### Horovod build
882-
- Failed to create or update the conda environment because horovod failed to build
883-
- See [horovod installation](https://aka.ms/azureml/environment/install-horovod)
934+
<!--issueDescription-->
935+
This issue can happen when the conda environment fails to be created or updated because horovod failed to build.
936+
937+
**Potential causes:**
938+
* Horovod installation requires other modules that you haven't installed
939+
* Horovod installation requires certain libraries that you haven't included
940+
941+
**Affected areas (symptoms):**
942+
* Failure in building environments from UI, SDK, and CLI.
943+
* Failure in running jobs because it will implicitly build the environment in the first step.
944+
<!--/issueDescription-->
945+
946+
**Troubleshooting steps**
947+
948+
Many issues could cause a horovod failure, and there's a comprehensive list of them in horovod's documentation
949+
* Review the [horovod troubleshooting guide](https://horovod.readthedocs.io/en/stable/troubleshooting_include.html#)
950+
* Review your Build log to see if there's an error message that surfaced when horovod failed to build
951+
* It's possible that the problem you're encountering is detailed in the horovod troubleshooting guide, along with a solution
952+
953+
**Resources**
954+
* [horovod installation](https://aka.ms/azureml/environment/install-horovod)
884955

885956
### Conda command not found
886957
- Failed to create or update the conda environment because the conda command is missing

0 commit comments

Comments
 (0)