Skip to content

Commit b36ec2b

Browse files
Merge pull request #223928 from edebar01/patch-1
last of build failure updates to how-to-troubleshoot-environments.md
2 parents e784ca6 + 996386c commit b36ec2b

File tree

1 file changed

+98
-11
lines changed

1 file changed

+98
-11
lines changed

articles/machine-learning/how-to-troubleshoot-environments.md

Lines changed: 98 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -954,23 +954,110 @@ Many issues could cause a horovod failure, and there's a comprehensive list of t
954954
* [horovod installation](https://aka.ms/azureml/environment/install-horovod)
955955

956956
### Conda command not found
957-
- Failed to create or update the conda environment because the conda command is missing
958-
- For system-managed environments, conda should be in the path in order to create the user's environment
959-
from the provided conda specification
957+
<!--issueDescription-->
958+
This issue can happen when the conda command isn't recognized during conda environment creation or update.
959+
960+
**Potential causes:**
961+
* conda isn't installed in the base image you're using
962+
* conda isn't installed via your Dockerfile before you try to execute the conda command
963+
* conda isn't included in or wasn't added to your path
964+
965+
**Affected areas (symptoms):**
966+
* Failure in building environments from UI, SDK, and CLI.
967+
* Failure in running jobs because it will implicitly build the environment in the first step.
968+
<!--/issueDescription-->
969+
970+
**Troubleshooting steps**
971+
972+
Ensure that you have a conda installation step in your Dockerfile before trying to execute any conda commands
973+
* Review this [list of conda installers](https://docs.conda.io/en/latest/miniconda.html) to determine what you need for your scenario
974+
975+
If you've tried installing conda and are experiencing this issue, ensure that you've added conda to your path
976+
* Review this [example](https://stackoverflow.com/questions/58269375/how-to-install-packages-with-miniconda-in-dockerfile) for guidance
977+
* Review how to set [environment variables in a Dockerfile](https://docs.docker.com/engine/reference/builder/#env)
978+
979+
**Resources**
980+
* All available conda distributions are found in the [conda repository](https://repo.anaconda.com/miniconda/)
960981

961982
### Incompatible Python version
962-
- Failed to create or update the conda environment because a package specified in the conda environment isn't compatible with the specified python version
963-
- Update the Python version or use a different version of the package
983+
<!--issueDescription-->
984+
This issue can happen when there's a package specified in your conda environment that isn't compatible with your specified Python version.
985+
986+
**Affected areas (symptoms):**
987+
* Failure in building environments from UI, SDK, and CLI.
988+
* Failure in running jobs because it will implicitly build the environment in the first step.
989+
<!--/issueDescription-->
990+
991+
**Troubleshooting steps**
992+
993+
Use a different version of the package that's compatible with your specified Python version
994+
995+
Alternatively, use a different version of Python that's compatible with the package you've specified
996+
* If you're changing your Python version, use a version that's supported and that isn't nearing its end-of-life soon
997+
* See Python [end-of-life dates](https://aka.ms/azureml/environment/python-end-of-life)
998+
999+
**Resources**
1000+
* [Python documentation by version](https://aka.ms/azureml/environment/python-versions)
9641001

9651002
### Conda bare redirection
966-
- Failed to create or update the conda environment because a package was specified on the command line using ">" or "<"
967-
without using quotes. Consider adding quotes around the package specification
1003+
<!--issueDescription-->
1004+
This issue can happen when a package is specified on the command line using "<" or ">" without using quotes, causing conda environment creation or update to fail.
1005+
1006+
**Affected areas (symptoms):**
1007+
* Failure in building environments from UI, SDK, and CLI.
1008+
* Failure in running jobs because it will implicitly build the environment in the first step.
1009+
<!--/issueDescription-->
1010+
1011+
**Troubleshooting steps**
1012+
1013+
Add quotes around the package specification
1014+
* For example, change `conda install -y pip<=20.1.1` to `conda install -y "pip<=20.1.1"`
9681015

9691016
### *Pip issues during build*
9701017
### Failed to install packages
971-
- Failed to install Python packages
972-
- Review the image build log for more information on this error
1018+
<!--issueDescription-->
1019+
This issue can happen when your image build fails during Python package installation.
1020+
1021+
**Potential causes:**
1022+
* There are many issues that could cause this error
1023+
* This is a generic message that's surfaced when the error you're encountering isn't yet covered by AzureML analysis
1024+
1025+
**Affected areas (symptoms):**
1026+
* Failure in building environments from UI, SDK, and CLI.
1027+
* Failure in running jobs because it will implicitly build the environment in the first step.
1028+
<!--/issueDescription-->
1029+
1030+
**Troubleshooting steps**
1031+
1032+
Review your Build log for more information on your image build failure
1033+
1034+
Leave feedback for the AzureML team to analyze the error you're experiencing
1035+
* [File a problem or suggestion](https://github.com/Azure/azureml-assets/issues/new?assignees=&labels=environmentLogs&template=environmentLogsFeedback.yml)
9731036

9741037
### Can't uninstall package
975-
- Pip failed to uninstall a Python package that was installed via the OS's package manager
976-
- Consider creating a separate environment using conda instead
1038+
<!--issueDescription-->
1039+
This can happen when pip fails to uninstall a Python package that was installed via the operating system's package manager.
1040+
1041+
**Potential causes:**
1042+
* An existing pip problem or a problematic pip version
1043+
* An issue arising from not using an isolated environment
1044+
1045+
**Affected areas (symptoms):**
1046+
* Failure in building environments from UI, SDK, and CLI.
1047+
* Failure in running jobs because it will implicitly build the environment in the first step.
1048+
<!--/issueDescription-->
1049+
1050+
**Troubleshooting steps**
1051+
1052+
Read the following and determine if your failure is caused by an existing pip problem
1053+
* [Cannot uninstall while creating Docker image](https://stackoverflow.com/questions/63383400/error-cannot-uninstall-ruamel-yaml-while-creating-docker-image-for-azure-ml-a)
1054+
* [pip 10 disutils partial uninstall issue](https://github.com/pypa/pip/issues/5247)
1055+
* [pip 10 no longer uninstalls disutils packages](https://github.com/pypa/pip/issues/4805)
1056+
1057+
Try the following
1058+
1059+
```
1060+
pip install --ignore-installed [package]
1061+
```
1062+
1063+
Try creating a separate environment using conda

0 commit comments

Comments
 (0)