@@ -4,10 +4,11 @@ titleSuffix: Azure Machine Learning
4
4
description : Learn how to troubleshoot and resolve issues in your automated machine learning experiments.
5
5
author : PhaniShekhar
6
6
ms.author : phmantri
7
+ ms.reviewer : ssalgado
7
8
services : machine-learning
8
9
ms.service : machine-learning
9
10
ms.subservice : automl
10
- ms.date : 10 /21/2021
11
+ ms.date : 12 /21/2023
11
12
ms.topic : troubleshooting
12
13
ms.custom : automl, sdkv2
13
14
---
@@ -20,23 +21,23 @@ In this guide, learn how to identify and resolve issues in your automated machin
20
21
21
22
## Troubleshoot automated ML for Images and NLP in Studio
22
23
23
- In case of failures in runs for Automated ML for Images and NLP, you can use the following steps to understand the error.
24
- 1 . In the studio UI, the AutoML run should have a failure message indicating the reason for failure.
25
- 2 . For more details, go to the child run of this AutoML run . This child run is a HyperDrive run .
26
- 3 . In the " Trials" tab, you can check all the trials done for this HyperDrive run.
27
- 4 . Go to the failed trial runs .
28
- 5 . These runs should have an error message in the " Status" section of the " Overview" tab indicating the reason for failure.
29
- Please click on " See more details" to get more details about the failure.
30
- 6 . You can look at " std_log.txt" in the " Outputs + Logs" tab to look at detailed logs and exception traces.
31
-
32
- If your Automated ML runs uses pipeline runs for trials, you can follow the following steps to understand the error.
33
- 1 . Please follow the steps 1-4 above to identify the failed trial run .
24
+ If there is a job failure for Automated ML for Images and NLP, you can use the following steps to understand the error.
25
+ 1 . In the studio UI, the AutoML job should have a failure message indicating the reason for failure.
26
+ 2 . For more details, go to the child job of this AutoML job . This child run is a HyperDrive job .
27
+ 3 . In the ** Trials** tab, you can check all the trials done for this HyperDrive run.
28
+ 4 . Go to the failed trial job .
29
+ 5 . These jobs should have an error message in the ** Status** section of the ** Overview** tab indicating the reason for failure.
30
+ Select ** See more details** to get more details about the failure.
31
+ 6 . Additionally you can view ** std_log.txt** in the ** Outputs + Logs** tab to look at detailed logs and exception traces.
32
+
33
+ If your Automated ML runs uses pipeline runs for trials, follow these steps to understand the error.
34
+ 1 . Follow the steps 1-4 above to identify the failed trial job .
34
35
2 . This run should show you the pipeline run and the failed nodes in the pipeline are marked with Red color.
35
- :::image type="content" source="./media/how-to-troubleshoot-auto-ml/pipeline-graph-sample.jpg" alt-text="Diagram that shows a failed pipeline run ." lightbox="./media/how-to-troubleshoot-auto-ml/pipeline-graph-sample.jpg":::
36
- 3 . Double click the failed node in the pipeline.
37
- 4 . These runs should have an error message in the " Status" section of the " Overview" tab indicating the reason for failure.
38
- Please click on " See more details" to get more details about the failure.
39
- 5 . You can look at " std_log.txt" in the " Outputs + Logs" tab to look at detailed logs and exception traces.
36
+ :::image type="content" source="./media/how-to-troubleshoot-auto-ml/pipeline-graph-sample.jpg" alt-text="Diagram that shows a failed pipeline job ." lightbox="./media/how-to-troubleshoot-auto-ml/pipeline-graph-sample.jpg":::
37
+ 3 . Select the failed node in the pipeline.
38
+ 4 . These jobs should have an error message in the ** Status** section of the ** Overview** tab indicating the reason for failure.
39
+ Select ** See more details** to get more details about the failure.
40
+ 5 . You can look at ** std_log.txt** in the ** Outputs + Logs** tab to look at detailed logs and exception traces.
40
41
41
42
## Next steps
42
43
0 commit comments