You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/data-factory/data-factory-troubleshoot-guide.md
+16-16Lines changed: 16 additions & 16 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -107,7 +107,7 @@ For connector issues such as an encounter error using the copy activity, refer t
107
107
108
108
-**Message**: `The cluster is in Terminated state, not available to receive jobs. Please fix the cluster or retry later.`
109
109
110
-
-**Cause**: The cluster was terminated. For interactive clusters, this might be a race condition.
110
+
-**Cause**: The cluster was terminated. For interactive clusters, this issue might be a race condition.
111
111
112
112
-**Recommendation**: To avoid this error, use job clusters.
113
113
@@ -561,7 +561,7 @@ The following table applies to Azure Batch.
561
561
562
562
-**Message**: `The batch ID for Spark job is invalid. Please retry your job, and if the problem persists, contact the ADF support for further assistance.`
563
563
564
-
-**Cause**: There was an internal problem with the service that caused this.
564
+
-**Cause**: There was an internal problem with the service that caused this error.
565
565
566
566
-**Recommendation**: This could be a transient issue. Retry your job, and if the problem persists, contact the ADF support for further assistance.
567
567
@@ -571,17 +571,17 @@ The following table applies to Azure Batch.
571
571
572
572
-**Cause**: There was an internal error while trying to determine the region from the primary storage account.
573
573
574
-
-**Recommendation**: Try another storage. If this isn't an acceptable solution, contact ADF support team for further assistance.
574
+
-**Recommendation**: Try another storage. If this option isn't an acceptable solution, contact ADF support team for further assistance.
575
575
576
576
### Error code: 208
577
577
578
578
-**Message**: `Service Principal or the MSI authenticator are not instantiated. Please consider providing a Service Principal in the HDI on demand linked service which has permissions to create an HDInsight cluster in the provided subscription and try again. In case if this is not an acceptable solution, contact ADF support team for further assistance.`
579
579
580
580
-**Cause**: There was an internal error while trying to read the Service Principal or instantiating the MSI authentication.
581
581
582
-
-**Recommendation**: Consider providing a Service Principal which has permissions to create an HDInsight cluster in the provided subscription and try again. Verify that the [Manage Identities are set up correctly](https://docs.microsoft.com/azure/hdinsight/hdinsight-managed-identities).
582
+
-**Recommendation**: Consider providing a service principal, which has permissions to create an HDInsight cluster in the provided subscription and try again. Verify that the [Manage Identities are set up correctly](https://docs.microsoft.com/azure/hdinsight/hdinsight-managed-identities).
583
583
584
-
If this isn't an acceptable solution, contact ADF support team for further assistance.
584
+
If this option isn't an acceptable solution, contact ADF support team for further assistance.
585
585
586
586
### Error code: 2300
587
587
@@ -591,15 +591,15 @@ The following table applies to Azure Batch.
591
591
592
592
-**Recommendation**: Verify that the cluster hasn't been deleted, and that the provided URI is correct. When you open the URI in a browser, you should see the Ambari UI. If the cluster is in a virtual network, the URI should be the private URI. To open it, use a Virtual Machine (VM) that is part of the same virtual network.
593
593
594
-
For more information, see [this](https://docs.microsoft.com/azure/hdinsight/hdinsight-plan-virtual-network-deployment#directly-connect-to-apache-hadoop-services).
594
+
For more information, see [Directly connect to Apache Hadoop services](https://docs.microsoft.com/azure/hdinsight/hdinsight-plan-virtual-network-deployment#directly-connect-to-apache-hadoop-services).
595
595
596
596
</br>
597
597
598
598
-**Cause**: If the error message contains a message similar to `A task was canceled.`, the job submission timed out.
599
599
600
600
-**Recommendation**: The problem could be either general HDInsight connectivity or network connectivity. First confirm that the HDInsight Ambari UI is available from any browser. Then check that your credentials are still valid.
601
601
602
-
If you're using a self-hosted integrated runtime (IR), do this from the VM or machine where the self-hosted IR is installed. Then try submitting the job from Data Factory again. If it still fails, contact the Data Factory team for support.
602
+
If you're using a self-hosted integrated runtime (IR), perform this step from the VM or machine where the self-hosted IR is installed. Then try submitting the job from Data Factory again. If it still fails, contact the Data Factory team for support.
603
603
604
604
For more information, read [Ambari Web UI](https://docs.microsoft.com/azure/hdinsight/hdinsight-hadoop-manage-ambari#ambari-web-ui).
605
605
@@ -615,7 +615,7 @@ The following table applies to Azure Batch.
615
615
616
616
-**Cause**: When the error message contains a message similar to `502 - Web server received an invalid response while acting as a gateway or proxy server`, this error is returned by HDInsight service.
617
617
618
-
-**Recommendation**: For a 502 error, this often occurs when your Ambari Server process was shut down. You can restart the Ambari Services by rebooting the head node.
618
+
-**Recommendation**: A 502 error often occurs when your Ambari Server process was shut down. You can restart the Ambari Services by rebooting the head node.
619
619
620
620
1. Connect to one of your nodes on Hdinsight using SSH.
621
621
1. Identify your active head node host by running `ping headnodehost`.
@@ -661,12 +661,12 @@ The following table applies to Azure Batch.
661
661
1. Review your YARN memory. If your YARN memory is high, the processing of your jobs may be delayed. If you do not have enough resources to accommodate your Spark application/job, scale up the cluster to ensure the cluster has enough memory and cores.
662
662
1. Run a Sample test job.
663
663
1. If you run the same job on Hdinsight backend, check that it succeeded. For examples of sample runs, see [Run the MapReduce examples included in HDInsight](https://docs.microsoft.com/azure/hdinsight/hadoop/apache-hadoop-run-samples-linux)
664
-
1. If the job still failed on HDinsight, check the application logs and information which to provide to Support:
664
+
1. If the job still failed on HDinsight, check the application logs and information, which to provide to Support:
665
665
1. Check whether the job was submitted to YARN. If the job wasn't submitted to yarn, use `--master yarn`.
666
666
1. If the application finished execution, collect the start time and end time of the YARN Application. If the application didn't complete the execution, collect Start time/Launch time.
667
667
1. Check and collect application log with `yarn logs -applicationId <Insert_Your_Application_ID>`.
668
668
1. Check and collect the yarn resource manager logs under the `/var/log/hadoop-yarn/yarn` directory.
669
-
1. If these are not enough to resolve the issue, contact Azure HDInsight team for support and provide the above logs and timestamps.
669
+
1. If these steps are not enough to resolve the issue, contact Azure HDInsight team for support and provide the above logs and timestamps.
670
670
671
671
### Error code: 2303
672
672
@@ -683,18 +683,18 @@ The following table applies to Azure Batch.
683
683
1. Review your YARN memory. If your YARN memory is high, the processing of your jobs may be delayed. If you do not have enough resources to accommodate your Spark application/job, scale up the cluster to ensure the cluster has enough memory and cores.
684
684
1. Run a Sample test job.
685
685
1. If you run the same job on Hdinsight backend, check that it succeeded. For examples of sample runs, see [Run the MapReduce examples included in HDInsight](https://docs.microsoft.com/azure/hdinsight/hadoop/apache-hadoop-run-samples-linux)
686
-
1. If the job still failed on HDinsight, check the application logs and information which to provide to Support:
686
+
1. If the job still failed on HDinsight, check the application logs and information, which to provide to Support:
687
687
1. Check whether the job was submitted to YARN. If the job wasn't submitted to yarn, use `--master yarn`.
688
688
1. If the application finished execution, collect the start time and end time of the YARN Application. If the application didn't complete the execution, collect Start time/Launch time.
689
689
1. Check and collect application log with `yarn logs -applicationId <Insert_Your_Application_ID>`.
690
690
1. Check and collect the yarn resource manager logs under the `/var/log/hadoop-yarn/yarn` directory.
691
-
1. If these are not enough to resolve the issue, contact Azure HDInsight team for support and provide the above logs and timestamps.
691
+
1. If these steps are not enough to resolve the issue, contact Azure HDInsight team for support and provide the above logs and timestamps.
692
692
693
693
### Error code: 2304
694
694
695
695
-**Message**: `MSI authentication is not supported on storages for HDI activities.`
696
696
697
-
-**Cause**: The storage linked services used in the HDInsight (HDI) linked service or HDI activity, are configured with an MSI authentication which isn't supported.
697
+
-**Cause**: The storage linked services used in the HDInsight (HDI) linked service or HDI activity, are configured with an MSI authentication that isn't supported.
698
698
699
699
-**Recommendation**: Provide full connection strings for storage accounts used in the HDI linked service or HDI activity.
700
700
@@ -722,7 +722,7 @@ The following table applies to Azure Batch.
722
722
723
723
-**Cause**: ADF tried to create a batch on a Spark cluster using Livy API (livy/batch), but received an error.
724
724
725
-
-**Recommendation**: Follow the error message to fix the issue. If there isn't enough information to get it resolved, contact the HDI team and provide them the batch ID and job ID, which can be found in the activity run Output in ADF Monitoring page. To troubleshoot further please collect the full log of the batch job.
725
+
-**Recommendation**: Follow the error message to fix the issue. If there isn't enough information to get it resolved, contact the HDI team and provide them the batch ID and job ID, which can be found in the activity run Output in ADF Monitoring page. To troubleshoot further, collect the full log of the batch job.
726
726
727
727
For more information on how to collect the full log, see [Get the full log of a batch job](https://docs.microsoft.com/rest/api/hdinsightspark/hdinsight-spark-batch-job#get-the-full-log-of-a-batch-job).
728
728
@@ -802,7 +802,7 @@ The following table applies to Azure Batch.
802
802
Connect to the VM where the IR is installed and open the Ambari UI in a browser. Use the private URL for the cluster. This connection should work from the browser. If it doesn't, contact HDInsight support team for further assistance.
803
803
1. If self-hosted IR isn't being used, then the HDI cluster should be accessible publicly. Open the Ambari UI in a browser and check that it opens up. If there are any issues with the cluster or the services on it, contact HDInsight support team for assistance.
804
804
805
-
The HDI cluster URL used in ADF linked service must be accessible for ADF IR (self-hosted or Azure) in order for the test connection to pass, and for runs to work. This can be verified by opening the URL from a browser either from VM, or from any public machine.
805
+
The HDI cluster URL used in ADF linked service must be accessible for ADF IR (self-hosted or Azure) in order for the test connection to pass, and for runs to work. This state can be verified by opening the URL from a browser either from VM, or from any public machine.
806
806
807
807
### Error code: 2343
808
808
@@ -994,7 +994,7 @@ The following table applies to Azure Batch.
994
994
995
995
- **Message**: `No response from the endpoint. Possible causes: network connectivity, DNS failure, server certificate validation or timeout.`
996
996
997
-
- **Cause**: This is a Network connectivity issue, a DNS failure, a server certificate validation, or a timeout.
997
+
- **Cause**: This is a issue is due to either Network connectivity, a DNS failure, a server certificate validation, or a timeout.
998
998
999
999
- **Recommendation**: Validate that the endpoint you are trying to hit is responding to requests. You may use tools like **Fiddler/Postman**.
0 commit comments