You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/hdinsight/hdinsight-troubleshoot-failed-cluster.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -76,7 +76,7 @@ Each HDInsight cluster relies on various Azure services, and on open-source soft
76
76
Apache Ambari provides management and monitoring of a HDInsight cluster with a web UI and a REST API.
77
77
Ambari is included on Linux-based HDInsight clusters. Select the **Cluster Dashboard** pane on the Azure portal HDInsight page. Select the **HDInsight cluster dashboard** pane to open the Ambari UI, and enter the cluster login credentials.
To open a list of service views, select **Ambari Views** on the Azure portal page. This list depends on which libraries are installed. For example, you may see YARN Queue Manager, Hive View, and Tez View. Select a service link to see configuration and service information.
Ambari displays an alert showing the hosts on which the WebHCat service is down. You can try to bring the WebHCat service back up by restarting the service on its host.
If a WebHCat server still does not come up, then check the operations log for failure messages. For more detailed information, check the `stderr` and `stdout` files referenced on the node.
130
130
@@ -173,7 +173,7 @@ At the YARN level, there are two types of timeouts:
173
173
174
174
The following image shows the joblauncher queue at 714.4% overused. This is acceptable so long as there is still free capacity in the default queue to borrow from. However, when the cluster is fully utilized and the YARN memory is at 100% capacity, new jobs must wait, which eventually causes timeouts.
There are two ways to resolve this issue: either reduce the speed of new jobs being submitted, or increase the consumption speed of old jobs by scaling up the cluster.
179
179
@@ -205,7 +205,7 @@ To diagnose these issues:
205
205
206
206
The Ambari UI **Stack and Version** page provides information about cluster services configuration and service version history. Incorrect Hadoop service library versions can be a cause of cluster failure. In the Ambari UI, select the **Admin** menu and then **Stacks and Versions**. Select the **Versions** tab on the page to see service version information:
207
207
208
-

208
+

0 commit comments