Commit dc7528d

Merge pull request #57326 from hrasheed-msft/hrasheed_hdihack1
updating for clarity, acrolinx score
2 parents 704ad6b + dfed37f commit dc7528d

1 file changed, +15 -15 lines changed

articles/hdinsight/hdinsight-key-scenarios-to-monitor.md

Lines changed: 15 additions & 15 deletions
@@ -8,20 +8,20 @@ ms.reviewer: jasonh
 ms.service: hdinsight
 ms.custom: hdinsightactive
 ms.topic: conceptual
-ms.date: 09/27/2017
-ms.author: maxluk
+ms.date: 11/06/2018
+ms.author: arindamc
 ---
 # Monitor cluster performance

-Monitoring the health and performance of an HDInsight cluster is essential for maintaining maximum performance and resource utilization. Monitoring can also help you address possible coding or cluster configuration errors.
+Monitoring the health and performance of an HDInsight cluster is essential for maintaining optimal performance and resource utilization. Monitoring can also help you detect and address cluster configuration errors and user code issues.

-The following sections describe how to optimize cluster loading, YARN queue efficiency, and storage accessibility.
+The following sections describe how to monitor and optimize the load on your clusters, YARN queues and detect storage throttling issues.

-## Cluster loading
+## Monitor cluster load

-Hadoop clusters should balance loading across the nodes of the cluster. This balancing prevents processing tasks from being constrained by RAM, CPU, or disk resources.
+Hadoop clusters can deliver the most optimal performance when the load on cluster is evenly distributed across all the nodes. This enables the processing tasks to run without being constrained by RAM, CPU, or disk resources on individual nodes.

-To get a high-level look at the nodes of your cluster and their loading, log in to the [Ambari Web UI](hdinsight-hadoop-manage-ambari.md), then select the **Hosts** tab. Your hosts are listed by their fully qualified domain names. Each host's operating status is shown by a colored health indicator:
+To get a high-level look at the nodes of your cluster and their loading, sign in to the [Ambari Web UI](hdinsight-hadoop-manage-ambari.md), then select the **Hosts** tab. Your hosts are listed by their fully qualified domain names. Each host's operating status is shown by a colored health indicator:

 | Color | Description |
 | --- | --- |
@@ -42,29 +42,29 @@ See [Manage HDInsight clusters by using the Ambari Web UI](hdinsight-hadoop-mana

 ## YARN queue configuration

-Hadoop has various services running across its distributed platform. YARN (Yet Another Resource Negotiator) coordinates these services, allocates cluster resources, and manages access to a common data set.
+Hadoop has various services running across its distributed platform. YARN (Yet Another Resource Negotiator) coordinates these services and allocates cluster resources to ensure that any load is evenly distributed across the cluster.

-YARN divides the two responsibilities of the JobTracker, resource management and job scheduling/monitoring, into two daemons: a global ResourceManager, and a per-application ApplicationMaster (AM).
+YARN divides the two responsibilities of the JobTracker, resource management and job scheduling/monitoring, into two daemons: a global Resource Manager, and a per-application ApplicationMaster (AM).

-The ResourceManager is a *pure scheduler*, and solely arbitrates available resources between all competing applications. The ResourceManager ensures that all resources are always in use, optimizing for various constants such as SLAs, capacity guarantees, and so forth. The ApplicationMaster negotiates resources from the ResourceManager, and works with the NodeManager(s) to execute and monitor the containers and their resource consumption.
+The Resource Manager is a *pure scheduler*, and solely arbitrates available resources between all competing applications. The Resource Manager ensures that all resources are always in use, optimizing for various constants such as SLAs, capacity guarantees, and so forth. The ApplicationMaster negotiates resources from the Resource Manager, and works with the NodeManager(s) to execute and monitor the containers and their resource consumption.

 When multiple tenants share a large cluster, there is competition for the cluster's resources. The CapacityScheduler is a pluggable scheduler that assists in resource sharing by queueing up requests. The CapacityScheduler also supports *hierarchical queues* to ensure that resources are shared between the sub-queues of an organization, before other applications' queues are allowed to use free resources.

-YARN allows us to allocate resources to these queues, and shows you whether all of your available resources are assigned. To view information about your queues, log in to the Ambari Web UI, then select **YARN Queue Manager** from the top menu.
+YARN allows us to allocate resources to these queues, and shows you whether all of your available resources are assigned. To view information about your queues, sign in to the Ambari Web UI, then select **YARN Queue Manager** from the top menu.

 ![YARN Queue Manager](./media/hdinsight-key-scenarios-to-monitor/yarn-queue-manager.png)

 The YARN Queue Manager page shows a list of your queues on the left, along with the percentage of capacity assigned to each.

 ![YARN Queue Manager details page](./media/hdinsight-key-scenarios-to-monitor/yarn-queue-manager-details.png)

-For a more detailed look at your queues, from the Ambari dashboard, select the **YARN** service from the list on the left. Then under the **Quick Links** dropdown menu, select **ResourceManager UI** underneath your active node.
+For a more detailed look at your queues, from the Ambari dashboard, select the **YARN** service from the list on the left. Then under the **Quick Links** dropdown menu, select **Resource Manager UI** underneath your active node.

-![ResourceManager UI menu link](./media/hdinsight-key-scenarios-to-monitor/resource-manager-ui-menu.png)
+![Resource Manager UI menu link](./media/hdinsight-key-scenarios-to-monitor/resource-manager-ui-menu.png)

-In the ResourceManager UI, select **Scheduler** from the left-hand menu. You see a list of your queues underneath *Application Queues*. Here you can see the capacity used for each of your queues, how well the jobs are distributed between them, and whether any jobs are resource-constrained.
+In the Resource Manager UI, select **Scheduler** from the left-hand menu. You see a list of your queues underneath *Application Queues*. Here you can see the capacity used for each of your queues, how well the jobs are distributed between them, and whether any jobs are resource-constrained.

-![ResourceManager UI menu link](./media/hdinsight-key-scenarios-to-monitor/resource-manager-ui.png)
+![Resource Manager UI menu link](./media/hdinsight-key-scenarios-to-monitor/resource-manager-ui.png)

 ## Storage throttling

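Note on the YARN queue section of this diff: the per-queue capacity figures that the article reads from the Scheduler page can also be inspected programmatically. The following is a minimal sketch and is not part of this commit. It assumes the JSON layout returned by the YARN ResourceManager REST endpoint `/ws/v1/cluster/scheduler` for the CapacityScheduler (`scheduler.schedulerInfo`, with child queues under `queues.queue`). The helper names `iter_queues` and `busy_queues`, the 90% threshold, and the sample payload are all hypothetical and chosen only for illustration.

```python
from typing import Any, Dict, Iterator, List, Tuple


def iter_queues(node: Dict[str, Any], prefix: str = "") -> Iterator[Tuple[str, float, float]]:
    """Yield (queue path, capacity %, usedCapacity %) for a queue and all of its sub-queues."""
    name = node.get("queueName", "root")
    path = f"{prefix}.{name}" if prefix else name
    yield path, float(node.get("capacity", 0.0)), float(node.get("usedCapacity", 0.0))
    # Parent queues nest their children under "queues" -> "queue"; leaf queues have no such key.
    for child in (node.get("queues") or {}).get("queue", []):
        yield from iter_queues(child, path)


def busy_queues(scheduler_json: Dict[str, Any], threshold: float = 90.0) -> List[str]:
    """Return the paths of queues whose usedCapacity is at or above the threshold."""
    info = scheduler_json["scheduler"]["schedulerInfo"]
    return [path for path, _capacity, used in iter_queues(info) if used >= threshold]


# Hypothetical payload in the assumed layout; the queue names and numbers are made up.
SAMPLE = {
    "scheduler": {
        "schedulerInfo": {
            "type": "capacityScheduler",
            "queueName": "root",
            "capacity": 100.0,
            "usedCapacity": 55.0,
            "queues": {
                "queue": [
                    {"queueName": "default", "capacity": 50.0, "usedCapacity": 20.0},
                    {"queueName": "thriftsvr", "capacity": 50.0, "usedCapacity": 95.0},
                ]
            },
        }
    }
}

if __name__ == "__main__":
    for path, capacity, used in iter_queues(SAMPLE["scheduler"]["schedulerInfo"]):
        print(f"{path}: capacity={capacity}% used={used}%")
    print("Queues at or above 90% used:", busy_queues(SAMPLE))
```

The sketch works on an already-parsed response so that it stays independent of how the cluster endpoint is reached and authenticated, which varies by deployment.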