Merge pull request #104025 from dagiro/freshness186

PRMerger15 · web-flow · commit 99c6abc7b0d7 · 2020-02-11T14:37:20.000-08:00
freshness186
diff --git a/articles/hdinsight/hdinsight-autoscale-clusters.md b/articles/hdinsight/hdinsight-autoscale-clusters.md
@@ -5,17 +5,17 @@ author: hrasheed-msft
 ms.author: hrasheed
 ms.reviewer: jasonh
 ms.service: hdinsight
-ms.custom: hdinsightactive
 ms.topic: conceptual
-ms.date: 10/22/2019
+ms.custom: hdinsightactive
+ms.date: 02/11/2020
 ---
 
 # Automatically scale Azure HDInsight clusters
 
 > [!Important]
-> The Autoscale feature only works for Spark, Hive, LLAP and HBase clusters created after May 8th 2019. 
+> The Autoscale feature only works for Apache Spark, ApacheHive, LLAP and Apache HBase clusters created after May 8th 2019.
 
-Azure HDInsight’s cluster Autoscale feature automatically scales the number of worker nodes in a cluster up and down. Other types of nodes in the cluster can't be scaled currently.  During the creation of a new HDInsight cluster, a minimum and maximum number of worker nodes can be set. Autoscale then monitors the resource requirements of the analytics load and scales the number of worker nodes up or down. There's no additional charge for this feature.
+Azure HDInsight's cluster Autoscale feature automatically scales the number of worker nodes in a cluster up and down. Other types of nodes in the cluster can't be scaled currently.  During the creation of a new HDInsight cluster, a minimum and maximum number of worker nodes can be set. Autoscale then monitors the resource requirements of the analytics load and scales the number of worker nodes up or down. There's no additional charge for this feature.
 
 ## Cluster compatibility
 
@@ -40,12 +40,14 @@ Schedule-based scaling changes the number of nodes in your cluster based on cond
 
 Autoscale continuously monitors the cluster and collects the following metrics:
 
-* **Total Pending CPU**: The total number of cores required to start execution of all pending containers.
-* **Total Pending Memory**: The total memory (in MB) required to start execution of all pending containers.
-* **Total Free CPU**: The sum of all unused cores on the active worker nodes.
-* **Total Free Memory**: The sum of unused memory (in MB) on the active worker nodes.
-* **Used Memory per Node**: The load on a worker node. A worker node on which 10 GB of memory is used, is considered under more load than a worker with 2 GB of used memory.
-* **Number of Application Masters per Node**: The number of Application Master (AM) containers running on a worker node. A worker node that is hosting two AM containers, is considered more important than a worker node that is hosting zero AM containers.
+|Metric|Description|
+|---|---|
+|Total Pending CPU|The total number of cores required to start execution of all pending containers.|
+|Total Pending Memory|The total memory (in MB) required to start execution of all pending containers.|
+|Total Free CPU|The sum of all unused cores on the active worker nodes.|
+|Total Free Memory|The sum of unused memory (in MB) on the active worker nodes.|
+|Used Memory per Node|The load on a worker node. A worker node on which 10 GB of memory is used, is considered under more load than a worker with 2 GB of used memory.|
+|Number of Application Masters per Node|The number of Application Master (AM) containers running on a worker node. A worker node that is hosting two AM containers, is considered more important than a worker node that is hosting zero AM containers.|
 
 The above metrics are checked every 60 seconds. Autoscale makes scale-up and scale-down decisions based on these metrics.
 
@@ -71,11 +73,9 @@ Based on the number of AM containers per node and the current CPU and memory req
 
 ### Create a cluster with load-based Autoscaling
 
-To use Autoscale on a cluster, the **Enable autoscale** option must be enabled when the cluster is created. 
-
-To enable the Autoscale feature with load-based scaling, complete the following steps as part of the normal cluster creation process:
+To use Autoscale on a cluster, the **Enable autoscale** option must be enabled when the cluster is created. To enable the Autoscale feature with load-based scaling, complete the following steps as part of the normal cluster creation process:
 
-1. On the **Configuration + pricing** tab, check the **Enable autoscale** checkbox.
+1. On the **Configuration + pricing** tab, select the **Enable autoscale** checkbox.
 1. Select **Load-based** under **Autoscale type**.
 1. Enter the desired values for the following properties:  
 
@@ -85,7 +85,7 @@ To enable the Autoscale feature with load-based scaling, complete the following
 
     ![Enable worker node load-based autoscale](./media/hdinsight-autoscale-clusters/azure-portal-cluster-configuration-pricing-autoscale.png)
 
-The initial number of worker nodes must fall between the minimum and maximum, inclusive. This value defines the initial size of the cluster when it's created. The minimum number of worker nodes should be set to three or more. . Scaling your cluster to fewer than three nodes can result in it getting stuck in safe mode because of insufficient file replication. See [Getting stuck in safe mode]( https://docs.microsoft.com/ azure/hdinsight/hdinsight-scaling-best-practices#getting-stuck-in-safe-mode) for more information.
+The initial number of worker nodes must fall between the minimum and maximum, inclusive. This value defines the initial size of the cluster when it's created. The minimum number of worker nodes should be set to three or more. Scaling your cluster to fewer than three nodes can result in it getting stuck in safe mode because of insufficient file replication.  For more information, see [Getting stuck in safe mode](./hdinsight-scaling-best-practices.md#getting-stuck-in-safe-mode).
 
 ### Create a cluster with schedule-based Autoscaling
 
@@ -94,7 +94,7 @@ To enable the Autoscale feature with schedule-based scaling, complete the follow
 1. On the **Configuration + pricing** tab, check the **Enable  autoscale** checkbox.
 1. Enter the **Number of nodes** for **Worker node**, which controls the limit for scaling up the cluster.
 1. Select the option **Schedule-based** under **Autoscale type**.
-1. Click **Configure** to open the **Autoscale configuration** window.
+1. Select **Configure** to open the **Autoscale configuration** window.
 1. Select your timezone and then click **+ Add condition**
 1. Select the days of the week that the new condition should apply to.
 1. Edit the time the condition should take effect and the number of nodes that the cluster should be scaled to.
@@ -185,7 +185,7 @@ You can create an HDInsight cluster with schedule-based Autoscaling an Azure Res
 
 #### Using the Azure portal
 
-To enable Autoscale on a running cluster, select **Cluster size** under **Settings**. Then click **Enable autoscale**. Select the type of Autoscale that you want and enter the options for load-based or schedule-based scaling. Finally, click **Save**.
+To enable Autoscale on a running cluster, select **Cluster size** under **Settings**. Then select **Enable autoscale**. Select the type of Autoscale that you want and enter the options for load-based or schedule-based scaling. Finally, select **Save**.
 
 ![Enable worker node schedule-based autoscale running cluster](./media/hdinsight-autoscale-clusters/azure-portal-settings-autoscale.png)
 
@@ -214,7 +214,7 @@ Consider the following factors before making a decision on which mode to choose:
 * Enable Autoscale during cluster creation.
 * The minimum number of nodes should be at least three.
 * Load variance: does the load of the cluster follow a consistent pattern at specific times, on specific days. If not, load based scheduling is a better option.
-* SLA requirements: Autoscale scaling is reactive instead of predictive. Will there be a sufficient delay between when the load starts to increase and when the cluster needs to be at its target size? If there are strict SLA requirements and the load is a fixed known pattern, ‘schedule based’ is a better option.
+* SLA requirements: Autoscale scaling is reactive instead of predictive. Will there be a sufficient delay between when the load starts to increase and when the cluster needs to be at its target size? If there are strict SLA requirements and the load is a fixed known pattern, 'schedule based' is a better option.
 
 ### Consider the latency of scale up or scale down operations
 
@@ -228,7 +228,7 @@ The running jobs will continue to run and finish. The pending jobs will wait to
 
 ### Minimum cluster size
 
-Do not scale your cluster down to fewer than three nodes. Scaling your cluster to fewer than three nodes can result in it getting stuck in safe mode because of insufficient file replication. See [Getting stuck in safe mode]( https://docs.microsoft.com/ azure/hdinsight/hdinsight-scaling-best-practices#getting-stuck-in-safe-mode) for more information.
+Don't scale your cluster down to fewer than three nodes. Scaling your cluster to fewer than three nodes can result in it getting stuck in safe mode because of insufficient file replication.  For more information, see [Getting stuck in safe mode](./hdinsight-scaling-best-practices.md#getting-stuck-in-safe-mode).
 
 ## Monitoring
 
@@ -240,24 +240,24 @@ The cluster status listed in the Azure portal can help you monitor Autoscale act
 
 All of the cluster status messages that you might see are explained in the list below.
 
-| Cluster status | Explanation |
+| Cluster status | Description |
 |---|---|
 | Running | The cluster is operating normally. All of the previous Autoscale activities have completed successfully. |
 | Updating  | The cluster Autoscale configuration is being updated.  |
 | HDInsight configuration  | A cluster scale up or scale down operation is in progress.  |
 | Updating Error  | HDInsight encountered issues during the Autoscale configuration update. Customers can choose to either retry the update or disable autoscale.  |
 | Error  | Something is wrong with the cluster, and it isn't usable. Delete this cluster and create a new one.  |
 
-To view the current number of nodes in your cluster, go to the **Cluster size** chart on the **Overview** page for your cluster, or click **Cluster size** under **Settings**.
+To view the current number of nodes in your cluster, go to the **Cluster size** chart on the **Overview** page for your cluster, or select **Cluster size** under **Settings**.
 
 ### Operation history
 
 You can view the cluster scale-up and scale-down history as part of the cluster metrics. You can also list all scaling actions over the past day, week, or other period of time.
 
-Select **Metrics** under **Monitoring**. Then click **Add metric** and **Number of Active Workers** from the **Metric** dropdown box. Click the button in the upper right to change the time range.
+Select **Metrics** under **Monitoring**. Then select **Add metric** and **Number of Active Workers** from the **Metric** dropdown box. Select the button in the upper right to change the time range.
 
 ![Enable worker node schedule-based autoscale metric](./media/hdinsight-autoscale-clusters/hdinsight-autoscale-clusters-chart-metric.png)
 
 ## Next steps
 
-* Read about best practices for scaling clusters manually in [Scaling best practices](hdinsight-scaling-best-practices.md)
+Read about best practices for scaling clusters manually in [Scaling best practices](hdinsight-scaling-best-practices.md)