Merge pull request #87606 from dagiro/cats12

PRMerger20 · web-flow · commit 20185442a55b · 2019-09-10T12:56:10.000-07:00
cats12
diff --git a/articles/hdinsight/hadoop/apache-hadoop-debug-jobs.md b/articles/hdinsight/hadoop/apache-hadoop-debug-jobs.md
@@ -11,7 +11,7 @@ ms.date: 11/14/2017
 ms.author: ashishth
 ---
 
-# Analyze Apache Hadoop logs
+# Analyze Apache Hadoop logs in Azure HDInsight
 
 Each Apache Hadoop cluster in Azure HDInsight has an Azure storage account used as the default file system. The storage account is referred as the default Storage account. Cluster uses the Azure Table storage and the Blob storage on the default Storage account to store its logs.  To find out the default storage account for your cluster, see [Manage Apache Hadoop clusters in HDInsight](../hdinsight-administer-use-portal-linux.md#find-the-storage-accounts). The logs retain in the Storage account even after the cluster is deleted.
 
diff --git a/articles/hdinsight/hadoop/apache-hadoop-mahout-linux-mac.md b/articles/hdinsight/hadoop/apache-hadoop-mahout-linux-mac.md
@@ -10,7 +10,7 @@ ms.topic: conceptual
 ms.date: 04/24/2019
 ---
 
-# Generate movie recommendations by using Apache Mahout with Linux-based Apache Hadoop in HDInsight (SSH)
+# Generate movie recommendations using Apache Mahout with Apache Hadoop in HDInsight (SSH)
 
 [!INCLUDE [mahout-selector](../../../includes/hdinsight-selector-mahout.md)]
 
diff --git a/articles/hdinsight/hdinsight-linux-ambari-ssh-tunnel.md b/articles/hdinsight/hdinsight-linux-ambari-ssh-tunnel.md
@@ -11,7 +11,7 @@ ms.date: 05/28/2019
 ms.author: hrasheed
 ---
 
-# Use SSH Tunneling to access Apache Ambari web UI, JobHistory, NameNode, Apache Oozie, and other web UIs
+# Use SSH tunneling to access Apache Ambari web UI, JobHistory, NameNode, Apache Oozie, and other UIs
 
 HDInsight clusters provide access to the Apache Ambari web UI over the Internet, but some features require an SSH tunnel. For example, the web UI for the Apache Oozie service cannot be accessed over the internet without an SSh tunnel.
 
diff --git a/articles/hdinsight/hdinsight-scaling-best-practices.md b/articles/hdinsight/hdinsight-scaling-best-practices.md
@@ -9,7 +9,7 @@ ms.topic: conceptual
 ms.date: 06/10/2019
 ---
 
-# Scale HDInsight clusters
+# Scale Azure HDInsight clusters
 
 HDInsight provides elasticity by giving you the option to scale up and scale down the number of worker nodes in your clusters. This elasticity, allows you to shrink a cluster after hours or on weekends, and expand it during peak business demands.
 
diff --git a/articles/hdinsight/hdinsight-version-release.md b/articles/hdinsight/hdinsight-version-release.md
@@ -9,7 +9,7 @@ ms.topic: conceptual
 ms.date: 04/15/2019
 ---
 
-# HDInsight 4.0 overview
+# Azure HDInsight 4.0 overview
 
 Azure HDInsight is one of the most popular services among enterprise customers for open-source Apache Hadoop and Apache Spark analytics on Azure. HDInsight 4.0 is a cloud distribution of Apache Hadoop components. This article provides information about the most recent Azure HDInsight release and how to upgrade.
 
diff --git a/articles/hdinsight/spark/apache-spark-intellij-tool-plugin.md b/articles/hdinsight/spark/apache-spark-intellij-tool-plugin.md
@@ -10,7 +10,7 @@ ms.date: 09/04/2019
 ms.author: hrasheed
 ---
 
-# Tutorial: Use Azure Toolkit for IntelliJ to create Apache Spark applications for an HDInsight cluster
+# Tutorial: Use Azure Toolkit for IntelliJ to create Apache Spark applications for HDInsight cluster
 
 This tutorial demonstrates how to use the Azure Toolkit for IntelliJ plug-in to develop Apache Spark applications written in [Scala](https://www.scala-lang.org/), and then submit them to an HDInsight Spark cluster directly from the IntelliJ integrated development environment (IDE). You can use the plug-in in a few ways:
 
diff --git a/articles/hdinsight/spark/apache-spark-perf.md b/articles/hdinsight/spark/apache-spark-perf.md
@@ -10,7 +10,7 @@ ms.topic: conceptual
 ms.date: 04/03/2019
 ---
 
-# Optimize Apache Spark jobs
+# Optimize Apache Spark jobs in HDInsight
 
 Learn how to optimize [Apache Spark](https://spark.apache.org/) cluster configuration for your particular workload.  The most common challenge is memory pressure, due to improper configurations (particularly wrong-sized executors), long-running operations, and tasks that result in Cartesian operations. You can speed up jobs with appropriate caching, and by allowing for [data skew](#optimize-joins-and-shuffles). For the best performance, monitor and review long-running and resource-consuming Spark job executions.
 
diff --git a/articles/hdinsight/spark/apache-spark-troubleshoot-job-fails-noclassdeffounderror.md b/articles/hdinsight/spark/apache-spark-troubleshoot-job-fails-noclassdeffounderror.md
@@ -8,7 +8,7 @@ ms.author: hrasheed
 ms.date: 07/29/2019
 ---
 
-# Scenario: Apache Spark streaming job that reads data from an Apache Kafka cluster fails with a NoClassDefFoundError in Azure HDInsight
+# Apache Spark streaming job that reads Apache Kafka data fails with NoClassDefFoundError in HDInsight
 
 This article describes troubleshooting steps and possible resolutions for issues when using Apache Spark components in Azure HDInsight clusters.
 
diff --git a/articles/hdinsight/spark/apache-spark-troubleshoot-job-slowness-container.md b/articles/hdinsight/spark/apache-spark-troubleshoot-job-slowness-container.md
@@ -8,7 +8,7 @@ ms.author: hrasheed
 ms.date: 08/21/2019
 ---
 
-# Scenario: Apache Spark job run slowly when the Azure storage container contains many files in Azure HDInsight
+# Apache Spark job run slowly when the Azure storage container contains many files in Azure HDInsight
 
 This article describes troubleshooting steps and possible resolutions for issues when using Apache Spark components in Azure HDInsight clusters.
 
diff --git a/articles/hdinsight/spark/apache-spark-troubleshoot-sparkexception-kryo-serialization-failed.md b/articles/hdinsight/spark/apache-spark-troubleshoot-sparkexception-kryo-serialization-failed.md
@@ -8,7 +8,7 @@ ms.author: hrasheed
 ms.date: 07/29/2019
 ---
 
-# Scenario: Unable to download large data sets using JDBC/ODBC and Apache Thrift software framework in Azure HDInsight
+# Unable to download large data sets using JDBC/ODBC and Apache Thrift software framework in HDInsight
 
 This article describes troubleshooting steps and possible resolutions for issues when using Apache Spark components in Azure HDInsight clusters.