Commit 2018544

Merge pull request #87606 from dagiro/cats12
cats12
2 parents 0fd1afd + 7bf2b06 commit 2018544

10 files changed, +10 -10 lines changed

articles/hdinsight/hadoop/apache-hadoop-debug-jobs.md

Lines changed: 1 addition & 1 deletion
@@ -11,7 +11,7 @@ ms.date: 11/14/2017
 ms.author: ashishth
 ---

-# Analyze Apache Hadoop logs
+# Analyze Apache Hadoop logs in Azure HDInsight

 Each Apache Hadoop cluster in Azure HDInsight has an Azure storage account used as the default file system. The storage account is referred as the default Storage account. Cluster uses the Azure Table storage and the Blob storage on the default Storage account to store its logs. To find out the default storage account for your cluster, see [Manage Apache Hadoop clusters in HDInsight](../hdinsight-administer-use-portal-linux.md#find-the-storage-accounts). The logs retain in the Storage account even after the cluster is deleted.

articles/hdinsight/hadoop/apache-hadoop-mahout-linux-mac.md

Lines changed: 1 addition & 1 deletion
@@ -10,7 +10,7 @@ ms.topic: conceptual
 ms.date: 04/24/2019
 ---

-# Generate movie recommendations by using Apache Mahout with Linux-based Apache Hadoop in HDInsight (SSH)
+# Generate movie recommendations using Apache Mahout with Apache Hadoop in HDInsight (SSH)

 [!INCLUDE [mahout-selector](../../../includes/hdinsight-selector-mahout.md)]

articles/hdinsight/hdinsight-linux-ambari-ssh-tunnel.md

Lines changed: 1 addition & 1 deletion
@@ -11,7 +11,7 @@ ms.date: 05/28/2019
 ms.author: hrasheed
 ---

-# Use SSH Tunneling to access Apache Ambari web UI, JobHistory, NameNode, Apache Oozie, and other web UIs
+# Use SSH tunneling to access Apache Ambari web UI, JobHistory, NameNode, Apache Oozie, and other UIs

 HDInsight clusters provide access to the Apache Ambari web UI over the Internet, but some features require an SSH tunnel. For example, the web UI for the Apache Oozie service cannot be accessed over the internet without an SSh tunnel.

articles/hdinsight/hdinsight-scaling-best-practices.md

Lines changed: 1 addition & 1 deletion
@@ -9,7 +9,7 @@ ms.topic: conceptual
 ms.date: 06/10/2019
 ---

-# Scale HDInsight clusters
+# Scale Azure HDInsight clusters

 HDInsight provides elasticity by giving you the option to scale up and scale down the number of worker nodes in your clusters. This elasticity, allows you to shrink a cluster after hours or on weekends, and expand it during peak business demands.

articles/hdinsight/hdinsight-version-release.md

Lines changed: 1 addition & 1 deletion
@@ -9,7 +9,7 @@ ms.topic: conceptual
 ms.date: 04/15/2019
 ---

-# HDInsight 4.0 overview
+# Azure HDInsight 4.0 overview

 Azure HDInsight is one of the most popular services among enterprise customers for open-source Apache Hadoop and Apache Spark analytics on Azure. HDInsight 4.0 is a cloud distribution of Apache Hadoop components. This article provides information about the most recent Azure HDInsight release and how to upgrade.

articles/hdinsight/spark/apache-spark-intellij-tool-plugin.md

Lines changed: 1 addition & 1 deletion
@@ -10,7 +10,7 @@ ms.date: 09/04/2019
 ms.author: hrasheed
 ---

-# Tutorial: Use Azure Toolkit for IntelliJ to create Apache Spark applications for an HDInsight cluster
+# Tutorial: Use Azure Toolkit for IntelliJ to create Apache Spark applications for HDInsight cluster

 This tutorial demonstrates how to use the Azure Toolkit for IntelliJ plug-in to develop Apache Spark applications written in [Scala](https://www.scala-lang.org/), and then submit them to an HDInsight Spark cluster directly from the IntelliJ integrated development environment (IDE). You can use the plug-in in a few ways:

articles/hdinsight/spark/apache-spark-perf.md

Lines changed: 1 addition & 1 deletion
@@ -10,7 +10,7 @@ ms.topic: conceptual
 ms.date: 04/03/2019
 ---

-# Optimize Apache Spark jobs
+# Optimize Apache Spark jobs in HDInsight

 Learn how to optimize [Apache Spark](https://spark.apache.org/) cluster configuration for your particular workload. The most common challenge is memory pressure, due to improper configurations (particularly wrong-sized executors), long-running operations, and tasks that result in Cartesian operations. You can speed up jobs with appropriate caching, and by allowing for [data skew](#optimize-joins-and-shuffles). For the best performance, monitor and review long-running and resource-consuming Spark job executions.

articles/hdinsight/spark/apache-spark-troubleshoot-job-fails-noclassdeffounderror.md

Lines changed: 1 addition & 1 deletion
@@ -8,7 +8,7 @@ ms.author: hrasheed
 ms.date: 07/29/2019
 ---

-# Scenario: Apache Spark streaming job that reads data from an Apache Kafka cluster fails with a NoClassDefFoundError in Azure HDInsight
+# Apache Spark streaming job that reads Apache Kafka data fails with NoClassDefFoundError in HDInsight

 This article describes troubleshooting steps and possible resolutions for issues when using Apache Spark components in Azure HDInsight clusters.

articles/hdinsight/spark/apache-spark-troubleshoot-job-slowness-container.md

Lines changed: 1 addition & 1 deletion
@@ -8,7 +8,7 @@ ms.author: hrasheed
 ms.date: 08/21/2019
 ---

-# Scenario: Apache Spark job run slowly when the Azure storage container contains many files in Azure HDInsight
+# Apache Spark job run slowly when the Azure storage container contains many files in Azure HDInsight

 This article describes troubleshooting steps and possible resolutions for issues when using Apache Spark components in Azure HDInsight clusters.

articles/hdinsight/spark/apache-spark-troubleshoot-sparkexception-kryo-serialization-failed.md

Lines changed: 1 addition & 1 deletion
@@ -8,7 +8,7 @@ ms.author: hrasheed
 ms.date: 07/29/2019
 ---

-# Scenario: Unable to download large data sets using JDBC/ODBC and Apache Thrift software framework in Azure HDInsight
+# Unable to download large data sets using JDBC/ODBC and Apache Thrift software framework in HDInsight

 This article describes troubleshooting steps and possible resolutions for issues when using Apache Spark components in Azure HDInsight clusters.
