Merge pull request #100849 from dagiro/freshness172

v-albemi · web-flow · commit 2797e334790c · 2020-01-13T12:24:49.000-08:00
freshness172
diff --git a/articles/hdinsight/hadoop/apache-hadoop-use-mapreduce-ssh.md b/articles/hdinsight/hadoop/apache-hadoop-use-mapreduce-ssh.md
@@ -2,15 +2,14 @@
 title: MapReduce and SSH connection with Apache Hadoop - Azure HDInsight
 description: Learn how to use SSH to run MapReduce jobs using Apache Hadoop on HDInsight.
 author: hrasheed-msft
+ms.author: hrasheed
 ms.reviewer: jasonh
-
 ms.service: hdinsight
-ms.custom: hdinsightactive
 ms.topic: conceptual
-ms.date: 04/10/2018
-ms.author: hrasheed
-
+ms.custom: hdinsightactive
+ms.date: 01/10/2020
 ---
+
 # Use MapReduce with Apache Hadoop on HDInsight with SSH
 
 [!INCLUDE [mapreduce-selector](../../../includes/hdinsight-selector-use-mapreduce.md)]
@@ -20,31 +19,17 @@ Learn how to submit MapReduce jobs from a Secure Shell (SSH) connection to HDIns
 > [!NOTE]
 > If you are already familiar with using Linux-based Apache Hadoop servers, but you are new to HDInsight, see [Linux-based HDInsight tips](../hdinsight-hadoop-linux-information.md).
 
-## <a id="prereq"></a>Prerequisites
-
-* A Linux-based HDInsight (Hadoop on HDInsight) cluster
-
-* An SSH client. For more information, see [Use SSH with HDInsight](../hdinsight-hadoop-linux-use-ssh-unix.md)
-
-## <a id="ssh"></a>Connect with SSH
+## Prerequisites
 
-Connect to the cluster using SSH. For example, the following command connects to a cluster named **myhdinsight** as the **sshuser** account:
+An Apache Hadoop cluster on HDInsight. See [Create Apache Hadoop clusters using the Azure portal](../hdinsight-hadoop-create-linux-clusters-portal.md).
 
-```bash
-ssh sshuser@myhdinsight-ssh.azurehdinsight.net
-```
+## Use Hadoop commands
 
-**If you use a certificate key for SSH authentication**, you may need to specify the location of the private key on your client system, for example:
+1. Use [ssh command](../hdinsight-hadoop-linux-use-ssh-unix.md) to connect to your cluster. Edit the command below by replacing CLUSTERNAME with the name of your cluster, and then enter the command:
 
-```bash
-ssh -i ~/mykey.key sshuser@myhdinsight-ssh.azurehdinsight.net
-```
-
-**If you use a password for SSH authentication**, you need to provide the password when prompted.
-
-For more information on using SSH with HDInsight, see [Use SSH with HDInsight](../hdinsight-hadoop-linux-use-ssh-unix.md).
-
-## <a id="hadoop"></a>Use Hadoop commands
+    ```cmd
+    ssh sshuser@CLUSTERNAME-ssh.azurehdinsight.net
+    ```
 
 1. After you are connected to the HDInsight cluster, use the following command to start a MapReduce job:
 
@@ -57,14 +42,16 @@ For more information on using SSH with HDInsight, see [Use SSH with HDInsight](.
     > [!NOTE]
     > For more information about this MapReduce job and the example data, see [Use MapReduce in Apache Hadoop on HDInsight](hdinsight-use-mapreduce.md).
 
-2. The job emits details as it processes, and it returns information similar to the following text when the job completes:
+    The job emits details as it processes, and it returns information similar to the following text when the job completes:
 
-        File Input Format Counters
-        Bytes Read=1395666
-        File Output Format Counters
-        Bytes Written=337623
+    ```output
+    File Input Format Counters
+    Bytes Read=1395666
+    File Output Format Counters
+    Bytes Written=337623
+    ```
 
-3. When the job completes, use the following command to list the output files:
+1. When the job completes, use the following command to list the output files:
 
     ```bash
     hdfs dfs -ls /example/data/WordCountOutput
@@ -75,33 +62,27 @@ For more information on using SSH with HDInsight, see [Use SSH with HDInsight](.
     > [!NOTE]  
     > Some MapReduce jobs may split the results across multiple **part-r-#####** files. If so, use the ##### suffix to indicate the order of the files.
 
-4. To view the output, use the following command:
+1. To view the output, use the following command:
 
     ```bash
     hdfs dfs -cat /example/data/WordCountOutput/part-r-00000
     ```
 
-    This command displays a list of the words that are contained in the **wasb://example/data/gutenberg/davinci.txt** file and the number of times each word occurred. The following text is an example of the data that is contained in the file:
+    This command displays a list of the words that are contained in the **wasbs://example/data/gutenberg/davinci.txt** file and the number of times each word occurred. The following text is an example of the data that is contained in the file:
 
-        wreathed        3
-        wreathing       1
-        wreaths         1
-        wrecked         3
-        wrenching       1
-        wretched        6
-        wriggling       1
-
-## <a id="summary"></a>Summary
-
-As you can see, Hadoop commands provide an easy way to run MapReduce jobs in an HDInsight cluster and then view the job output.
+    ```output
+    wreathed        3
+    wreathing       1
+    wreaths         1
+    wrecked         3
+    wrenching       1
+    wretched        6
+    wriggling       1
+    ```
 
-## <a id="nextsteps"></a>Next steps
+## Next steps
 
-For general information about MapReduce jobs in HDInsight:
+As you can see, Hadoop commands provide an easy way to run MapReduce jobs in an HDInsight cluster and then view the job output. For information about other ways you can work with Hadoop on HDInsight:
 
 * [Use MapReduce on HDInsight Hadoop](hdinsight-use-mapreduce.md)
-
-For information about other ways you can work with Hadoop on HDInsight:
-
 * [Use Apache Hive with Apache Hadoop on HDInsight](hdinsight-use-hive.md)
-* [Use Apache Pig with Apache Hadoop on HDInsight](hdinsight-use-pig.md)