description: Tutorial - Create a Spark application written in Scala with Apache Maven as the build system and an existing Maven archetype for Scala provided by IntelliJ IDEA.
author: hrasheed-msft
ms.author: hrasheed
ms.reviewer: jasonh
ms.service: hdinsight
ms.topic: tutorial
ms.custom: hdinsightactive,mvc
ms.date: 04/17/2020
#customer intent: As a developer new to Apache Spark and to Apache Spark in Azure HDInsight, I want to learn how to create a Scala Maven application for Spark in HDInsight using IntelliJ.
---
# Tutorial: Create a Scala Maven application for Apache Spark in HDInsight using IntelliJ
In this tutorial, you learn how to create an [Apache Spark](./apache-spark-overview.md) application written in [Scala](https://www.scala-lang.org/) using [Apache Maven](https://maven.apache.org/) with IntelliJ IDEA. The article uses Apache Maven as the build system and starts with an existing Maven archetype for Scala provided by IntelliJ IDEA. Creating a Scala application in IntelliJ IDEA involves the following steps:
* Use Maven as the build system.
* Update the Project Object Model (POM) file to resolve Spark module dependencies.
## Install Scala plugin for IntelliJ IDEA
Perform the following steps to install the Scala plugin:
1. Open IntelliJ IDEA.
2. On the welcome screen, navigate to **Configure** > **Plugins** to open the **Plugins** window.
3. Select **Install** for the Scala plugin that is featured in the new window.
4. After the plugin installs successfully, you must restart the IDE.
| Property | Description |
| ----- | ----- |
|Project name| Enter a name.|
|Project location| Enter the location to save your project.|
|Project SDK| This field will be blank on your first use of IDEA. Select **New...** and navigate to your JDK.|
|Spark Version|The creation wizard integrates the proper version for Spark SDK and Scala SDK. If the Spark cluster version is earlier than 2.0, select **Spark 1.x**. Otherwise, select **Spark 2.x**. This example uses **Spark 2.3.0 (Scala 2.11.8)**.|

4. Select the **Create from archetype** checkbox.
5. From the list of archetypes, select **`org.scala-tools.archetypes:scala-archetype-simple`**. This archetype creates the right directory structure and downloads the required default dependencies to write the Scala program.
6. Select **Next**.
7. Expand **Artifact Coordinates**. Provide relevant values for **GroupId** and **ArtifactId**. **Name** and **Location** will autopopulate. The following values are used in this tutorial:
- **GroupId:** com.microsoft.spark.example
- **ArtifactId:** SparkSimpleApp
8. Select **Next**.
11. Once the project has imported, from the left pane navigate to **SparkSimpleApp** > **src** > **test** > **scala** > **com** > **microsoft** > **spark** > **example**. Right-click **MySpec**, and then select **Delete...**. You don't need this file for the application. Select **OK** in the dialog box.
12. In later steps, you update **pom.xml** to define the dependencies for the Spark Scala application. For those dependencies to be downloaded and resolved automatically, you must configure Maven accordingly.
13. From the **File** menu, select **Settings** to open the **Settings** window.
17. From the left pane, navigate to **src** > **main** > **scala** > **com.microsoft.spark.example**, and then double-click **App** to open App.scala.
18. Replace the existing sample code with the following code and save the changes. This code reads the data from HVAC.csv (available on all HDInsight Spark clusters), retrieves the rows that have only one digit in the sixth column, and writes the output to **/HVACOut** under the default storage container for the cluster.
```scala
package com.microsoft.spark.example
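
import org.apache.spark.{SparkConf, SparkContext}

// NOTE: everything below the package line is a sketch reconstructed from the
// description in step 18; the object name, app name, sample-data path, and
// zero-based column index are assumptions, not the tutorial's verbatim code.
object SparkApp {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("SparkSimpleApp")
    val sc = new SparkContext(conf)

    // HVAC.csv ships with HDInsight Spark clusters; this sample path is assumed.
    val lines = sc.textFile("wasb:///HdiSamples/HdiSamples/SensorSampleData/hvac/HVAC.csv")

    // Keep only the rows whose sixth column (zero-based index 5) is one digit long.
    val filtered = lines.filter(line => line.split(",")(5).length == 1)

    // Write the output to /HVACOut under the cluster's default storage container.
    filtered.saveAsTextFile("wasb:///HVACOut")

    sc.stop()
  }
}
```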
Save the changes to **pom.xml**.
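The full dependency list comes from the preceding steps. As a hedged illustration only, a project built against the **Spark 2.3.0 (Scala 2.11.8)** combination selected earlier would typically declare the Spark core artifact along these lines; the version numbers are an assumption tied to that selection, not necessarily the tutorial's exact pom.xml contents:

```xml
<!-- Illustrative sketch: matches the Spark 2.3.0 / Scala 2.11 selection above. -->
<dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-core_2.11</artifactId>
    <version>2.3.0</version>
</dependency>
```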
22. Create the .jar file. IntelliJ IDEA enables creation of a JAR file as an artifact of a project. Perform the following steps:
1. From the **File** menu, select **Project Structure...**.
2. From the **Project Structure** window, navigate to **Artifacts** > **the plus symbol +** > **JAR** > **From modules with dependencies...**.
3. In the **Create JAR from Modules** window, select the folder icon in the **Main Class** text box.
4. In the **Select Main Class** window, select the class that appears by default and then select **OK**.
5. In the **Create JAR from Modules** window, ensure the **extract to the target JAR** option is selected, and then select **OK**. This setting creates a single JAR with all dependencies.

6. The **Output Layout** tab lists all the jars that are included as part of the Maven project. You can select and delete the ones on which the Scala application has no direct dependency. For the application you're creating here, you can remove all but the last one (**SparkSimpleApp compile output**). Select the jars to delete, and then select the negative symbol **-**.
Ensure the **Include in project build** checkbox is selected. This option ensures that the jar is created every time the project is built or updated. Select **Apply** and then **OK**.
7. To create the jar, navigate to **Build** > **Build Artifacts** > **Build**. The project will compile in about 30 seconds. The output jar is created under **\out\artifacts**.
To run the application on the cluster, you can use the following approaches:
* **Copy the application jar to the Azure Storage blob** associated with the cluster. You can use [AzCopy](../../storage/common/storage-use-azcopy.md), a command-line utility, to do so. There are many other clients you can use to upload data as well. You can find more about them at [Upload data for Apache Hadoop jobs in HDInsight](../hdinsight-upload-data.md).
* **Use Apache Livy to submit an application job remotely** to the Spark cluster. Spark clusters on HDInsight include Livy, which exposes REST endpoints to remotely submit Spark jobs. For more information, see [Submit Apache Spark jobs remotely using Apache Livy with Spark clusters on HDInsight](apache-spark-livy-rest-interface.md).