Skip to content

Commit 15a8779

Browse files
authored
Merge pull request #99566 from dagiro/freshness132
freshness132
2 parents bc67e93 + 15d32de commit 15a8779

File tree

1 file changed

+17
-11
lines changed

1 file changed

+17
-11
lines changed

articles/hdinsight/hdinsight-hadoop-windows-tools.md

Lines changed: 17 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -4,19 +4,20 @@ description: Work from a Windows PC in Hadoop on HDInsight. Manage and query clu
44
author: hrasheed-msft
55
ms.author: hrasheed
66
ms.reviewer: jasonh
7-
ms.topic: conceptual
87
ms.service: hdinsight
8+
ms.topic: conceptual
99
ms.custom: hdinsightactive,hdiseo17may2017
10-
ms.date: 04/24/2019
10+
ms.date: 12/20/2019
1111
---
1212

1313
# Work in the Apache Hadoop ecosystem on HDInsight from a Windows PC
1414

15-
Learn about development and management options on the Windows PC for working in the Apache Hadoop ecosystem on HDInsight.
15+
Learn about development and management options on the Windows PC for working in the Apache Hadoop ecosystem on HDInsight.
1616

1717
HDInsight is based on Apache Hadoop and Hadoop components, open-source technologies developed on Linux. HDInsight version 3.4 and higher uses the Ubuntu Linux distribution as the underlying OS for the cluster. However, you can work with HDInsight from a Windows client or Windows development environment.
1818

1919
## Use PowerShell for deployment and management tasks
20+
2021
Azure PowerShell is a scripting environment that you can use to control and automate deployment and management tasks in HDInsight from Windows.
2122

2223
Examples of tasks you can do with PowerShell:
@@ -28,23 +29,26 @@ Examples of tasks you can do with PowerShell:
2829
Follow steps to [install and configure Azure Powershell](https://docs.microsoft.com/powershell/azure/install-az-ps) to get the latest version.
2930

3031
## Utilities you can run in a browser
32+
3133
The following utilities have a web UI that runs in a browser:
3234
* **[Azure Cloud Shell](https://docs.microsoft.com/azure/cloud-shell/overview)** is an interactive, command-line shell that runs in your browser and from within the Azure portal.
35+
3336
* **[Apache Ambari Web UI](hdinsight-hadoop-manage-ambari.md)** is a management and monitoring utility available in the Azure portal that can be used to manage different kinds of jobs, such as:
3437
* [Use Apache Ambari with the REST API](hdinsight-hadoop-manage-ambari-rest-api.md)
3538
* [Apache Hive View in Apache Ambari](hadoop/apache-hadoop-use-hive-ambari-view.md)
3639
* [Apache Tez View in Apache Ambari](hdinsight-debug-ambari-tez-view.md)
3740

3841
## Data Lake (Hadoop) Tools for Visual Studio
42+
3943
Use Data Lake Tools for Visual Studio to deploy and manage Storm topologies. Data Lake Tools also installs the SCP.NET SDK, which allows you to develop C# Storm topologies with Visual Studio.
4044

41-
Before you go to the following examples, [install and try Data Lake Tools for Visual Studio](hadoop/apache-hadoop-visual-studio-tools-get-started.md).
45+
Before you go to the following examples, [install and try Data Lake Tools for Visual Studio](hadoop/apache-hadoop-visual-studio-tools-get-started.md).
4246

4347
Examples of tasks you can do with Visual Studio and Data Lake Tools for Visual Studio:
4448
* [Deploy and manage Storm topologies from Visual Studio](storm/apache-storm-deploy-monitor-topology-linux.md)
4549
* [Develop C# topologies for Storm using Visual Studio](storm/apache-storm-develop-csharp-visual-studio-topology.md). The bits include example templates for Storm topologies you can connect to databases, such as Azure Cosmos DB and SQL Database.
4650

47-
## Visual Studio and the .NET SDK
51+
## Visual Studio and the .NET SDK
4852

4953
You can use Visual Studio with the .NET SDK to manage clusters and develop big data applications. You can use other IDEs for the following tasks, but examples are shown in Visual Studio.
5054

@@ -54,25 +58,26 @@ Examples of tasks you can do with the .NET SDK in Visual Studio:
5458
* [Use C# user-defined functions with Apache Hive and Apache Pig streaming on Apache Hadoop](hadoop/apache-hadoop-hive-pig-udf-dotnet-csharp.md).
5559

5660
## Intellij IDEA and Eclipse IDE for Spark clusters
61+
5762
Both [Intellij IDEA](https://www.jetbrains.com/idea/download) and the [Eclipse IDE](https://www.eclipse.org/downloads/) can be used to:
5863
* Develop and submit a Scala Spark application on an HDInsight Spark cluster.
5964
* Access Spark cluster resources.
6065
* Develop and run a Scala Spark application locally.
6166

62-
These articles show how:
67+
These articles show how:
6368
* Intellij IDEA: [Create Apache Spark applications using the Azure Toolkit for Intellij plug-in and the Scala SDK.](spark/apache-spark-intellij-tool-plugin.md)
64-
* Eclipse IDE or Scala IDE for Eclipse: [Create Apache Spark applications and the Azure Toolkit for Eclipse](spark/apache-spark-eclipse-tool-plugin.md)
69+
* Eclipse IDE or Scala IDE for Eclipse: [Create Apache Spark applications and the Azure Toolkit for Eclipse](spark/apache-spark-eclipse-tool-plugin.md)
6570

71+
## Notebooks on Spark for data scientists
6672

67-
## Notebooks on Spark for data scientists
68-
Apache Spark clusters in HDInsight include Apache Zeppelin notebooks and kernels that can be used with Jupyter notebooks.
73+
Apache Spark clusters in HDInsight include Apache Zeppelin notebooks and kernels that can be used with Jupyter notebooks.
6974

7075
* [Learn how to use kernels on Apache Spark clusters with Jupyter notebooks to test Spark applications](spark/apache-spark-zeppelin-notebook.md)
71-
* [Learn how to use Apache Zeppelin notebooks on Apache Spark clusters to run Spark jobs](spark/apache-spark-jupyter-notebook-kernels.md)
76+
* [Learn how to use Apache Zeppelin notebooks on Apache Spark clusters to run Spark jobs](spark/apache-spark-jupyter-notebook-kernels.md)
7277

7378
## Run Linux-based tools and technologies on Windows
7479

75-
If you encounter a situation where you must use a tool or technology that is only available on Linux, consider the following options:
80+
If you come across a situation where you must use a tool or technology that is only available on Linux, consider the following options:
7681

7782
* **Bash on Ubuntu on Windows 10** provides a Linux subsystem on Windows. Bash allows you to directly run Linux utilities without having to maintain a dedicated Linux installation. See [Windows Subsystem for Linux Installation Guide for Windows 10](https://docs.microsoft.com/windows/wsl/install-win10) for installation steps. Other [Unix shells](https://www.gnu.org/software/bash/) will work as well.
7883
* **Docker for Windows** provides access to many Linux-based tools, and can be run directly from Windows. For example, you can use Docker to run the Beeline client for Hive directly from Windows. You can also use Docker to run a local Jupyter notebook and remotely connect to Spark on HDInsight. [Get started with Docker for Windows](https://docs.docker.com/docker-for-windows/)
@@ -83,6 +88,7 @@ If you encounter a situation where you must use a tool or technology that is onl
8388
The Azure command-line interface (CLI) is Microsoft's cross-platform command-line experience for managing Azure resources. For more information, see [Azure Command-Line Interface (CLI)](https://docs.microsoft.com/cli/azure/?view=azure-cli-latest).
8489

8590
## Next steps
91+
8692
If you're new to working in Linux-based clusters, see the follow articles:
8793
* [Set up Apache Hadoop, Apache Kafka, Apache Spark, or other clusters](hdinsight-hadoop-provision-linux-clusters.md)
8894
* [Tips for HDInsight clusters on Linux](hdinsight-hadoop-linux-information.md)

0 commit comments

Comments
 (0)