Skip to content

Commit 92bf55c

Browse files
authored
Merge pull request #50225 from JasonWHowell/patch-2
Fixes MicrosoftDocs/azure-docs#14081
2 parents ce85a1e + 1490ab2 commit 92bf55c

File tree

1 file changed

+16
-19
lines changed

1 file changed

+16
-19
lines changed

articles/hdinsight/hdinsight-hadoop-provision-linux-clusters.md

Lines changed: 16 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -4,14 +4,12 @@ description: Set up Hadoop, Kafka, Spark, HBase, ML Services, or Storm clusters
44
keywords: hadoop cluster setup, kafka cluster setup, spark cluster setup, what is cluster in hadoop
55
services: hdinsight
66
author: jasonwhowell
7+
ms.author: jasonh
78
ms.reviewer: jasonh
8-
99
ms.service: hdinsight
1010
ms.custom: hdinsightactive,hdiseo17may2017
1111
ms.topic: conceptual
12-
ms.date: 05/14/2018
13-
ms.author: jasonh
14-
12+
ms.date: 08/27/2018
1513
---
1614
# Set up clusters in HDInsight with Hadoop, Spark, Kafka, and more
1715

@@ -136,11 +134,24 @@ To increase performance when using Oozie, use a custom metastore. A metastore ca
136134
> [!IMPORTANT]
137135
> You cannot reuse a custom Oozie metastore. To use a custom Oozie metastore, you must provide an empty Azure SQL Database when creating the HDInsight cluster.
138136
137+
138+
## Custom cluster setup
139+
Custom cluster setup builds on the Quick create settings, and adds the following options:
140+
- [HDInsight applications](#install-hdinsight-applications-on-clusters)
141+
- [Cluster size](#configure-cluster-size)
142+
- [Script actions](#advanced-settings-script-actions)
143+
- [Virtual network](#advanced-settings-extend-clusters-with-a-virtual-network)
144+
145+
## Install HDInsight applications on clusters
146+
147+
An HDInsight application is an application that users can install on a Linux-based HDInsight cluster. You can use applications provided by Microsoft, third parties, or that you develop yourself. For more information, see [Install third-party Hadoop applications on Azure HDInsight](hdinsight-apps-install-applications.md).
148+
149+
Most of the HDInsight applications are installed on an empty edge node. An empty edge node is a Linux virtual machine with the same client tools installed and configured as in the head node. You can use the edge node for accessing the cluster, testing your client applications, and hosting your client applications. For more information, see [Use empty edge nodes in HDInsight](hdinsight-apps-use-edge-node.md).
150+
139151
## Configure cluster size
140152

141153
You are billed for node usage for as long as the cluster exists. Billing starts when a cluster is created and stops when the cluster is deleted. Clusters can’t be de-allocated or put on hold.
142154

143-
144155
### Number of nodes for each cluster type
145156
Each cluster type has its own number of nodes, terminology for nodes, and default VM size. In the following table, the number of nodes for each node type is in parentheses.
146157

@@ -188,20 +199,6 @@ To find out what value you should use to specify a VM size while creating a clus
188199
189200
For more information, see [Sizes for virtual machines](../virtual-machines/windows/sizes.md). For information about pricing of the various sizes, see [HDInsight pricing](https://azure.microsoft.com/pricing/details/hdinsight).
190201

191-
## Custom cluster setup
192-
Custom cluster setup builds on the Quick create settings, and adds the following options:
193-
- [HDInsight applications](#hdinsight-applications)
194-
- [Cluster size](#cluster-size)
195-
- Advanced settings
196-
- [Script actions](#customize-clusters-using-script-action)
197-
- [Virtual network](#use-virtual-network)
198-
199-
## Install HDInsight applications on clusters
200-
201-
An HDInsight application is an application that users can install on a Linux-based HDInsight cluster. You can use applications provided by Microsoft, third parties, or that you develop yourself. For more information, see [Install third-party Hadoop applications on Azure HDInsight](hdinsight-apps-install-applications.md).
202-
203-
Most of the HDInsight applications are installed on an empty edge node. An empty edge node is a Linux virtual machine with the same client tools installed and configured as in the head node. You can use the edge node for accessing the cluster, testing your client applications, and hosting your client applications. For more information, see [Use empty edge nodes in HDInsight](hdinsight-apps-use-edge-node.md).
204-
205202
## Advanced settings: Script actions
206203

207204
You can install additional components or customize cluster configuration by using scripts during creation. Such scripts are invoked via **Script Action**, which is a configuration option that can be used from the Azure portal, HDInsight Windows PowerShell cmdlets, or the HDInsight .NET SDK. For more information, see [Customize HDInsight cluster using Script Action](hdinsight-hadoop-customize-cluster-linux.md).

0 commit comments

Comments
 (0)