You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/hdinsight/hdinsight-hadoop-provision-linux-clusters.md
+16-19Lines changed: 16 additions & 19 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,14 +4,12 @@ description: Set up Hadoop, Kafka, Spark, HBase, ML Services, or Storm clusters
4
4
keywords: hadoop cluster setup, kafka cluster setup, spark cluster setup, what is cluster in hadoop
5
5
services: hdinsight
6
6
author: jasonwhowell
7
+
ms.author: jasonh
7
8
ms.reviewer: jasonh
8
-
9
9
ms.service: hdinsight
10
10
ms.custom: hdinsightactive,hdiseo17may2017
11
11
ms.topic: conceptual
12
-
ms.date: 05/14/2018
13
-
ms.author: jasonh
14
-
12
+
ms.date: 08/27/2018
15
13
---
16
14
# Set up clusters in HDInsight with Hadoop, Spark, Kafka, and more
17
15
@@ -136,11 +134,24 @@ To increase performance when using Oozie, use a custom metastore. A metastore ca
136
134
> [!IMPORTANT]
137
135
> You cannot reuse a custom Oozie metastore. To use a custom Oozie metastore, you must provide an empty Azure SQL Database when creating the HDInsight cluster.
138
136
137
+
138
+
## Custom cluster setup
139
+
Custom cluster setup builds on the Quick create settings, and adds the following options:
An HDInsight application is an application that users can install on a Linux-based HDInsight cluster. You can use applications provided by Microsoft, third parties, or that you develop yourself. For more information, see [Install third-party Hadoop applications on Azure HDInsight](hdinsight-apps-install-applications.md).
148
+
149
+
Most of the HDInsight applications are installed on an empty edge node. An empty edge node is a Linux virtual machine with the same client tools installed and configured as in the head node. You can use the edge node for accessing the cluster, testing your client applications, and hosting your client applications. For more information, see [Use empty edge nodes in HDInsight](hdinsight-apps-use-edge-node.md).
150
+
139
151
## Configure cluster size
140
152
141
153
You are billed for node usage for as long as the cluster exists. Billing starts when a cluster is created and stops when the cluster is deleted. Clusters can’t be de-allocated or put on hold.
142
154
143
-
144
155
### Number of nodes for each cluster type
145
156
Each cluster type has its own number of nodes, terminology for nodes, and default VM size. In the following table, the number of nodes for each node type is in parentheses.
146
157
@@ -188,20 +199,6 @@ To find out what value you should use to specify a VM size while creating a clus
188
199
189
200
For more information, see [Sizes for virtual machines](../virtual-machines/windows/sizes.md). For information about pricing of the various sizes, see [HDInsight pricing](https://azure.microsoft.com/pricing/details/hdinsight).
190
201
191
-
## Custom cluster setup
192
-
Custom cluster setup builds on the Quick create settings, and adds the following options:
An HDInsight application is an application that users can install on a Linux-based HDInsight cluster. You can use applications provided by Microsoft, third parties, or that you develop yourself. For more information, see [Install third-party Hadoop applications on Azure HDInsight](hdinsight-apps-install-applications.md).
202
-
203
-
Most of the HDInsight applications are installed on an empty edge node. An empty edge node is a Linux virtual machine with the same client tools installed and configured as in the head node. You can use the edge node for accessing the cluster, testing your client applications, and hosting your client applications. For more information, see [Use empty edge nodes in HDInsight](hdinsight-apps-use-edge-node.md).
204
-
205
202
## Advanced settings: Script actions
206
203
207
204
You can install additional components or customize cluster configuration by using scripts during creation. Such scripts are invoked via **Script Action**, which is a configuration option that can be used from the Azure portal, HDInsight Windows PowerShell cmdlets, or the HDInsight .NET SDK. For more information, see [Customize HDInsight cluster using Script Action](hdinsight-hadoop-customize-cluster-linux.md).
0 commit comments