You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/hdinsight/hdinsight-hadoop-create-linux-clusters-portal.md
+46-43Lines changed: 46 additions & 43 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ ms.reviewer: jasonh
7
7
ms.service: hdinsight
8
8
ms.custom: hdinsightactive
9
9
ms.topic: conceptual
10
-
ms.date: 09/28/2019
10
+
ms.date: 11/13/2019
11
11
---
12
12
13
13
# Create Linux-based clusters in HDInsight by using the Azure portal
@@ -16,10 +16,10 @@ ms.date: 09/28/2019
16
16
17
17
The Azure portal is a web-based management tool for services and resources hosted in the Microsoft Azure cloud. In this article, you learn how to create Linux-based Azure HDInsight clusters by using the portal.
***An Azure subscription**. See [How to get Azure Free trial for testing Hadoop in HDInsight](https://azure.microsoft.com/documentation/videos/get-azure-free-trial-for-testing-hadoop-in-hdinsight/).
24
24
***A modern web browser**. The Azure portal uses HTML5 and JavaScript. It might not function correctly in older web browsers.
25
25
@@ -31,7 +31,7 @@ The Azure portal exposes most of the cluster properties. By using Azure Resource
31
31
32
32
1. Sign in to the [Azure portal](https://portal.azure.com).
33
33
34
-
1. From the left menu, navigate to **+ Create a resource** > **Analytics** > **HDInsight**.
34
+
1. From the left menu, navigate to **+ Create a resource** > **Analytics** > **Azure HDInsight**.
35
35
36
36

37
37
@@ -41,51 +41,53 @@ The Azure portal exposes most of the cluster properties. By using Azure Resource
41
41
42
42
1. On the **HDInsight** page, select **Custom (size, settings, apps)**.
43
43
44
-
1. Select **1 Basics**. Then enter the following information.
45
-
46
-

47
-
48
-
* Enter the **Cluster Name**. This name must be globally unique.
49
-
50
-
* From the **Subscription** drop-down list, select the Azure subscription that's used for the cluster.
51
-
52
-
* Select **Cluster type**. Then select the type of cluster you want to create. Examples are Hadoop and Apache Spark. The **Operating system** will be **Linux**. Next, select a cluster type version. Use the default version if you don't know what to choose. For more information, see [HDInsight cluster versions](hdinsight-component-versioning.md).
53
-
54
-
> [!IMPORTANT]
55
-
> HDInsight clusters come in a variety of types. They correspond to the workload or technology that the cluster is tuned for. There's no supported method to create a cluster that combines multiple types. Examples are Storm and HBase on one cluster.
56
-
57
-
* For **Cluster login username** and **Cluster login password**, provide the username and password for the admin user.
44
+
1. Select **1 Basics**. Then enter the following information:
58
45
59
-
* Enter an **SSH Username**. If you want the same SSH password as the admin password you specified earlier, select the **Use same password as cluster login** check box. If not, provide either a **PASSWORD** or **PUBLIC KEY** to authenticate the SSH user. A public key is the approach we recommend. Choose **Select** at the bottom to save the credentials configuration.
60
-
61
-
For more information, see [Connect to HDInsight (Apache Hadoop) by using SSH](hdinsight-hadoop-linux-use-ssh-unix.md).
46
+
|Property |Description |
47
+
|---|---|
48
+
|Cluster name|This name must be globally unique.|
49
+
|Subscription|From the drop-down list, select the Azure subscription that's used for the cluster.|
50
+
|Cluster type|Select the type of cluster you want to create. Examples are Hadoop and Apache Spark. The **Operating system** will be **Linux**. Next, select a cluster type version. Use the default version if you don't know what to choose. For more information, see [HDInsight cluster versions](hdinsight-component-versioning.md).|
51
+
|Cluster login username|Provide the username, default is **admin**.|
52
+
|Cluster login password|Provide the password.|
53
+
|Secure Shell (SSH) username|Default is **sshuser**. If you want the same SSH password as the admin password you specified earlier, select the **Use cluster login password for SSH** check box. If not, provide either a **PASSWORD** or **PUBLIC KEY** to authenticate the SSH user. A public key is the approach we recommend. Choose **Select** at the bottom to save the credentials configuration. For more information, see [Connect to HDInsight (Apache Hadoop) by using SSH](hdinsight-hadoop-linux-use-ssh-unix.md).|
54
+
|Resource group|Specify whether you want to create a new resource group or use an existing one.|
55
+
|Location|Specify a datacenter where the cluster is created.|
62
56
63
-
* For **Resource group**, specify whether you want to createa new resource group or use an existing one.
57
+

64
58
65
-
* Specify a datacenter **location** where the cluster is created.
59
+
> [!IMPORTANT]
60
+
> HDInsight clusters come in a variety of types. They correspond to the workload or technology that the cluster is tuned for. There's no supported method to create a cluster that combines multiple types. Examples are Storm and HBase on one cluster.
66
61
67
-
* Select **Next** to move to the next page.
62
+
Select **Next** to move to the next page.
68
63
69
64
1. From **2 Security + networking**, you can connect your cluster to a virtual network by using the provided drop-down menu. Select an Azure virtual network and the subnet if you want to place the cluster into a virtual network. For information on using HDInsight with a virtual network, see [Plan a virtual network deployment for Azure HDInsight clusters](hdinsight-plan-virtual-network-deployment.md). The article includes specific configuration requirements for the virtual network.
70
65
71
66
If you want to use the **Enterprise Security Package**, follow these instructions: [Configure a HDInsight cluster with Enterprise Security Package by using Azure Active Directory Domain Services](https://docs.microsoft.com/azure/hdinsight/domain-joined/apache-domain-joined-configure-using-azure-adds).
72
67
73
68
Select **Next** to move to the next page.
74
69
75
-
1. From **3 Storage**, specify whether you want Azure Storage or Azure Data Lake Storage as your default storage. For more information, see the following table.
70
+
1. From **3 Storage**, for **Storage Account Settings**, specify whether you want Azure Storage or Azure Data Lake Storage as your default storage. For more information, see the following table.
71
+
72
+
| Primary Storage type | Description |
73
+
|------------------|-------------|
74
+
| Azure Storage | * For **Selection method**, choose **My subscriptions** if you want to specify a storage account that's part of your Azure subscription. Then select the storage account. Otherwise, select **Access key**. Then provide the information for the storage account that you want to choose from outside your Azure subscription.</br></br> * For **Default container**, choose the default container name suggested by the portal or specify your own.</br></br> * If Azure Blob storage is your default storage, you can also select **Additional Storage Accounts** to specify additional storage accounts to associate with the cluster. For **Azure Storage Keys**, select **Add a storage key**. Then you can provide a storage account from your Azure subscriptions or from other subscriptions. Provide the storage account access key.</br></br> * If Blob storage is your default storage, you can also select **Data Lake Storage access** to specify Azure Data Lake Storage as additional storage. For more information, see [Quickstart: Set up clusters in HDInsight](../storage/data-lake-storage/quickstart-create-connect-hdi-cluster.md).</li></ul> |
75
+
| Azure Data Lake Storage | Select **Azure Data Lake Storage Gen1** or **Azure Data Lake Storage Gen2**. Then refer to the article [Quickstart: Set up clusters in HDInsight](../storage/data-lake-storage/quickstart-create-connect-hdi-cluster.md) for instructions. |
76
+
77
+
**Metastore Settings (optional)**
76
78
77
-

79
+
As an option, specify a SQL database to save Apache Hive and Apache Oozie metadata associated with the cluster. For **Select a SQL database for Hive**, select a SQL database. Then provide the username and password for the database. Repeat these steps for Oozie metadata.
| **Azure Storage blobs as the default storage** | <ul><li>For **Primary Storage type**, select **Azure Storage**. For **Selection method**, choose **My subscriptions** if you want to specify a storage account that's part of your Azure subscription. Then select the storage account. Otherwise, select **Access key**. Then provide the information for the storage account that you want to choose from outside your Azure subscription.</li><li>For **Default container**, choose the default container name suggested by the portal or specify your own.</li><li>If Azure Blob storage is your default storage, you can also select **Additional Storage Accounts** to specify additional storage accounts to associate with the cluster. For **Azure Storage Keys**, select **Add a storage key**. Then you can provide a storage account from your Azure subscriptions or from other subscriptions. Provide the storage account access key.</li><li>If Blob storage is your default storage, you can also select **Data Lake Storage access** to specify Azure Data Lake Storage as additional storage. For more information, see [Quickstart: Set up clusters in HDInsight](../storage/data-lake-storage/quickstart-create-connect-hdi-cluster.md).</li></ul> |
82
-
| **Azure Data Lake Storage as the default storage** | For **Primary storage type**, select **Azure Data Lake Storage Gen1** or **Azure Data Lake Storage Gen2**. Then refer to the article [Quickstart: Set up clusters in HDInsight](../storage/data-lake-storage/quickstart-create-connect-hdi-cluster.md) for instructions. |
83
-
| **External metastores** | As an option, specify a SQL database to save Apache Hive and Apache Oozie metadata associated with the cluster. For **Select a SQL database for Hive**, select a SQL database. Then provide the username and password for the database. Repeat these steps for Oozie metadata.<br><br>Some considerations about using Azure SQL database for metastores are as follows: <ul><li>The Azure SQL database that's used for the metastore must allow connectivity to other Azure services, including Azure HDInsight. On the right side of the Azure SQL database dashboard, select the server name. This server is the one that the SQL database instance runs on. After you're in server view, select **Configure**. Then for **Azure Services**, select **Yes**. Then select **Save**.</li><li>When you create a metastore, don't name a database with dashes or hyphens. These characters can cause the cluster creation process to fail.</li></ul> |
81
+
Some considerations about using Azure SQL database for metastores are as follows:
82
+
* The Azure SQL database that's used for the metastore must allow connectivity to other Azure services, including Azure HDInsight. On the right side of the Azure SQL database dashboard, select the server name. This server is the one that the SQL database instance runs on. After you're in server view, select **Configure**. Then for **Azure Services**, select **Yes**. Then select **Save**.
83
+
* When you create a metastore, don't name a database with dashes or hyphens. These characters can cause the cluster creation process to fail.
84
84
85
-
> [!WARNING]
86
-
> Using an additional storage account in a different location than the HDInsight cluster isn't supported.
85
+

87
86
88
-
Select **Next** to move to the next page.
87
+
> [!WARNING]
88
+
> Using an additional storage account in a different location than the HDInsight cluster isn't supported.
89
+
90
+
Select **Next** to move to the next page.
89
91
90
92
1. From **4 Applications (optional)**, select any applications that you want. Microsoft, independent software vendors (ISVs), or you can develop these applications. For more information, see [Install applications during cluster creation](hdinsight-apps-install-applications.md#install-applications-during-cluster-creation).
91
93
@@ -108,24 +110,26 @@ The Azure portal exposes most of the cluster properties. By using Azure Resource
108
110
109
111
1. From **7 Summary**, verify the information you entered earlier. Then select **Create**.
110
112
111
-

112
-
113
+

114
+
113
115
> [!NOTE]
114
116
> It takes some time for the cluster to be created, usually around 20 minutes. Monitor **Notifications** to check on the provisioning process.
115
117
116
118
1. After the creation process finishes, select **Go to Resource** from the **Deployment succeeded** notification. The cluster window provides the following information.
Some of the icons in the window are explained as follows:
121
123
122
-
* The **Overview** tab provides all the essential information about the cluster. Examples are the name, the resource group it belongs to, the location, the operating system, and the URL for the cluster dashboard.
123
-
***Dashboard** directs you to the Ambari portal associated with the cluster.
124
-
***Secure Shell** provides information needed to access the cluster by using SSH.
125
-
* By using **Scale cluster**, you can increase the number of worker nodes associated with the cluster.
126
-
* **Delete** deletes the HDInsight cluster.
124
+
|Property | Description |
125
+
|---|---|
126
+
|Overview|Provides all the essential information about the cluster. Examples are the name, the resource group it belongs to, the location, the operating system, and the URL for the cluster dashboard.|
127
+
|Cluster dashboards|Directs you to the Ambari portal associated with the cluster.|
128
+
|SSH + Cluster login|Provides information needed to access the cluster by using SSH.|
129
+
|Delete|Deletes the HDInsight cluster.|
127
130
128
131
## Customize clusters
132
+
129
133
*[Customize HDInsight clusters by using Bootstrap](hdinsight-hadoop-customize-cluster-bootstrap.md)
130
134
*[Customize Linux-based HDInsight clusters by using script actions](hdinsight-hadoop-customize-cluster-linux.md)
131
135
@@ -144,7 +148,6 @@ You've successfully created an HDInsight cluster. Now learn how to work with you
144
148
### Apache Hadoop clusters
145
149
146
150
*[Use Apache Hive with HDInsight](hadoop/hdinsight-use-hive.md)
147
-
*[Use Apache Pig with HDInsight](hadoop/hdinsight-use-pig.md)
148
151
*[Use MapReduce with HDInsight](hadoop/hdinsight-use-mapreduce.md)
0 commit comments