Commit 0feb684

Merge pull request #95854 from dagiro/freshness48
freshness48
2 parents 57fae20 + 3408076 commit 0feb684

3 files changed: +46 -43 lines changed

articles/hdinsight/hdinsight-hadoop-create-linux-clusters-portal.md

Lines changed: 46 additions & 43 deletions
@@ -7,7 +7,7 @@ ms.reviewer: jasonh
 ms.service: hdinsight
 ms.custom: hdinsightactive
 ms.topic: conceptual
-ms.date: 09/28/2019
+ms.date: 11/13/2019
 ---
 
 # Create Linux-based clusters in HDInsight by using the Azure portal
@@ -16,10 +16,10 @@ ms.date: 09/28/2019
 
 The Azure portal is a web-based management tool for services and resources hosted in the Microsoft Azure cloud. In this article, you learn how to create Linux-based Azure HDInsight clusters by using the portal.
 
-## Prerequisites
-
 [!INCLUDE [delete-cluster-warning](../../includes/hdinsight-delete-cluster-warning.md)]
 
+## Prerequisites
+
 * **An Azure subscription**. See [How to get Azure Free trial for testing Hadoop in HDInsight](https://azure.microsoft.com/documentation/videos/get-azure-free-trial-for-testing-hadoop-in-hdinsight/).
 * **A modern web browser**. The Azure portal uses HTML5 and JavaScript. It might not function correctly in older web browsers.
 
@@ -31,7 +31,7 @@ The Azure portal exposes most of the cluster properties. By using Azure Resource
 
 1. Sign in to the [Azure portal](https://portal.azure.com).
 
-1. From the left menu, navigate to **+ Create a resource** > **Analytics** > **HDInsight**.
+1. From the left menu, navigate to **+ Create a resource** > **Analytics** > **Azure HDInsight**.
 
 ![Create a new cluster in the Azure portal](./media/hdinsight-hadoop-create-linux-clusters-portal/hdinsight-create-cluster.png "Creating a new cluster in the Azure portal")
 
@@ -41,51 +41,53 @@ The Azure portal exposes most of the cluster properties. By using Azure Resource
 
 1. On the **HDInsight** page, select **Custom (size, settings, apps)**.
 
-1. Select **1 Basics**. Then enter the following information.
-
-![HDInsight create cluster basics](./media/hdinsight-hadoop-create-linux-clusters-portal/hdinsight-create-cluster-basics.png "Creating a new cluster in the Azure portal")
-
-* Enter the **Cluster Name**. This name must be globally unique.
-
-* From the **Subscription** drop-down list, select the Azure subscription that's used for the cluster.
-
-* Select **Cluster type**. Then select the type of cluster you want to create. Examples are Hadoop and Apache Spark. The **Operating system** will be **Linux**. Next, select a cluster type version. Use the default version if you don't know what to choose. For more information, see [HDInsight cluster versions](hdinsight-component-versioning.md).
-
-> [!IMPORTANT]
-> HDInsight clusters come in a variety of types. They correspond to the workload or technology that the cluster is tuned for. There's no supported method to create a cluster that combines multiple types. Examples are Storm and HBase on one cluster.
-
-* For **Cluster login username** and **Cluster login password**, provide the username and password for the admin user.
+1. Select **1 Basics**. Then enter the following information:
 
-* Enter an **SSH Username**. If you want the same SSH password as the admin password you specified earlier, select the **Use same password as cluster login** check box. If not, provide either a **PASSWORD** or **PUBLIC KEY** to authenticate the SSH user. A public key is the approach we recommend. Choose **Select** at the bottom to save the credentials configuration.
-
-For more information, see [Connect to HDInsight (Apache Hadoop) by using SSH](hdinsight-hadoop-linux-use-ssh-unix.md).
+|Property |Description |
+|---|---|
+|Cluster name|This name must be globally unique.|
+|Subscription|From the drop-down list, select the Azure subscription that's used for the cluster.|
+|Cluster type|Select the type of cluster you want to create. Examples are Hadoop and Apache Spark. The **Operating system** will be **Linux**. Next, select a cluster type version. Use the default version if you don't know what to choose. For more information, see [HDInsight cluster versions](hdinsight-component-versioning.md).|
+|Cluster login username|Provide the username, default is **admin**.|
+|Cluster login password|Provide the password.|
+|Secure Shell (SSH) username|Default is **sshuser**. If you want the same SSH password as the admin password you specified earlier, select the **Use cluster login password for SSH** check box. If not, provide either a **PASSWORD** or **PUBLIC KEY** to authenticate the SSH user. A public key is the approach we recommend. Choose **Select** at the bottom to save the credentials configuration. For more information, see [Connect to HDInsight (Apache Hadoop) by using SSH](hdinsight-hadoop-linux-use-ssh-unix.md).|
+|Resource group|Specify whether you want to create a new resource group or use an existing one.|
+|Location|Specify a datacenter where the cluster is created.|
 
-* For **Resource group**, specify whether you want to create a new resource group or use an existing one.
+![HDInsight create cluster basics](./media/hdinsight-hadoop-create-linux-clusters-portal/hdinsight-create-cluster-basics.png "Creating a new cluster in the Azure portal")
 
-* Specify a datacenter **location** where the cluster is created.
+> [!IMPORTANT]
+> HDInsight clusters come in a variety of types. They correspond to the workload or technology that the cluster is tuned for. There's no supported method to create a cluster that combines multiple types. Examples are Storm and HBase on one cluster.
 
-* Select **Next** to move to the next page.
+Select **Next** to move to the next page.
 
 1. From **2 Security + networking**, you can connect your cluster to a virtual network by using the provided drop-down menu. Select an Azure virtual network and the subnet if you want to place the cluster into a virtual network. For information on using HDInsight with a virtual network, see [Plan a virtual network deployment for Azure HDInsight clusters](hdinsight-plan-virtual-network-deployment.md). The article includes specific configuration requirements for the virtual network.
 
 If you want to use the **Enterprise Security Package**, follow these instructions: [Configure a HDInsight cluster with Enterprise Security Package by using Azure Active Directory Domain Services](https://docs.microsoft.com/azure/hdinsight/domain-joined/apache-domain-joined-configure-using-azure-adds).
 
 Select **Next** to move to the next page.
 
-1. From **3 Storage**, specify whether you want Azure Storage or Azure Data Lake Storage as your default storage. For more information, see the following table.
+1. From **3 Storage**, for **Storage Account Settings**, specify whether you want Azure Storage or Azure Data Lake Storage as your default storage. For more information, see the following table.
+
+| Primary Storage type | Description |
+|------------------|-------------|
+| Azure Storage | * For **Selection method**, choose **My subscriptions** if you want to specify a storage account that's part of your Azure subscription. Then select the storage account. Otherwise, select **Access key**. Then provide the information for the storage account that you want to choose from outside your Azure subscription.</br></br> * For **Default container**, choose the default container name suggested by the portal or specify your own.</br></br> * If Azure Blob storage is your default storage, you can also select **Additional Storage Accounts** to specify additional storage accounts to associate with the cluster. For **Azure Storage Keys**, select **Add a storage key**. Then you can provide a storage account from your Azure subscriptions or from other subscriptions. Provide the storage account access key.</br></br> * If Blob storage is your default storage, you can also select **Data Lake Storage access** to specify Azure Data Lake Storage as additional storage. For more information, see [Quickstart: Set up clusters in HDInsight](../storage/data-lake-storage/quickstart-create-connect-hdi-cluster.md).</li></ul> |
+| Azure Data Lake Storage | Select **Azure Data Lake Storage Gen1** or **Azure Data Lake Storage Gen2**. Then refer to the article [Quickstart: Set up clusters in HDInsight](../storage/data-lake-storage/quickstart-create-connect-hdi-cluster.md) for instructions. |
+
+**Metastore Settings (optional)**
 
-![HDInsight create cluster storage](./media/hdinsight-hadoop-create-linux-clusters-portal/hdinsight-create-cluster-storage.png "Creating a new cluster in the Azure portal")
+As an option, specify a SQL database to save Apache Hive and Apache Oozie metadata associated with the cluster. For **Select a SQL database for Hive**, select a SQL database. Then provide the username and password for the database. Repeat these steps for Oozie metadata.
 
-| Storage | Description |
-|----------------------------------------------|-------------|
-| **Azure Storage blobs as the default storage** | <ul><li>For **Primary Storage type**, select **Azure Storage**. For **Selection method**, choose **My subscriptions** if you want to specify a storage account that's part of your Azure subscription. Then select the storage account. Otherwise, select **Access key**. Then provide the information for the storage account that you want to choose from outside your Azure subscription.</li><li>For **Default container**, choose the default container name suggested by the portal or specify your own.</li><li>If Azure Blob storage is your default storage, you can also select **Additional Storage Accounts** to specify additional storage accounts to associate with the cluster. For **Azure Storage Keys**, select **Add a storage key**. Then you can provide a storage account from your Azure subscriptions or from other subscriptions. Provide the storage account access key.</li><li>If Blob storage is your default storage, you can also select **Data Lake Storage access** to specify Azure Data Lake Storage as additional storage. For more information, see [Quickstart: Set up clusters in HDInsight](../storage/data-lake-storage/quickstart-create-connect-hdi-cluster.md).</li></ul> |
-| **Azure Data Lake Storage as the default storage** | For **Primary storage type**, select **Azure Data Lake Storage Gen1** or **Azure Data Lake Storage Gen2**. Then refer to the article [Quickstart: Set up clusters in HDInsight](../storage/data-lake-storage/quickstart-create-connect-hdi-cluster.md) for instructions. |
-| **External metastores** | As an option, specify a SQL database to save Apache Hive and Apache Oozie metadata associated with the cluster. For **Select a SQL database for Hive**, select a SQL database. Then provide the username and password for the database. Repeat these steps for Oozie metadata.<br><br>Some considerations about using Azure SQL database for metastores are as follows: <ul><li>The Azure SQL database that's used for the metastore must allow connectivity to other Azure services, including Azure HDInsight. On the right side of the Azure SQL database dashboard, select the server name. This server is the one that the SQL database instance runs on. After you're in server view, select **Configure**. Then for **Azure Services**, select **Yes**. Then select **Save**.</li><li>When you create a metastore, don't name a database with dashes or hyphens. These characters can cause the cluster creation process to fail.</li></ul> |
+Some considerations about using Azure SQL database for metastores are as follows:
+* The Azure SQL database that's used for the metastore must allow connectivity to other Azure services, including Azure HDInsight. On the right side of the Azure SQL database dashboard, select the server name. This server is the one that the SQL database instance runs on. After you're in server view, select **Configure**. Then for **Azure Services**, select **Yes**. Then select **Save**.
+* When you create a metastore, don't name a database with dashes or hyphens. These characters can cause the cluster creation process to fail.
 
-> [!WARNING]
-> Using an additional storage account in a different location than the HDInsight cluster isn't supported.
+![HDInsight create cluster storage](./media/hdinsight-hadoop-create-linux-clusters-portal/hdinsight-create-cluster-storage.png "Creating a new cluster in the Azure portal")
 
-Select **Next** to move to the next page.
+> [!WARNING]
+> Using an additional storage account in a different location than the HDInsight cluster isn't supported.
+
+Select **Next** to move to the next page.
 
 1. From **4 Applications (optional)**, select any applications that you want. Microsoft, independent software vendors (ISVs), or you can develop these applications. For more information, see [Install applications during cluster creation](hdinsight-apps-install-applications.md#install-applications-during-cluster-creation).
 
@@ -108,24 +110,26 @@ The Azure portal exposes most of the cluster properties. By using Azure Resource
 
 1. From **7 Summary**, verify the information you entered earlier. Then select **Create**.
 
-![HDInsight create cluster summary](./media/hdinsight-hadoop-create-linux-clusters-portal/hdinsight-create-cluster-summary.png "Specify number of cluster nodes")
-
+![HDInsight create cluster summary](./media/hdinsight-hadoop-create-linux-clusters-portal/hdinsight-create-cluster-summary.png "Specify number of cluster nodes")
+
 > [!NOTE]
 > It takes some time for the cluster to be created, usually around 20 minutes. Monitor **Notifications** to check on the provisioning process.
 
 1. After the creation process finishes, select **Go to Resource** from the **Deployment succeeded** notification. The cluster window provides the following information.
 
 ![HDI Azure portal cluster overview](./media/hdinsight-hadoop-create-linux-clusters-portal/hdinsight-create-cluster-completed.png "Cluster properties")
 
-The icons in the window are explained as follows:
+Some of the icons in the window are explained as follows:
 
-* The **Overview** tab provides all the essential information about the cluster. Examples are the name, the resource group it belongs to, the location, the operating system, and the URL for the cluster dashboard.
-* **Dashboard** directs you to the Ambari portal associated with the cluster.
-* **Secure Shell** provides information needed to access the cluster by using SSH.
-* By using **Scale cluster**, you can increase the number of worker nodes associated with the cluster.
-* **Delete** deletes the HDInsight cluster.
+|Property | Description |
+|---|---|
+|Overview|Provides all the essential information about the cluster. Examples are the name, the resource group it belongs to, the location, the operating system, and the URL for the cluster dashboard.|
+|Cluster dashboards|Directs you to the Ambari portal associated with the cluster.|
+|SSH + Cluster login|Provides information needed to access the cluster by using SSH.|
+|Delete|Deletes the HDInsight cluster.|
 
 ## Customize clusters
+
 * [Customize HDInsight clusters by using Bootstrap](hdinsight-hadoop-customize-cluster-bootstrap.md)
 * [Customize Linux-based HDInsight clusters by using script actions](hdinsight-hadoop-customize-cluster-linux.md)
 
@@ -144,7 +148,6 @@ You've successfully created an HDInsight cluster. Now learn how to work with you
 ### Apache Hadoop clusters
 
 * [Use Apache Hive with HDInsight](hadoop/hdinsight-use-hive.md)
-* [Use Apache Pig with HDInsight](hadoop/hdinsight-use-pig.md)
 * [Use MapReduce with HDInsight](hadoop/hdinsight-use-mapreduce.md)
 
 ### Apache HBase clusters
Two binary files also changed (size changes shown: -2.44 KB and 15 Bytes); their contents are not displayed.
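
The storage step in the changed article states two constraints: a metastore database name must not contain dashes or hyphens, and an additional storage account must be in the same location as the cluster. A minimal sketch of how those two rules could be checked before opening the portal follows. The `settings` dictionary, its keys, and the function name are hypothetical and exist only for this illustration; they are not part of any Azure API or SDK.

```python
def check_cluster_settings(settings):
    """Return a list of human-readable problems found in `settings`.

    `settings` is a hypothetical plain dict used only for this sketch;
    it does not correspond to an Azure SDK object.
    """
    problems = []

    # Rule from the article: don't name a metastore database with hyphens.
    for name in settings.get("metastore_databases", []):
        if "-" in name:
            problems.append(f"metastore database '{name}' contains a hyphen")

    # Rule from the article: additional storage accounts must be in the
    # same location as the cluster. Compare case-insensitively.
    cluster_location = settings.get("location", "").strip().lower()
    for account in settings.get("additional_storage_accounts", []):
        if account["location"].strip().lower() != cluster_location:
            problems.append(
                f"storage account '{account['name']}' is in "
                f"'{account['location']}', not in the cluster location"
            )

    return problems


if __name__ == "__main__":
    # Hypothetical example values, chosen to trigger both rules.
    example = {
        "location": "East US",
        "metastore_databases": ["hivemeta", "oozie-meta"],
        "additional_storage_accounts": [
            {"name": "logsstore", "location": "West Europe"},
        ],
    }
    for problem in check_cluster_settings(example):
        print(problem)
```

The check is deliberately local and string-based; whether a given database or storage account actually exists still has to be confirmed in the portal.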
