
Commit 0882cd1

committed
cats126
1 parent 97bbebb commit 0882cd1

3 files changed

+28
-32
lines changed

articles/hdinsight/hdinsight-hadoop-create-linux-clusters-with-secure-transfer-storage.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -9,11 +9,13 @@ ms.custom: hdinsightactive
99
ms.topic: conceptual
1010
ms.date: 07/24/2018
1111
---
12+
1213
# Create Apache Hadoop cluster with secure transfer storage accounts in Azure HDInsight
1314

1415
The [Secure transfer required](../storage/common/storage-require-secure-transfer.md) feature enhances the security of your Azure Storage account by enforcing all requests to your account through a secure connection. This feature and the wasbs scheme are only supported by HDInsight cluster version 3.6 or newer.
1516

1617
## Prerequisites
18+
1719
Before you begin this article, you must have:
1820

1921
* **Azure subscription**: To create a free one-month trial account, browse to [azure.microsoft.com/free](https://azure.microsoft.com/free).
@@ -24,12 +26,11 @@ Before you begin this article, you must have:
2426

2527
[!INCLUDE [delete-cluster-warning](../../includes/hdinsight-delete-cluster-warning.md)]
2628

27-
2829
In this section, you create a Hadoop cluster in HDInsight using an [Azure Resource Manager template](../azure-resource-manager/resource-group-template-deploy.md). The template is located in [GitHub](https://azure.microsoft.com/resources/templates/101-hdinsight-linux-with-existing-default-storage-account/). Resource Manager template experience is not required for following this article. For other cluster creation methods and understanding the properties used in this article, see [Create HDInsight clusters](hdinsight-hadoop-provision-linux-clusters.md).
2930

3031
1. Click the following image to sign in to Azure and open the Resource Manager template in the Azure portal.
3132

32-
<a href="https://portal.azure.com/#create/Microsoft.Template/uri/https%3A%2F%2Fraw.githubusercontent.com%2FAzure%2Fazure-quickstart-templates%2Fmaster%2F101-hdinsight-linux-with-existing-default-storage-account%2Fazuredeploy.json" target="_blank"><img src="./media/hdinsight-hadoop-create-linux-clusters-with-secure-transfer-storage/hdi-deploy-to-azure1.png" alt="Deploy to Azure"></a>
33+
<a href="https://portal.azure.com/#create/Microsoft.Template/uri/https%3A%2F%2Fraw.githubusercontent.com%2FAzure%2Fazure-quickstart-templates%2Fmaster%2F101-hdinsight-linux-with-existing-default-storage-account%2Fazuredeploy.json" target="_blank"><img src="./media/hdinsight-hadoop-create-linux-clusters-with-secure-transfer-storage/hdi-deploy-to-azure1.png" alt="Deploy to Azure button for new cluster"></a>
3334

3435
2. Follow the instructions to create the cluster with the following specifications:
3536

articles/hdinsight/hdinsight-hadoop-customize-cluster-bootstrap.md

Lines changed: 3 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@ ms.date: 04/19/2019
1212

1313
# Customize HDInsight clusters using Bootstrap
1414

15-
Bootstrap scripts allow you to install and configure components in Azure HDInsight programmatically.
15+
Bootstrap scripts allow you to install and configure components in Azure HDInsight programmatically.
1616

1717
There are three approaches to set configuration file settings as your HDInsight cluster is created:
1818

@@ -52,7 +52,6 @@ The following PowerShell code customizes an [Apache Hive](https://hive.apache.or
5252
> [!IMPORTANT]
5353
> The parameter `Spark2Defaults` may need to be used with [Add-AzHDInsightConfigValue](https://docs.microsoft.com/powershell/module/az.hdinsight/add-azhdinsightconfigvalue). You can pass empty values to the parameter as shown in the code example below.
5454
55-
5655
```powershell
5756
# hive-site.xml configuration
5857
$hiveConfigValues = @{ "hive.metastore.client.socket.timeout"="90" }
@@ -125,9 +124,10 @@ You can use bootstrap in Resource Manager template:
125124
}
126125
```
127126

128-
![HDInsight Hadoop customizes cluster bootstrap Azure Resource Manager template](./media/hdinsight-hadoop-customize-cluster-bootstrap/hdinsight-customize-cluster-bootstrap-arm.png)
127+
![Hadoop customizes cluster bootstrap Azure Resource Manager template](./media/hdinsight-hadoop-customize-cluster-bootstrap/hdinsight-customize-cluster-bootstrap-arm.png)
129128

130129
## See also
130+
131131
* [Create Apache Hadoop clusters in HDInsight][hdinsight-provision-cluster] provides instructions on how to create an HDInsight cluster by using other custom options.
132132
* [Develop Script Action scripts for HDInsight][hdinsight-write-script]
133133
* [Install and use Apache Spark on HDInsight clusters][hdinsight-install-spark]
@@ -137,8 +137,6 @@ You can use bootstrap in Resource Manager template:
137137
[hdinsight-write-script]: hdinsight-hadoop-script-actions-linux.md
138138
[hdinsight-provision-cluster]: hdinsight-hadoop-provision-linux-clusters.md
139139
[powershell-install-configure]: /powershell/azureps-cmdlets-docs
140-
141-
142140
[img-hdi-cluster-states]: ./media/hdinsight-hadoop-customize-cluster/HDI-Cluster-state.png "Stages during cluster creation"
143141

144142
## Appendix: PowerShell sample
@@ -148,7 +146,6 @@ This PowerShell script creates an HDInsight cluster and customizes a Hive settin
148146
> [!WARNING]
149147
> Storage account kind `BlobStorage` cannot be used for HDInsight clusters.
150148
151-
152149
```powershell
153150
####################################
154151
# Set these variables

articles/hdinsight/hdinsight-hadoop-customize-cluster-linux.md

Lines changed: 22 additions & 24 deletions
Original file line numberDiff line numberDiff line change
@@ -40,12 +40,12 @@ Get more information on working with access management:
4040
A script action is Bash script that runs on the nodes in an HDInsight cluster. Characteristics and features of script actions are as follows:
4141

4242
* Must be stored on a URI that's accessible from the HDInsight cluster. The following are possible storage locations:
43-
43+
4444
* For regular clusters:
45-
45+
4646
* ADLS Gen1: The service principal HDInsight uses to access Data Lake Storage must have read access to the script. The URI format for scripts stored in Data Lake Storage Gen1 is
4747
`adl://DATALAKESTOREACCOUNTNAME.azuredatalakestore.net/path_to_file`.
48-
48+
4949
* A blob in an Azure Storage account that's either the primary or additional storage account for the HDInsight cluster. HDInsight is granted access to both of these types of storage accounts during cluster creation.
5050

5151
> [!IMPORTANT]
@@ -56,9 +56,9 @@ A script action is Bash script that runs on the nodes in an HDInsight cluster. C
5656
For example URIs, see [Example script action scripts](#example-script-action-scripts).
5757

5858
* For clusters with ESP:
59-
59+
6060
* The wasb:// or wasbs:// or http[s]:// URIs are supported.
61-
61+
6262
* Can be restricted to run on only certain node types. Examples are head nodes or worker nodes.
6363

6464
* Can be persisted or ad hoc.
@@ -103,15 +103,13 @@ The script runs while HDInsight is being configured. The script runs in parallel
103103
> [!NOTE]
104104
> You can perform operations like stopping and starting services, including Apache Hadoop-related services. If you stop services, make sure that the Ambari service and other Hadoop-related services are running before the script finishes. These services are required to successfully determine the health and state of the cluster while it's being created.
105105
106-
107106
During cluster creation, you can use many script actions at once. These scripts are invoked in the order in which they were specified.
108107

109108
> [!IMPORTANT]
110109
> Script actions must finish within 60 minutes, or they time out. During cluster provisioning, the script runs concurrently with other setup and configuration processes. Competition for resources such as CPU time or network bandwidth might cause the script to take longer to finish than it does in your development environment.
111110
>
112111
> To minimize the time it takes to run the script, avoid tasks like downloading and compiling applications from the source. Precompile applications and store the binary in Azure Storage.
113112
114-
115113
### Script action on a running cluster
116114

117115
A failure in a script run on an already running cluster doesn't automatically cause the cluster to change to a failed state. After a script finishes, the cluster should return to a running state.
@@ -157,13 +155,13 @@ This section explains the different ways you can use script actions when you cre
157155

158156
1. Start to create a cluster as described in [Set up clusters in HDInsight with Apache Hadoop, Apache Spark, Apache Kafka, and more](hdinsight-hadoop-provision-linux-clusters.md). During cluster creation, you arrive at a __Cluster summary__ page. From the __Cluster summary__ page, select the __edit__ link for __Advanced settings__.
159157

160-
![Advanced settings link](./media/hdinsight-hadoop-customize-cluster-linux/advanced-settings-link.png)
158+
![Azure portal cluster advanced settings](./media/hdinsight-hadoop-customize-cluster-linux/advanced-settings-link.png)
161159

162-
3. From the __Advanced settings__ section, select __Script actions__. From the __Script actions__ section, select __+ Submit new__.
160+
1. From the __Advanced settings__ section, select __Script actions__. From the __Script actions__ section, select __+ Submit new__.
163161

164-
![Submit a new script action](./media/hdinsight-hadoop-customize-cluster-linux/add-new-script-action.png)
162+
![Portal script actions submit new](./media/hdinsight-hadoop-customize-cluster-linux/add-new-script-action.png)
165163

166-
4. Use the __Select a script__ entry to select a premade script. To use a custom script, select __Custom__. Then provide the __Name__ and __Bash script URI__ for your script.
164+
1. Use the __Select a script__ entry to select a premade script. To use a custom script, select __Custom__. Then provide the __Name__ and __Bash script URI__ for your script.
167165

168166
![Add a script in the select script form](./media/hdinsight-hadoop-customize-cluster-linux/hdinsight-select-script.png)
169167

@@ -179,13 +177,13 @@ This section explains the different ways you can use script actions when you cre
179177

180178
Use the __Persist this script action__ entry to make sure that the script is applied during scaling operations.
181179

182-
5. Select __Create__ to save the script. Then you can use __+ Submit new__ to add another script.
180+
1. Select __Create__ to save the script. Then you can use __+ Submit new__ to add another script.
183181

184-
![Multiple script actions](./media/hdinsight-hadoop-customize-cluster-linux/multiple-scripts-actions.png)
182+
![HDInsight multiple script actions](./media/hdinsight-hadoop-customize-cluster-linux/multiple-scripts-actions.png)
185183

186184
When you're done adding scripts, select the __Select__ button and then the __Next__ button to return to the __Cluster summary__ section.
187185

188-
3. To create the cluster, select __Create__ from the __Cluster summary__ selection.
186+
1. To create the cluster, select __Create__ from the __Cluster summary__ selection.
189187

190188
### Use a script action from Azure Resource Manager templates
191189

@@ -245,7 +243,7 @@ Go to the [Azure portal](https://portal.azure.com):
245243

246244
![Add a script to a running cluster](./media/hdinsight-hadoop-customize-cluster-linux/add-script-running-cluster.png)
247245

248-
4. Use the __Select a script__ entry to select a premade script. To use a custom script, select __Custom__. Then provide the __Name__ and __Bash script URI__ for your script.
246+
1. Use the __Select a script__ entry to select a premade script. To use a custom script, select __Custom__. Then provide the __Name__ and __Bash script URI__ for your script.
249247

250248
![Add a script in the select script form](./media/hdinsight-hadoop-customize-cluster-linux/hdinsight-select-script.png)
251249

@@ -261,7 +259,7 @@ Go to the [Azure portal](https://portal.azure.com):
261259

262260
Use the __Persist this script action__ entry to make sure the script is applied during scaling operations.
263261

264-
5. Finally, select the **Create** button to apply the script to the cluster.
262+
1. Finally, select the **Create** button to apply the script to the cluster.
265263

266264
### Apply a script action to a running cluster from Azure PowerShell
267265

@@ -342,17 +340,17 @@ For an example of using the .NET SDK to apply scripts to a cluster, see [Apply a
342340
343341
1. From the default view, under **Settings**, select **Script actions**.
344342
345-
4. A history of scripts for this cluster displays on the script actions section. This information includes a list of persisted scripts. The following screenshot shows that the Solr script has been run on this cluster. The screenshot doesn't show any persisted scripts.
343+
1. A history of scripts for this cluster displays on the script actions section. This information includes a list of persisted scripts. The following screenshot shows that the Solr script has been run on this cluster. The screenshot doesn't show any persisted scripts.
346344

347-
![Script actions](./media/hdinsight-hadoop-customize-cluster-linux/script-action-history.png)
345+
![Portal script actions submit history](./media/hdinsight-hadoop-customize-cluster-linux/script-action-history.png)
348346

349-
5. Select a script from the history to display the **Properties** section for this script. From the top of the screen, you can rerun the script or promote it.
347+
1. Select a script from the history to display the **Properties** section for this script. From the top of the screen, you can rerun the script or promote it.
350348

351-
![Script actions, Properties](./media/hdinsight-hadoop-customize-cluster-linux/promote-script-actions.png)
349+
![Script actions properties promote](./media/hdinsight-hadoop-customize-cluster-linux/promote-script-actions.png)
352350

353-
6. You can also select the ellipsis, **...**, to the right of entries on the script actions section to perform actions.
351+
1. You can also select the ellipsis, **...**, to the right of entries on the script actions section to perform actions.
354352

355-
![Script actions, ellipsis](./media/hdinsight-hadoop-customize-cluster-linux/hdi-delete-promoted-sa.png)
353+
![Persisted script actions delete](./media/hdinsight-hadoop-customize-cluster-linux/hdi-delete-promoted-sa.png)
356354

357355
### Azure PowerShell
358356

@@ -411,7 +409,7 @@ Two types of open-source components are available in the HDInsight service:
411409
> [!WARNING]
412410
> Components provided with the HDInsight cluster are fully supported. Microsoft Support helps to isolate and resolve issues related to these components.
413411
>
414-
> Custom components receive commercially reasonable support to help you further troubleshoot the issue. Microsoft Support might be able to resolve the issue. Or they might ask you to engage available channels for the open-source technologies where deep expertise for that technology is found. Many community sites can be used. Examples are [MSDN forum for HDInsight](https://social.msdn.microsoft.com/Forums/azure/home?forum=hdinsight) and [Stack Overflow](https://stackoverflow.com).
412+
> Custom components receive commercially reasonable support to help you further troubleshoot the issue. Microsoft Support might be able to resolve the issue. Or they might ask you to engage available channels for the open-source technologies where deep expertise for that technology is found. Many community sites can be used. Examples are [MSDN forum for HDInsight](https://social.msdn.microsoft.com/Forums/azure/home?forum=hdinsight) and [Stack Overflow](https://stackoverflow.com).
415413
>
416414
> Apache projects also have project sites on the [Apache website](https://apache.org). An example is [Hadoop](https://hadoop.apache.org/).
417415

@@ -439,7 +437,7 @@ You can use the Ambari web UI to view information logged by script actions. If t
439437

440438
3. Find the entries that have **run\_customscriptaction** in the **Operations** column. These entries are created when the script actions run.
441439

442-
![Screenshot of operations](./media/hdinsight-hadoop-customize-cluster-linux/ambari-script-action.png)
440+
![Apache Ambari script action operations](./media/hdinsight-hadoop-customize-cluster-linux/ambari-script-action.png)
443441

444442
To view the **STDOUT** and **STDERR** output, select the **run\customscriptaction** entry and drill down through the links. This output is generated when the script runs and might have useful information.
445443
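
Note on the script action URI rules that appear as context in the last file: for clusters with ESP, the text lists `wasb://`, `wasbs://`, and `http[s]://` as the supported URI forms. The snippet below is a minimal sketch of a pre-flight check for that rule, assuming a scheme test is enough for a first pass. The names `ESP_SCHEMES` and `is_esp_script_uri` are hypothetical and are not part of any HDInsight tooling, and the check does not confirm that the cluster can actually reach the script.

```python
from urllib.parse import urlparse

# Schemes the documentation lists for clusters with ESP (assumption: a
# scheme check alone is a useful first-pass validation).
ESP_SCHEMES = {"wasb", "wasbs", "http", "https"}


def is_esp_script_uri(uri: str) -> bool:
    """Return True if the URI uses a scheme listed for ESP clusters
    and names a host (netloc) part."""
    parsed = urlparse(uri)
    return parsed.scheme in ESP_SCHEMES and bool(parsed.netloc)
```

A URI in the `adl://DATALAKESTOREACCOUNTNAME.azuredatalakestore.net/path_to_file` form shown for regular clusters is rejected by this check, which matches the ESP bullet as written in the diff context.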
