Skip to content

Commit 1e6ef33

Browse files
authored
Merge pull request #285963 from sreekzz/load-balancer
Updated Load balancer Pages
2 parents b1ffa71 + 550fb5f commit 1e6ef33

File tree

7 files changed

+52
-42
lines changed

7 files changed

+52
-42
lines changed

articles/hdinsight/hdinsight-faq.yml

Lines changed: 22 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -18,14 +18,14 @@ sections:
1818
- name: Creating or deleting HDInsight clusters
1919
questions:
2020
- question: |
21-
How do I provision an HDInsight cluster?
21+
How do I provision a HDInsight cluster?
2222
answer: |
2323
To review the HDInsight clusters types, and the provisioning methods, see [Set up clusters in HDInsight with Apache Hadoop, Apache Spark, Apache Kafka, and more](./hdinsight-hadoop-provision-linux-clusters.md).
2424
2525
- question: |
2626
How do I delete an existing HDInsight cluster?
2727
answer: |
28-
To learn more about deleting a cluster when it's no longer in use, see [Delete an HDInsight cluster](hdinsight-delete-cluster.md).
28+
To learn more about deleting a cluster when it's no longer in use, see [Delete a HDInsight cluster](hdinsight-delete-cluster.md).
2929
3030
Try to leave at least 30 to 60 minutes between create and delete operations. Otherwise the operation may fail with the following error message:
3131
@@ -39,9 +39,9 @@ sections:
3939
For more information, see [Capacity planning for HDInsight clusters](./hdinsight-capacity-planning.md).
4040
4141
- question: |
42-
What are the various types of nodes in an HDInsight cluster?
42+
What are the various types of nodes in a HDInsight cluster?
4343
answer: |
44-
See [Resource types in Azure HDInsight clusters](hdinsight-virtual-network-architecture.md#resource-types-in-azure-hdinsight-clusters).
44+
See [Resource types in Azure HDInsight clusters](hdinsight-virtual-network-architecture.md#resource-types-in-azure-hdinsight-cluster).
4545
4646
- question: |
4747
What are the best practices for creating large HDInsight clusters?
@@ -55,20 +55,20 @@ sections:
5555
- name: Individual Components
5656
questions:
5757
- question: |
58-
Can I install additional components on my cluster?
58+
Can I install more components on my cluster?
5959
answer: |
60-
Yes. To install additional components or customize cluster configuration, use:
60+
Yes. To install more components or customize cluster configuration, use:
6161
6262
- Scripts during or after creation. Scripts are invoked via [script action](./hdinsight-hadoop-customize-cluster-linux.md). Script action is a configuration option you can use from the Azure portal, HDInsight Windows PowerShell cmdlets, or the HDInsight .NET SDK. This configuration option can be used from the Azure portal, HDInsight Windows PowerShell cmdlets, or the HDInsight .NET SDK.
6363
6464
- [HDInsight Application Platform](https://azure.microsoft.com/services/hdinsight/partner-ecosystem/) to install applications.
6565
66-
For a list of supported components see [What are the Apache Hadoop components and versions available with HDInsight?](./hdinsight-component-versioning.md)
66+
For a list of supported components, see [What are the Apache Hadoop components and versions available with HDInsight?](./hdinsight-component-versioning.md)
6767
6868
- question: |
69-
Can I upgrade the individual components that are pre-installed on the cluster?
69+
Can I upgrade the individual components that are preinstalled on the cluster?
7070
answer: |
71-
If you upgrade built-in components or applications that are pre-installed on your cluster, the resulting configuration won't be supported by Microsoft. These system configurations have not been tested by Microsoft. Try to use a different version of the HDInsight cluster that may already have the upgraded version of the component pre-installed.
71+
If you upgrade built-in components or applications that are preinstalled on your cluster, the resulting configuration won't be supported by Microsoft. These system configurations haven't been tested by Microsoft. Try to use a different version of the HDInsight cluster that may already have the upgraded version of the component preinstalled.
7272
7373
For example, upgrading Hive as an individual component isn't supported. HDInsight is a managed service, and many services are integrated with Ambari server and tested. Upgrading a Hive on its own causes the indexed binaries of other components to change, and will cause component integration issues on your cluster.
7474
@@ -85,7 +85,7 @@ sections:
8585
8686
:::image type="content" source="media/hdinsight-faq/ambari-settings.png" alt-text="Ambari Settings.":::
8787
88-
3. In the User Settings window, select the new timezone from the Timezone drop down, and then click Save.
88+
3. In the User Settings window, select the new timezone from the Timezone drop down, and then select Save.
8989
9090
:::image type="content" source="media/hdinsight-faq/ambari-user-settings.png" alt-text="Ambari User Settings.":::
9191
@@ -110,7 +110,7 @@ sections:
110110
- question: |
111111
Does migrating a Hive metastore also migrate the default policies of the Ranger database?
112112
answer: |
113-
No, the policy definition is in the Ranger database, so migrating the Ranger database will migrate its policy.
113+
No, the policy definition is in the Ranger database, so migrating the Ranger database migrates its policy.
114114
115115
- question: |
116116
Can you migrate a Hive metastore from an Enterprise Security Package (ESP) cluster to a non-ESP cluster, and the other way around?
@@ -148,9 +148,9 @@ sections:
148148
- [HDInsight management IP addresses](./hdinsight-management-ip-addresses.md)
149149
150150
- question: |
151-
Can I deploy an additional virtual machine within the same subnet as an HDInsight cluster?
151+
Can I deploy more virtual machine within the same subnet as a HDInsight cluster?
152152
answer: |
153-
Yes, you can deploy an additional virtual machine within the same subnet as an HDInsight cluster. The following configurations are possible:
153+
Yes, you can deploy more virtual machine within the same subnet as a HDInsight cluster. The following configurations are possible:
154154
155155
- Edge nodes: You can add another edge node to the cluster, as described in [Use empty edge nodes on Apache Hadoop clusters in HDInsight](hdinsight-apps-use-edge-node.md).
156156
@@ -175,7 +175,7 @@ sections:
175175
For information on malware protection, see [Microsoft Antimalware for Azure Cloud Services and Virtual Machines](../security/fundamentals/antimalware.md).
176176
177177
- question: |
178-
How do I create a keytab for an HDInsight ESP cluster?
178+
How do I create a keytab for a HDInsight ESP cluster?
179179
answer: |
180180
Create a Kerberos keytab for your domain username. You can later use this keytab to authenticate to remote domain-joined clusters without entering a password. The domain name is uppercase:
181181
@@ -196,7 +196,7 @@ sections:
196196
- question: |
197197
How do I determine the proper SALT value?
198198
answer: |
199-
1. Use an interactive Kerberos login to determine the proper salt value for the keytab. Interactive Kerberos login will use the highest encryption by default. Tracing should be enabled to observe the salt. Below is a sample Kerberos login:
199+
1. Use an interactive Kerberos sign-in to determine the proper salt value for the keytab. Interactive Kerberos sign-in uses the highest encryption by default. Tracing should be enabled to observe the salt. Below is a sample Kerberos sign-in:
200200
201201
```shell
202202
@@ -215,9 +215,9 @@ sections:
215215
```
216216
217217
- question: |
218-
Can I use an existing Microsoft Entra tenant to create an HDInsight cluster that has the ESP?
218+
Can I use an existing Microsoft Entra tenant to create a HDInsight cluster that has the ESP?
219219
answer: |
220-
Enable Microsoft Entra Domain Services before you can create an HDInsight cluster with ESP. Open-source Hadoop relies on Kerberos for Authentication (as opposed to OAuth).
220+
Enable Microsoft Entra Domain Services before you can create a HDInsight cluster with ESP. Open-source Hadoop relies on Kerberos for Authentication (as opposed to OAuth).
221221
222222
To join VMs to a domain, you must have a domain controller. Microsoft Entra Domain Services is the managed domain controller, and is considered an extension of Microsoft Entra ID. Microsoft Entra Domain Services provides all the Kerberos requirements to build a secure Hadoop cluster in a managed way. HDInsight as a managed service integrates with Microsoft Entra Domain Services to provide security.
223223
@@ -236,7 +236,7 @@ sections:
236236
No, DAS is not supported on ESP clusters.
237237
238238
- question: |
239-
How can I pull login activity shown in Ranger?
239+
How can I pull sign-in activity shown in Ranger?
240240
answer: |
241241
For auditing requirements, Microsoft recommends enabling Azure Monitor logs as described in [Use Azure Monitor logs to monitor HDInsight clusters](./hdinsight-hadoop-oms-log-analytics-tutorial.md).
242242
@@ -301,7 +301,7 @@ sections:
301301
To audit blob storage accounts, configure monitoring using the procedure at [Monitor a storage account in the Azure portal](../storage/common/manage-storage-analytics-logs.md). An HDFS-audit log provides only auditing information for the local HDFS filesystem only (hdfs://mycluster). It doesn't include operations that are done on remote storage.
302302
303303
- question: |
304-
How can I transfer files between a blob container and an HDInsight head node?
304+
How can I transfer files between a blob container and a HDInsight head node?
305305
answer: |
306306
Run a script similar to the following shell script on your head node:
307307
@@ -394,12 +394,12 @@ sections:
394394
```
395395
396396
> [!NOTE]
397-
> Curl prompts you for a password. You must enter a valid password for the cluster login username.
397+
> Curl prompts you for a password. You must enter a valid password for the cluster sign-in username.
398398
399399
- name: Billing
400400
questions:
401401
- question: |
402-
How much does it cost to deploy an HDInsight cluster?
402+
How much does it cost to deploy a HDInsight cluster?
403403
answer: |
404404
For more information about pricing and FAQ related to billing, see the [Azure HDInsight Pricing](https://azure.microsoft.com/pricing/details/hdinsight/) page.
405405
@@ -422,7 +422,7 @@ sections:
422422
- name: Hive
423423
questions:
424424
- question: |
425-
Why does the Hive version appear as 1.2.1000 instead of 2.1 in the Ambari UI even though I'm running an HDInsight 3.6 cluster?
425+
Why does the Hive version appear as 1.2.1000 instead of 2.1 in the Ambari UI even though I'm running a HDInsight 3.6 cluster?
426426
answer: |
427427
Although only 1.2 appears in the Ambari UI, HDInsight 3.6 contains both Hive 1.2 and Hive 2.1.
428428

articles/hdinsight/hdinsight-plan-virtual-network-deployment.md

Lines changed: 11 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Learn how to plan an Azure Virtual Network deployment to connect HD
44
ms.service: azure-hdinsight
55
ms.topic: conceptual
66
ms.custom: hdinsightactive
7-
ms.date: 09/06/2024
7+
ms.date: 09/19/2024
88
---
99

1010
# Plan a virtual network for Azure HDInsight
@@ -26,7 +26,7 @@ The following are the questions that you must answer when planning to install HD
2626

2727
* Do you need to install HDInsight into an existing virtual network? Or are you creating a new network?
2828

29-
If you're using an existing virtual network, you may need to modify the network configuration before you can install HDInsight. For more information, see the [add HDInsight to an existing virtual network](#existingvnet) section.
29+
If you're using an existing virtual network, you may need to modify the network configuration before you can install HDInsight. For more information, see the [added HDInsight to an existing virtual network](#existingvnet) section.
3030

3131
* Do you want to connect the virtual network containing HDInsight to another virtual network or your on-premises network?
3232

@@ -56,7 +56,7 @@ Use the steps in this section to discover how to add a new HDInsight to an exist
5656

5757
As a managed service, HDInsight requires unrestricted access to several IP addresses in the Azure data center. To allow communication with these IP addresses, update any existing network security groups or user-defined routes.
5858

59-
HDInsight hosts multiple services, which use a variety of ports. Don't block traffic to these ports. For a list of ports to allow through virtual appliance firewalls, see the Security section.
59+
HDInsight hosts multiple services, which use various ports. Don't block traffic to these ports. For a list of ports to allow through virtual appliance firewalls, see the Security section.
6060

6161
To find your existing security configuration, use the following Azure PowerShell or Azure CLI commands:
6262

@@ -72,7 +72,7 @@ Use the steps in this section to discover how to add a new HDInsight to an exist
7272
az network nsg list --resource-group RESOURCEGROUP
7373
```
7474
75-
For more information, see the [Troubleshoot network security groups](../virtual-network/diagnose-network-traffic-filter-problem.md) document.
75+
For more information, see [Troubleshoot network security groups](../virtual-network/diagnose-network-traffic-filter-problem.md) document.
7676
7777
> [!IMPORTANT]
7878
> Network security group rules are applied in order based on rule priority. The first rule that matches the traffic pattern is applied, and no others are applied for that traffic. Order rules from most permissive to least permissive. For more information, see the [Filter network traffic with network security groups](../virtual-network/network-security-groups-overview.md) document.
@@ -89,9 +89,9 @@ Use the steps in this section to discover how to add a new HDInsight to an exist
8989
az network route-table list --resource-group RESOURCEGROUP
9090
```
9191
92-
For more information, see the [Troubleshoot routes](../virtual-network/diagnose-network-routing-problem.md) document.
92+
For more information, see the [Diagnose a virtual machine routing problem](../virtual-network/diagnose-network-routing-problem.md) document.
9393
94-
3. Create an HDInsight cluster and select the Azure Virtual Network during configuration. Use the steps in the following documents to understand the cluster creation process:
94+
3. Create a HDInsight cluster and select the Azure Virtual Network during configuration. Use the steps in the following documents to understand the cluster creation process:
9595
9696
* [Create HDInsight using the Azure portal](hdinsight-hadoop-create-linux-clusters-portal.md)
9797
* [Create HDInsight using Azure PowerShell](hdinsight-hadoop-create-linux-clusters-azure-powershell.md)
@@ -159,7 +159,7 @@ For more information, see the [Name Resolution for VMs and Role Instances](../vi
159159
160160
## Directly connect to Apache Hadoop services
161161
162-
You can connect to the cluster at `https://CLUSTERNAME.azurehdinsight.net`. This address uses a public IP, which may not be reachable if you have used NSGs to restrict incoming traffic from the internet. Additionally, when you deploy the cluster in a VNet you can access it using the private endpoint `https://CLUSTERNAME-int.azurehdinsight.net`. This endpoint resolves to a private IP inside the VNet for cluster access.
162+
You can connect to the cluster at `https://CLUSTERNAME.azurehdinsight.net`. This address uses a public IP, which may not be reachable if you have used NSGs to restrict incoming traffic from the internet. Additionally, when you deploy the cluster in a virtual network you can access it using the private endpoint `https://CLUSTERNAME-int.azurehdinsight.net`. This endpoint resolves to a private IP inside the virtual network for cluster access.
163163
164164
To connect to Apache Ambari and other web pages through the virtual network, use the following steps:
165165
@@ -194,9 +194,11 @@ To connect to Apache Ambari and other web pages through the virtual network, use
194194
195195
## Load balancing
196196
197-
When you create an HDInsight cluster, a load balancer is created as well. The type of this load balancer is at the [basic SKU level](../load-balancer/skus.md), which has certain constraints. One of these constraints is that if you have two virtual networks in different regions, you cannot connect to basic load balancers. See [virtual networks FAQ: constraints on global vnet peering](../virtual-network/virtual-networks-faq.md#what-are-the-constraints-related-to-global-virtual-network-peering-and-load-balancers), for more information.
197+
When you create a HDInsight cluster, several load balancers are created as well. Due to the [retirement of the basic load balancer](https://azure.microsoft.com/updates/azure-basic-load-balancer-will-be-retired-on-30-september-2025-upgrade-to-standard-load-balancer/), the type of load balancers is at the [standard SKU level](/azure/load-balancer/skus), which has certain constraints. Inbound flows to the standard load balancers are closed unless allowed  by a network security group. You may need to bond a network security to your subnet and configure the network security rules.
198198
199-
Another constraint is that the HDInsight load balancers should not be deleted or modified. **Any changes to the load balancer rules will get overwritten during certain maintenance events such as certificate renewals.** If the load balancers are modified and it affects the cluster functionality, you may need to recreate the cluster.
199+
There are [several outbound connectivity methods](/azure/load-balancer/load-balancer-outbound-connections) enabled for the standard load balancer. It’s worth noting that the default outbound access will be retired soon. If a NAT gateway is adopted to provide outbound network access, the subnet is not capable with the basic load balancer. If you intend to bond a NAT gateway to a subnet, there should be no basic load balancer existed in this subnet. With the NAT gateway as the outbound access method, a newly created HDInsight cluster can't share the same subnet with previously created HDInsight clusters with basic load balancers.
200+
201+
Another constraint is that the HDInsight load balancers shouldn't be deleted or modified. **Any changes to the load balancer rules will get overwritten during certain maintenance events such as certificate renewals.** If the load balancers are modified and it affects the cluster functionality, you may need to recreate the cluster.
200202
201203
## Next steps
202204

articles/hdinsight/hdinsight-restrict-public-connectivity.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Learn how to remove access to all outbound public IP addresses.
44
ms.service: azure-hdinsight
55
ms.custom: devx-track-azurepowershell
66
ms.topic: conceptual
7-
ms.date: 01/04/2024
7+
ms.date: 09/19/2024
88
---
99

1010
# Restrict public connectivity in Azure HDInsight
@@ -15,7 +15,7 @@ If you want public connectivity between your HDInsight cluster and dependent res
1515

1616
The following diagram shows what a potential HDInsight virtual network architecture might look like when `resourceProviderConnection` is set to *outbound*:
1717

18-
:::image type="content" source="media/hdinsight-private-link/outbound-resource-provider-connection-only.png" alt-text="Diagram of the HDInsight architecture using an outbound resource provider connection.":::
18+
:::image type="content" source="./media/hdinsight-restrict-public-connectivity/outbound-resource-provider-connection-only.svg" alt-text="Diagram showing the HDInsight architecture using an outbound resource provider connection." border="true" lightbox="./media/hdinsight-restrict-public-connectivity/outbound-resource-provider-connection-only.svg":::
1919

2020
> [!NOTE]
2121
> Restricting public connectivity is a prerequisite for enabling Private Link and shouldn't be considered the same capability.

0 commit comments

Comments
 (0)