Skip to content

Commit 0d99de3

Browse files
authored
Merge pull request #278248 from sreekzz/Freshness-MS-Date-Change
Freshness ms date change
2 parents 7618fc7 + ba7694a commit 0d99de3

File tree

46 files changed

+104
-105
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

46 files changed

+104
-105
lines changed

articles/hdinsight/cluster-management-best-practices.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Learn best practices for managing HDInsight clusters.
44
ms.service: hdinsight
55
ms.custom: hdinsightactive
66
ms.topic: conceptual
7-
ms.date: 06/12/2023
7+
ms.date: 06/14/2024
88
---
99
# HDInsight cluster management best practices
1010

articles/hdinsight/domain-joined/apache-domain-joined-create-configure-enterprise-security-cluster.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Learn how to create and configure Enterprise Security Package clust
44
services: hdinsight
55
ms.service: hdinsight
66
ms.topic: how-to
7-
ms.date: 06/22/2023
7+
ms.date: 06/14/2024
88
ms.custom: devx-track-azurepowershell
99
---
1010

@@ -164,7 +164,7 @@ Create an Active Directory tenant administrator.
164164

165165
**Groups and roles**
166166
1. Select **0 groups selected**.
167-
1. Select **AAD DC Administrators**, and then **Select**.
167+
1. Select **`AAD DC` Administrators**, and then **Select**.
168168

169169
:::image type="content" source="./media/apache-domain-joined-create-configure-enterprise-security-cluster/azure-ad-add-group-member.png" alt-text="The Microsoft Entra groups dialog box." border="true":::
170170

@@ -266,7 +266,7 @@ Follow these steps to enable Microsoft Entra Domain Services. For more informati
266266
267267
:::image type="content" source="./media/apache-domain-joined-create-configure-enterprise-security-cluster/hdinsight-image-0086.png" alt-text="Create virtual network step." border="true":::
268268
269-
1. On the **Administrator group** page, you should see a notification that a group named **AAD DC Administrators** has already been created to administer this group. You can modify the membership of this group if you want to, but in this case you don't need to change it. Select **OK**.
269+
1. On the **Administrator group** page, you should see a notification that a group named **`AAD DC` Administrators** has already been created to administer this group. You can modify the membership of this group if you want to, but in this case you don't need to change it. Select **OK**.
270270
271271
:::image type="content" source="./media/apache-domain-joined-create-configure-enterprise-security-cluster/hdinsight-image-0088.png" alt-text="View the Microsoft Entra administrator group." border="true":::
272272

articles/hdinsight/domain-joined/identity-broker.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ title: Azure HDInsight ID Broker (HIB)
33
description: Learn about Azure HDInsight ID Broker to simplify authentication for domain-joined Apache Hadoop clusters.
44
ms.service: hdinsight
55
ms.topic: how-to
6-
ms.date: 06/05/2023
6+
ms.date: 06/14/2024
77
---
88

99
# Azure HDInsight ID Broker (HIB)

articles/hdinsight/hadoop/apache-hadoop-connect-excel-power-query.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Learn how to take advantage of business intelligence components and
44
ms.service: hdinsight
55
ms.topic: how-to
66
ms.custom: hdinsightactive
7-
ms.date: 06/12/2023
7+
ms.date: 06/14/2024
88
---
99

1010
# Connect Excel to Apache Hadoop by using Power Query

articles/hdinsight/hadoop/apache-hadoop-develop-deploy-java-mapreduce-linux.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Learn how to use Apache Maven to create a Java-based MapReduce appl
44
ms.service: hdinsight
55
ms.topic: how-to
66
ms.custom: hdinsightactive, devx-track-java, devx-track-extended-java, linux-related-content
7-
ms.date: 06/22/2023
7+
ms.date: 06/14/2024
88
---
99

1010
# Develop Java MapReduce programs for Apache Hadoop on HDInsight

articles/hdinsight/hadoop/apache-hadoop-hive-pig-udf-dotnet-csharp.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Learn how to use C# user-defined functions (UDF) with Apache Hive a
44
ms.service: hdinsight
55
ms.topic: how-to
66
ms.custom: hdinsightactive, devx-track-csharp, devx-track-dotnet
7-
ms.date: 06/09/2023
7+
ms.date: 06/13/2024
88
---
99

1010
# Use C# user-defined functions with Apache Hive and Apache Pig on Apache Hadoop in HDInsight
@@ -14,7 +14,7 @@ Learn how to use C# user-defined functions (UDF) with [Apache Hive](https://hive
1414
> [!IMPORTANT]
1515
> The steps in this document work with Linux-based HDInsight clusters. Linux is the only operating system used on HDInsight version 3.4 or greater. For more information, see [HDInsight component versioning](../hdinsight-component-versioning.md).
1616
17-
Both Hive and Pig can pass data to external applications for processing. This process is known as _streaming_. When using a .NET application, the data is passed to the application on STDIN, and the application returns the results on STDOUT. To read and write from STDIN and STDOUT, you can use `Console.ReadLine()` and `Console.WriteLine()` from a console application.
17+
Both Hive and Pig can pass data to external applications for processing. This process is known as _streaming_. When you use a .NET application, the data is passed to the application on STDIN, and the application returns the results on STDOUT. To read and write from STDIN and STDOUT, you can use `Console.ReadLine()` and `Console.WriteLine()` from a console application.
1818

1919
## Prerequisites
2020

@@ -178,7 +178,7 @@ Next, upload the Hive and Pig UDF applications to storage on a HDInsight cluster
178178

179179
:::image type="content" source="./media/apache-hadoop-hive-pig-udf-dotnet-csharp/hdinsight-upload-icon.png" alt-text="HDInsight upload icon for new project." border="true":::
180180

181-
In the **Upload New File** dialog box, under **File name**, select **Browse**. In the **Upload Blob** dialog box, go to the *bin\debug* folder for the *HiveCSharp* project, and then choose the *HiveCSharp.exe* file. Finally, select **Open** and then **OK** to complete the upload.
181+
In the **Upload New File** dialog box, under **File name**, select **Browse**. In the **Upload Blob** dialog box, go to the `bin\debug` folder for the *HiveCSharp* project, and then choose the *HiveCSharp.exe* file. Finally, select **Open** and then **OK** to complete the upload.
182182

183183
* If you're using **Azure Data Lake Storage**, right-click an empty area in the file listing, and then select **Upload**. Finally, choose the *HiveCSharp.exe* file and select **Open**.
184184

@@ -249,7 +249,7 @@ You can also run a Pig job that uses your Pig UDF application.
249249
> [!NOTE]
250250
> The application name that is used for streaming must be surrounded by the `` ` `` (backtick) character when aliased, and by the `'` (single quote) character when used with `SHIP`.
251251

252-
4. After entering the last line, the job should start. It returns output similar to the following text:
252+
4. Once you enter the last line, the job should start. It returns output similar to the following text:
253253

254254
```output
255255
(2019-07-15 16:43:25 SampleClass5 [WARN] problem finding id 1358451042 - java.lang.Exception)

articles/hdinsight/hadoop/apache-hadoop-on-premises-migration-best-practices-data-migration.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Learn data migration best practices for migrating on-premises Hadoo
44
ms.service: hdinsight
55
ms.topic: how-to
66
ms.custom: hdinsightactive
7-
ms.date: 06/22/2023
7+
ms.date: 06/14/2024
88
---
99

1010
# Migrate on-premises Apache Hadoop clusters to Azure HDInsight - data migration best practices
@@ -52,7 +52,7 @@ DistCp tries to create map tasks so that each one copies roughly the same number
5252

5353
* DistCp's lowest granularity is a single file. Specifying a number of Mappers more than the number of source files doesn't help and will waste the available cluster resources.
5454

55-
* Consider the available Yarn memory on the cluster to determine the number of Mappers. Each Map task is launched as a Yarn container. Assuming that no other heavy workloads are running on the cluster, the number of Mappers can be determined by the following formula: m = (number of worker nodes \* YARN memory for each worker node) / YARN container size. However, If other applications are using memory, then choose to only use a portion of YARN memory for DistCp jobs.
55+
* Consider the available Yarn memory on the cluster to determine the number of Mappers. Each Map task is launched as a Yarn container. Assuming that no other heavy workloads are running on the cluster, the number of Mappers can be determined by the following formula: m = (number of worker nodes \* YARN memories for each worker node) / YARN container size. However, If other applications are using memory, then choose to only use a portion of YARN memory for DistCp jobs.
5656

5757
### Use more than one DistCp job
5858

articles/hdinsight/hadoop/connect-install-beeline.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ title: Connect to HiveServer2 using Beeline or install Beeline locally to connec
33
description: Learn how to connect to the Apache Beeline client to run Hive queries with Hadoop on HDInsight. Beeline is a utility for working with HiveServer2 over JDBC.
44
ms.service: hdinsight
55
ms.topic: how-to
6-
ms.date: 06/12/2023
6+
ms.date: 06/14/2024
77
---
88
# Connect to HiveServer2 using Beeline or install Beeline locally to connect from your local
99

@@ -13,25 +13,25 @@ ms.date: 06/12/2023
1313

1414
### From an SSH session
1515

16-
When connecting from an SSH session to a cluster headnode, you can then connect to the `headnodehost` address on port `10001`:
16+
When you connect from an SSH session to a cluster headnode, you can then connect to the `headnodehost` address on port `10001`:
1717

1818
```bash
1919
beeline -u 'jdbc:hive2://headnodehost:10001/;transportMode=http'
2020
```
2121

2222
### Over an Azure Virtual Network
2323

24-
When connecting from a client to HDInsight over an Azure Virtual Network, you must provide the fully qualified domain name (FQDN) of a cluster head node. Since this connection is made directly to the cluster nodes, the connection uses port `10001`:
24+
When you connect from a client to HDInsight over an Azure Virtual Network, you must provide the fully qualified domain name (FQDN) of a cluster head node. Since this connection is made directly to the cluster nodes, the connection uses port `10001`:
2525

2626
```bash
2727
beeline -u 'jdbc:hive2://<headnode-FQDN>:10001/;transportMode=http'
2828
```
2929

30-
Replace `<headnode-FQDN>` with the fully qualified domain name of a cluster headnode. To find the fully qualified domain name of a headnode, use the information in the [Manage HDInsight using the Apache Ambari REST API](../hdinsight-hadoop-manage-ambari-rest-api.md#get-the-fqdn-of-cluster-nodes) document.
30+
Replace `<headnode-FQDN>` with the fully qualified domain name of a cluster headnode. To find the fully qualified domain name of a headnode, use the information in the [Managed HDInsight using the Apache Ambari REST API](../hdinsight-hadoop-manage-ambari-rest-api.md#get-the-fqdn-of-cluster-nodes) document.
3131

3232
### To HDInsight Enterprise Security Package (ESP) cluster using Kerberos
3333

34-
When connecting from a client to an Enterprise Security Package (ESP) cluster joined to Microsoft Entra Domain Services on a machine in same realm of the cluster, you must also specify the domain name `<AAD-Domain>` and the name of a domain user account with permissions to access the cluster `<username>`:
34+
When you connect from a client to an Enterprise Security Package (ESP) cluster joined to Microsoft Entra Domain Services on a machine in same realm of the cluster, you must also specify the domain name `<AAD-Domain>` and the name of a domain user account with permissions to access the cluster `<username>`:
3535

3636
```bash
3737
kinit <username>
@@ -48,15 +48,15 @@ To find the JDBC URL from Ambari:
4848

4949
### Over public or private endpoints
5050

51-
When connecting to a cluster using the public or private endpoints, you must provide the cluster login account name (default `admin`) and password. For example, using Beeline from a client system to connect to the `clustername.azurehdinsight.net` address. This connection is made over port `443`, and is encrypted using TLS/SSL.
51+
When you connect to a cluster using the public or private endpoints, you must provide the cluster login account name (default `admin`) and password. For example, using Beeline from a client system to connect to the `clustername.azurehdinsight.net` address. This connection is made over port `443`, and is encrypted using TLS/SSL.
5252

5353
Replace `clustername` with the name of your HDInsight cluster. Replace `admin` with the cluster login account for your cluster. For ESP clusters, use the full UPN (for example, [email protected]). Replace `password` with the password for the cluster login account.
5454

5555
```bash
5656
beeline -u 'jdbc:hive2://clustername.azurehdinsight.net:443/;ssl=true;transportMode=http;httpPath=/hive2' -n admin -p 'password'
5757
```
5858

59-
or for private endpoint:
59+
Or for private endpoint:
6060

6161
```bash
6262
beeline -u 'jdbc:hive2://clustername-int.azurehdinsight.net:443/;ssl=true;transportMode=http;httpPath=/hive2' -n admin -p 'password'
@@ -78,7 +78,7 @@ The connection string used is slightly different. Instead of containing `httpPa
7878
beeline -u 'jdbc:hive2://clustername.azurehdinsight.net:443/;ssl=true;transportMode=http;httpPath=/sparkhive2' -n admin -p 'password'
7979
```
8080

81-
or for private endpoint:
81+
Or for private endpoint:
8282

8383
```bash
8484
beeline -u 'jdbc:hive2://clustername-int.azurehdinsight.net:443/;ssl=true;transportMode=http;httpPath=/sparkhive2' -n admin -p 'password'
@@ -88,7 +88,7 @@ Private endpoints point to a basic load balancer, which can only be accessed fro
8888

8989
#### From cluster head node or inside Azure Virtual Network with Apache Spark
9090

91-
When connecting directly from the cluster head node, or from a resource inside the same Azure Virtual Network as the HDInsight cluster, port `10002` should be used for Spark Thrift server instead of `10001`. The following example shows how to connect directly to the head node:
91+
When you connect directly from the cluster head node, or from a resource inside the same Azure Virtual Network as the HDInsight cluster, port `10002` should be used for Spark Thrift server instead of `10001`. The following example shows how to connect directly to the head node:
9292

9393
```bash
9494
/usr/hdp/current/spark2-client/bin/beeline -u 'jdbc:hive2://headnodehost:10002/;transportMode=http'
@@ -150,7 +150,7 @@ Although Beeline is included on the head nodes, you may want to install it local
150150
151151
1. Then press **Ctrl+X**, then **Y**, then enter.
152152
153-
1. Close and then reopen you bash session.
153+
1. Close and then reopen your bash session.
154154
155155
1. Test your connection. Use the connection format from [Over public or private endpoints](#over-public-or-private-endpoints), above.
156156

articles/hdinsight/hadoop/hdinsight-troubleshoot-converting-service-principal-certificate.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ title: Converting certificate contents to base-64 - Azure HDInsight
33
description: Converting service principal certificate contents to base-64 encoded string format in Azure HDInsight
44
ms.service: hdinsight
55
ms.topic: troubleshooting
6-
ms.date: 06/12/2023
6+
ms.date: 06/14/2024
77
---
88

99
# Converting service principal certificate contents to base-64 encoded string format in HDInsight

articles/hdinsight/hadoop/hdinsight-troubleshoot-out-disk-space.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ title: Cluster node runs out of disk space in Azure HDInsight
33
description: Troubleshooting Apache Hadoop cluster node disk space issues in Azure HDInsight.
44
ms.service: hdinsight
55
ms.topic: troubleshooting
6-
ms.date: 06/12/2023
6+
ms.date: 06/14/2024
77
---
88

99
# Scenario: Cluster node runs out of disk space in Azure HDInsight

0 commit comments

Comments
 (0)