Skip to content

Commit 81130a7

Browse files
authored
Merge pull request #57902 from hrasheed-msft/hrasheed_apache4
Hrasheed apache4
2 parents d95f1a2 + 35c7531 commit 81130a7

File tree

59 files changed

+191
-200
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

59 files changed

+191
-200
lines changed

articles/hdinsight/domain-joined/apache-domain-joined-architecture.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,11 +18,11 @@ Many enterprises have moved toward a model in which clusters are managed by IT t
1818

1919
HDInsight relies on a popular identity provider--Active Directory--in a managed way. By integrating HDInsight with [Azure Active Directory Domain Services (Azure AD DS)](../../active-directory-domain-services/active-directory-ds-overview.md), you can access the clusters by using your domain credentials.
2020

21-
The virtual machines (VMs) in HDInsight are domain joined to your provided domain. So, all the services running on HDInsight (Ambari, Hive server, Ranger, Spark thrift server, and others) work seamlessly for the authenticated user. Administrators can then create strong authorization policies by using Apache Ranger to provide role-based access control for resources in the cluster.
21+
The virtual machines (VMs) in HDInsight are domain joined to your provided domain. So, all the services running on HDInsight (Apache Ambari, Apache Hive server, Apache Ranger, Apache Spark thrift server, and others) work seamlessly for the authenticated user. Administrators can then create strong authorization policies by using Apache Ranger to provide role-based access control for resources in the cluster.
2222

2323
## Integrate HDInsight with Active Directory
2424

25-
Open-source Hadoop relies on Kerberos for authentication and security. Therefore, HDInsight cluster nodes with Enterprise Security Package (ESP) are joined to a domain that's managed by Azure AD DS. Kerberos security is configured for the Hadoop components on the cluster.
25+
Open-source Apache Hadoop relies on Kerberos for authentication and security. Therefore, HDInsight cluster nodes with Enterprise Security Package (ESP) are joined to a domain that's managed by Azure AD DS. Kerberos security is configured for the Hadoop components on the cluster.
2626

2727
The following things are created automatically:
2828
- a service principal for each Hadoop component

articles/hdinsight/domain-joined/apache-domain-joined-configure-using-azure-adds.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ Enterprise Security Package (ESP) clusters provide multi-user access on Azure HD
1616
In this article, you learn how to configure a HDInsight cluster with ESP by using Azure Active Directory Domain Services (Azure AD-DS).
1717

1818
>[!NOTE]
19-
>ESP is GA in HDI 3.6 for Spark, Interactive, and Hadoop. ESP for HBase and Kafka cluster types is in preview.
19+
>ESP is GA in HDI 3.6 for Spark, Interactive, and Apache Hadoop. ESP for HBase and Kafka cluster types is in preview.
2020
2121
## Enable Azure AD-DS
2222

articles/hdinsight/domain-joined/apache-domain-joined-introduction.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: An introduction to Hadoop security with Enterprise Security Package
2+
title: An introduction to Apache Hadoop security with Enterprise Security Package
33
description: Learn how Enterprise Security Package supports the four pillars of enterprise security.
44
services: hdinsight
55
ms.service: hdinsight
@@ -10,13 +10,13 @@ ms.custom: hdinsightactive
1010
ms.topic: conceptual
1111
ms.date: 09/24/2018
1212
---
13-
# An introduction to Hadoop security with Enterprise Security Package
13+
# An introduction to Apache Hadoop security with Enterprise Security Package
1414

15-
In the past, Azure HDInsight supported only a single user: local admin. This worked great for smaller application teams or departments. As Hadoop-based workloads gained more popularity in the enterprise sector, the need for enterprise-grade capabilities like Active Directory-based authentication, multi-user support, and role-based access control became increasingly important.
15+
In the past, Azure HDInsight supported only a single user: local admin. This worked great for smaller application teams or departments. As Apache Hadoop-based workloads gained more popularity in the enterprise sector, the need for enterprise-grade capabilities like Active Directory-based authentication, multi-user support, and role-based access control became increasingly important.
1616

1717
You can create an HDInsight cluster with Enterprise Security Package (ESP) that's joined to an Active Directory domain. You can then configure a list of employees from the enterprise who can authenticate through Azure Active Directory to log on to the HDInsight cluster. No one from outside the enterprise can log on or access the HDInsight cluster.
1818

19-
The enterprise admin can configure role-based access control (RBAC) for Hive security by using [Apache Ranger](http://hortonworks.com/apache/ranger/). Configuring RBAC restricts data access to only what's needed. Finally, the admin can audit the data access by employees and any changes done to access control policies. The admin can then achieve a high degree of governance of their corporate resources.
19+
The enterprise admin can configure role-based access control (RBAC) for Apache Hive security by using [Apache Ranger](http://hortonworks.com/apache/ranger/). Configuring RBAC restricts data access to only what's needed. Finally, the admin can audit the data access by employees and any changes done to access control policies. The admin can then achieve a high degree of governance of their corporate resources.
2020

2121
> [!NOTE]
2222
> Oozie is now enabled on ESP clusters. To access the Oozie web UI, users should enable [tunneling](../hdinsight-linux-ambari-ssh-tunnel.md).

articles/hdinsight/domain-joined/apache-domain-joined-manage.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ Learn the users and the roles in HDInsight Enterprise Security Package (ESP), an
1616

1717
## Use VSCode to link to domain joined cluster
1818

19-
You can link a normal cluster by using Ambari managed username, also link a security hadoop cluster by using domain username (such as: [email protected]).
19+
You can link a normal cluster by using Apache Ambari managed username, also link a security Apache Hadoop cluster by using domain username (such as: [email protected]).
2020
1. Open the command palette by selecting **CTRL+SHIFT+P**, and then enter **HDInsight: Link a cluster**.
2121

2222
![link cluster command](./media/apache-domain-joined-manage/link-cluster-command.png)

articles/hdinsight/domain-joined/apache-domain-joined-run-hive.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -10,8 +10,8 @@ ms.custom: hdinsightactive
1010
ms.topic: conceptual
1111
ms.date: 09/24/2018
1212
---
13-
# Configure Hive policies in HDInsight with Enterprise Security Package
14-
Learn how to configure Apache Ranger policies for Hive. In this article, you create two Ranger policies to restrict access to the hivesampletable. The hivesampletable comes with HDInsight clusters. After you have configured the policies, you use Excel and ODBC driver to connect to Hive tables in HDInsight.
13+
# Configure Apache Hive policies in HDInsight with Enterprise Security Package
14+
Learn how to configure Apache Ranger policies for Apache Hive. In this article, you create two Ranger policies to restrict access to the hivesampletable. The hivesampletable comes with HDInsight clusters. After you have configured the policies, you use Excel and ODBC driver to connect to Hive tables in HDInsight.
1515

1616
## Prerequisites
1717
* A HDInsight cluster with Enterprise Security Package. See [Configure HDInsight clusters with ESP](apache-domain-joined-configure.md).

articles/hdinsight/domain-joined/apache-domain-joined-run-kafka.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: Configure Kafka policies in HDInsight with Enterprise Security Package - Azure
2+
title: Configure Apache Kafka policies in HDInsight with Enterprise Security Package - Azure
33
description: Learn how to configure Apache Ranger policies for Kafka in Azure HDInsight with Enterprise Security Package.
44
services: hdinsight
55
ms.service: hdinsight
@@ -10,9 +10,9 @@ ms.topic: tutorial
1010
ms.date: 09/24/2018
1111
---
1212

13-
# Tutorial: Configure Kafka policies in HDInsight with Enterprise Security Package (Preview)
13+
# Tutorial: Configure Apache Kafka policies in HDInsight with Enterprise Security Package (Preview)
1414

15-
Learn how to configure Apache Ranger policies for Enterprise Security Package (ESP) Kafka clusters. ESP clusters are connected to a domain allowing users to authenticate with domain credentials. In this tutorial, you create two Ranger policies to restrict access to `sales*` and `marketingspend` topics.
15+
Learn how to configure Apache Ranger policies for Enterprise Security Package (ESP) Apache Kafka clusters. ESP clusters are connected to a domain allowing users to authenticate with domain credentials. In this tutorial, you create two Ranger policies to restrict access to `sales*` and `marketingspend` topics.
1616

1717
In this tutorial, you learn how to:
1818

articles/hdinsight/domain-joined/hdinsight-use-oozie-domain-joined-clusters.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ ms.topic: conceptual
1111
ms.date: 09/24/2018
1212
---
1313
# Run Apache Oozie in HDInsight Hadoop clusters with Enterprise Security Package
14-
Oozie is a workflow and coordination system that manages Hadoop jobs. Oozie is integrated with the Hadoop stack, and it supports the following jobs:
14+
Apache Oozie is a workflow and coordination system that manages Apache Hadoop jobs. Oozie is integrated with the Hadoop stack, and it supports the following jobs:
1515
- Apache MapReduce
1616
- Apache Pig
1717
- Apache Hive

articles/hdinsight/hadoop/apache-hadoop-connect-excel-hive-odbc-driver.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: Connect Excel to Hadoop with the Hive ODBC Driver - Azure HDInsight
2+
title: Connect Excel to Apache Hadoop with the Hive ODBC Driver - Azure HDInsight
33
description: Learn how to set up and use the Microsoft Hive ODBC driver for Excel to query data in HDInsight clusters from Microsoft Excel.
44
keywords: hadoop excel,hive excel,hive odbc
55
services: hdinsight
@@ -11,7 +11,7 @@ ms.topic: conceptual
1111
ms.date: 05/16/2018
1212
ms.author: hrasheed
1313
---
14-
# Connect Excel to Hadoop in Azure HDInsight with the Microsoft Hive ODBC driver
14+
# Connect Excel to Apache Hadoop in Azure HDInsight with the Microsoft Hive ODBC driver
1515

1616
[!INCLUDE [ODBC-JDBC-selector](../../../includes/hdinsight-selector-odbc-jdbc.md)]
1717

@@ -33,7 +33,7 @@ Download and install Microsoft Hive ODBC Driver from the [Download Center][hive-
3333

3434
This driver can be installed on 32-bit or 64-bit versions of Windows 7, Windows 8, Windows 10, Windows Server 2008 R2, and Windows Server 2012. The driver allows connection to Azure HDInsight. You shall install the version that matches the version of the application where you use the ODBC driver. For this tutorial, the driver is used from Office Excel.
3535

36-
## Create Hive ODBC data source
36+
## Create Apache Hive ODBC data source
3737
The following steps show you how to create a Hive ODBC Data Source.
3838

3939
1. From Windows 8 or Windows 10, press the Windows key to open the Start screen, and then type **data sources**.

articles/hdinsight/hadoop/apache-hadoop-connect-excel-power-query.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: Connect Excel to Hadoop with Power Query - Azure HDInsight
2+
title: Connect Excel to Apache Hadoop with Power Query - Azure HDInsight
33
description: Learn how to take advantage of business intelligence components and use Power Query for Excel to access data stored in Hadoop on HDInsight.
44
services: hdinsight
55
author: hrasheed-msft
@@ -10,8 +10,8 @@ ms.custom: hdinsightactive
1010
ms.topic: conceptual
1111
ms.date: 05/16/2018
1212
---
13-
# Connect Excel to Hadoop by using Power Query
14-
One key feature of the Microsoft big-data solution is the integration of Microsoft business intelligence (BI) components with Hadoop clusters in Azure HDInsight. A primary example is the ability to connect Excel to the Azure Storage account that contains the data associated with your Hadoop cluster by using the Microsoft Power Query for Excel add-in. This article walks you through how to set up and use Power Query to query data associated with a Hadoop cluster managed with HDInsight.
13+
# Connect Excel to Apache Hadoop by using Power Query
14+
One key feature of the Microsoft big-data solution is the integration of Microsoft business intelligence (BI) components with Apache Hadoop clusters in Azure HDInsight. A primary example is the ability to connect Excel to the Azure Storage account that contains the data associated with your Hadoop cluster by using the Microsoft Power Query for Excel add-in. This article walks you through how to set up and use Power Query to query data associated with a Hadoop cluster managed with HDInsight.
1515

1616
### Prerequisites
1717
Before you begin this article, you must have the following items:

articles/hdinsight/hadoop/apache-hadoop-connect-hive-jdbc-driver.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
title: Query Hive through the JDBC driver - Azure HDInsight
3-
description: Use the JDBC driver from a Java application to submit Hive queries to Hadoop on HDInsight. Connect programmatically and from the SQuirrel SQL client.
2+
title: Query Apache Hive through the JDBC driver - Azure HDInsight
3+
description: Use the JDBC driver from a Java application to submit Apache Hive queries to Hadoop on HDInsight. Connect programmatically and from the SQuirrel SQL client.
44
services: hdinsight
55
author: hrasheed-msft
66
ms.reviewer: jasonh
@@ -12,11 +12,11 @@ ms.date: 04/02/2018
1212
ms.author: hrasheed
1313

1414
---
15-
# Query Hive through the JDBC driver in HDInsight
15+
# Query Apache Hive through the JDBC driver in HDInsight
1616

1717
[!INCLUDE [ODBC-JDBC-selector](../../../includes/hdinsight-selector-odbc-jdbc.md)]
1818

19-
Learn how to use the JDBC driver from a Java application to submit Hive queries to Hadoop in Azure HDInsight. The information in this document demonstrates how to connect programmatically and from the SQuirrel SQL client.
19+
Learn how to use the JDBC driver from a Java application to submit Apache Hive queries to Apache Hadoop in Azure HDInsight. The information in this document demonstrates how to connect programmatically and from the SQuirrel SQL client.
2020

2121
For more information on the Hive JDBC Interface, see [HiveJDBCInterface](https://cwiki.apache.org/confluence/display/Hive/HiveJDBCInterface).
2222

0 commit comments

Comments
 (0)