Skip to content

Commit c24f14a

Browse files
authored
Merge pull request #228159 from sreekzz/patch-146
Added HDInsight 5.1 row.
2 parents 412d49d + 9a7dadd commit c24f14a

File tree

4 files changed

+108
-66
lines changed

4 files changed

+108
-66
lines changed

articles/hdinsight/TOC.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -49,6 +49,8 @@ items:
4949
href: ./hdinsight-overview-versioning.md
5050
- name: HDInsight versions
5151
href: ./hdinsight-component-versioning.md
52+
- name: HDInsight version 5.1
53+
href: ./hdinsight-51-component-versioning.md
5254
- name: HDInsight version 5.0
5355
href: ./hdinsight-50-component-versioning.md
5456
- name: HDInsight version 4.0
Lines changed: 59 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,59 @@
1+
---
2+
title: Open-source components and versions - Azure HDInsight 5.1
3+
description: Learn about the open-source components and versions in Azure HDInsight 5.1
4+
ms.service: hdinsight
5+
ms.topic: conceptual
6+
ms.date: 02/28/2023
7+
---
8+
9+
# HDInsight 5.1 component versions
10+
11+
In this article, you learn about the open-source components and their versions in Azure HDInsight 5.1.
12+
13+
From February 27, 2023 we have started rolling out a new version of HDInsight 5.1, this version is backward compatible with HDInsight 4.0. and 5.0. All new open-source releases added as incremental releases on HDInsight 5.1.
14+
15+
## Open-source components available with HDInsight version 5.1
16+
17+
The Open-source component versions associated with HDInsight 5.1 listed in the following table.
18+
19+
| Component | HDInsight 5.1 | HDInsight 5.0 |
20+
|------------------|---------------|---------------|
21+
| Apache Spark | 3.3 ** | 3.1.2 |
22+
| Apache Hive | 3.1.2 * | 3.1.2 |
23+
| Apache Kafka | 3.2.0 ** | 2.4.1 |
24+
| Apache Hadoop with YARN | 3.3.4 * | 3.1.1 |
25+
| Apache Tez | 0.9.1 * | 0.9.1 |
26+
| Apache Pig | 0.17.0 * | 0.16.1 |
27+
| Apache Ranger | 2.1.0 * | 1.1.0 |
28+
| Apache HBase | 2.4.11 ** | - |
29+
| Apache Sqoop | 1.5.0 * | 1.5.0 |
30+
| Apache Oozie | 5.2.1 * | 4.3.1 |
31+
| Apache Zookeeper | 3.6.3 ** | 3.4.6 |
32+
| Apache Livy | 0.7.1 * | 0.5 |
33+
| Apache Ambari | 2.7.0 ** | 2.7.0 |
34+
| Apache Zeppelin | 0.10.0 * | 0.8.0 |
35+
| Apache Phoenix | 5.1.2 ** | - |
36+
37+
\* Under development/Planned
38+
39+
** Public Preview
40+
41+
## Spark versions supported in Azure HDInsight
42+
43+
Apache Spark versions supported in Azure HDIinsight
44+
45+
|Apache Spark version on HDInsight|Release date|Release stage|End of life announcement date|[End of standard support]()|[End of basic support]()|
46+
|--|--|--|--|--|--|
47+
|2.4|July 8, 2019|End of Life Announced (EOLA)| Feb10,2023| Aug 10,2023|Feb 10,2024|
48+
|3.1|March 11,2022|GA |-|-|-|
49+
|3.3|March 22,2023|Public Preview|-|-|-|
50+
51+
## Apache Spark 2.4 to Spark 3.x Migration Guides
52+
53+
Spark 2.4 to Spark 3.x Migration Guides see [here](https://spark.apache.org/docs/latest/migration-guide.html).
54+
55+
## Next steps
56+
57+
- [Cluster setup for Apache Hadoop, Spark, and more on HDInsight](hdinsight-hadoop-provision-linux-clusters.md)
58+
- [Enterprise Security Package](./enterprise-security-package.md)
59+
- [Work in Apache Hadoop on HDInsight from a Windows PC](hdinsight-hadoop-windows-tools.md)

articles/hdinsight/hdinsight-component-versioning.md

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -3,29 +3,30 @@ title: Open-source components and versions - Azure HDInsight
33
description: Learn about the open-source components and versions in Azure HDInsight.
44
ms.service: hdinsight
55
ms.topic: conceptual
6-
ms.date: 02/01/2022
6+
ms.date: 02/25/2023
77
---
88

99
# Azure HDInsight versions
1010

11-
HDInsight bundles open-source components and HDInsight platform into a package that is deployed on a cluster. For more information, see [how HDInsight versioning works](hdinsight-overview-versioning.md).
11+
HDInsight bundles open-source components and HDInsight platform into a package that deployed on a cluster. For more information, see [how HDInsight versioning works](hdinsight-overview-versioning.md).
1212

1313
## Supported HDInsight versions
1414

1515
This table lists the versions of HDInsight that are available in the Azure portal and other deployment methods like PowerShell, CLI and the .NET SDK.
1616

1717
| HDInsight version | VM OS | Release date| Support type | Support expiration date | Retirement date | High availability |
1818
| --- | --- | --- | --- | --- | --- | ---|
19+
| [HDInsight 5.1](hdinsight-51-component-versioning.md) |Ubuntu 18.0.4 LTS |Feb 27, 2022 | [Standard](hdinsight-component-versioning.md#support-options-for-hdinsight-versions) | Not announced |Not announced| Yes |
1920
| [HDInsight 5.0](hdinsight-50-component-versioning.md) |Ubuntu 18.0.4 LTS |July 01, 2022 | [Standard](hdinsight-component-versioning.md#support-options-for-hdinsight-versions) | Not announced |Not announced| Yes |
2021
| [HDInsight 4.0](hdinsight-40-component-versioning.md) |Ubuntu 18.0.4 LTS |September 24, 2018 | [Standard](hdinsight-component-versioning.md#support-options-for-hdinsight-versions) | Not announced | Not announced |Yes |
2122

2223
**Support expiration** means that Microsoft no longer provides support for the specific HDInsight version. You may not be able to create clusters from the Azure portal.
2324

24-
**Retirement** means that existing clusters of an HDInsight version continue to run as is. New clusters of this version can't be created through any means, which includes the CLI and SDKs. Other control plane features, such as manual scaling and autoscaling, aren't guaranteed to work after retirement date. Support isn't available for retired versions.
25+
**Retirement** means that existing clusters of an HDInsight version continue to run as is. You can't create new clusters of this version through any means, which includes the CLI and SDKs. Other control plane features, such as manual scaling and autoscaling, not guaranteed to work after retirement date. Support isn't available for retired versions.
2526

2627
## Support options for HDInsight versions
2728

28-
Support is defined as a time period that an HDInsight version is supported by Microsoft Customer Service and Support. HDInsight offers two types of support:
29+
Support defined as a time period that an HDInsight version supported by Microsoft Customer Service and Support. HDInsight offers two types of support:
2930
- **Standard support**
3031
- **Basic support**
3132

@@ -46,7 +47,7 @@ Standard support includes
4647

4748
### Basic support
4849

49-
Basic support provides limited servicing to the HDInsight Resource provider. HDInsight images and open-source software (OSS) components won't be serviced. Only critical security fixes will be patched on HDInsight clusters.
50+
Basic support provides limited servicing to the HDInsight Resource provider. HDInsight images and open-source software (OSS) components won't be serviced. Only critical security fixes patched on HDInsight clusters.
5051

5152
Basic support includes
5253
- Continued use of existing clusters.
@@ -78,7 +79,7 @@ Microsoft doesn't encourage creating analytics pipelines or solutions on cluster
7879
For extra release notes on the latest versions of HDInsight, see [HDInsight release notes](hdinsight-release-notes.md).
7980

8081
## Versioning considerations
81-
- Once a cluster is deployed with an image, that cluster isn't automatically upgraded to newer image version. When you create new clusters, most recent image version will be deployed.
82+
- Once a cluster deployed with an image, that cluster can't automatically upgrade to newer image version. When you create new clusters, most recent image version deployed.
8283
- Customers should test and validate that applications run properly when using new HDInsight version.
8384
- HDInsight reserves the right to change the default version without prior notice. If you have a version dependency, specify the HDInsight version when you create your clusters.
8485
- HDInsight may retire an OSS component version before retiring the HDInsight version.

articles/hdinsight/hdinsight-release-notes.md

Lines changed: 40 additions & 60 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ description: Latest release notes for Azure HDInsight. Get development tips and
44
ms.custom: references_regions
55
ms.service: hdinsight
66
ms.topic: conceptual
7-
ms.date: 12/07/2022
7+
ms.date: 02/28/2023
88
---
99

1010

@@ -15,13 +15,13 @@ This article provides information about the **most recent** Azure HDInsight rele
1515
## Summary
1616

1717
Azure HDInsight is one of the most popular services among enterprise customers for open-source analytics on Azure.
18-
If you would like to subscribe on release notes, watch releases on [this GitHub repository](https://github.com/hdinsight/release-notes/releases).
18+
[Subscribe to our release notes](./subscribe-to-hdi-release-notes-repo.md) and watch releases on [this GitHub repository](https://github.com/hdinsight/release-notes/releases).
1919

20-
## Release date: December 12, 2022
20+
## Release date: February 28, 2023
2121

22-
This release applies to HDInsight 4.0. and 5.0 HDInsight release is made available to all regions over several days.
22+
This release applies to HDInsight 4.0. and 5.0, 5.1. HDInsight release will be available to all regions over several days. This release is applicable for image number 2302250400. [How to check the image number?](./view-hindsight-cluster-image-version.md)
2323

24-
HDInsight uses safe deployment practices, which involve gradual region deployment. It may take up to 10 business days for a new release or a new version to be available in all regions.
24+
HDInsight uses safe deployment practices, which involve gradual region deployment. it may take up to 10 business days for a new release or a new version to be available in all regions.
2525

2626
**OS versions**
2727

@@ -32,72 +32,52 @@ For workload specific versions, see [here.](./hdinsight-40-component-versioning.
3232

3333
![Icon showing new features with text.](media/hdinsight-release-notes/new-icon-for-new-feature.png)
3434

35-
* **Log Analytics** - Customers can enable classic monitoring to get the latest OMS version 14.19. To remove old versions, disable and enable classic monitoring.
36-
* **Ambari** user auto UI logout due to inactivity. For more information, see [here](./ambari-web-ui-auto-logout.md)
37-
* **Spark** - A new and optimized version of Spark 3.1.3 is included in this release. We tested Apache Spark 3.1.2(previous version) and Apache Spark 3.1.3(current version) using the TPC-DS benchmark. The test was carried out using E8 V3  SKU, for Apache Spark on 1-TB workload. Apache Spark 3.1.3 (current version) outperformed Apache Spark 3.1.2 (previous version) by over 40% in total query runtime for TPC-DS queries using the same hardware specs. The Microsoft Spark team added optimizations available in Azure Synapse with Azure HDInsight. For more information, please refer to [ Speed up your data workloads with performance updates to Apache Spark 3.1.2 in Azure Synapse](https://techcommunity.microsoft.com/t5/azure-synapse-analytics-blog/speed-up-your-data-workloads-with-performance-updates-to-apache/ba-p/2769467)
35+
**HDInsight 5.1**
3836

39-
![Icon showing new regions added with text.](media/hdinsight-release-notes/new-icon-for-new-regions-added.png)
37+
We have started rolling out a new version of HDInsight 5.1. All new open-source releases added as incremental releases on HDInsight 5.1.
4038

41-
* Qatar Central
42-
* Germany North
43-
44-
![Icon showing what's changed with text.](media/hdinsight-release-notes/new-icon-for-changed.png)
45-
46-
* HDInsight has moved away from Azul Zulu Java JDK 8 to Adoptium Temurin JDK 8, which supports high-quality TCK certified runtimes, and associated technology for use across the Java ecosystem.
47-
48-
* HDInsight has migrated to reload4j. The log4j changes are applicable to
49-
50-
* Apache Hadoop
51-
* Apache Zookeeper
52-
* Apache Oozie
53-
* Apache Ranger
54-
* Apache Sqoop
55-
* Apache Pig
56-
* Apache Ambari
57-
* Apache Kafka
58-
* Apache Spark
59-
* Apache Zeppelin
60-
* Apache Livy
61-
* Apache Rubix
62-
* Apache Hive
63-
* Apache Tez
64-
* Apache HBase
65-
* OMI
66-
* Apache Pheonix
39+
For more information, see [HDInsight 5.1.0 version](./hdinsight-51-component-versioning.md)
6740

6841
![Icon showing update with text.](media/hdinsight-release-notes/new-icon-for-updated.png)
6942

70-
HDInsight will implement TLS1.2 going forward, and earlier versions will be updated on the platform. If you're running any applications on top of HDInsight and they use TLS 1.0 and 1.1, upgrade to TLS 1.2 to avoid any disruption in services.
43+
**Kafka 3.2.0 Upgrade (Preview)**
7144

72-
For more information, see [How to enable Transport Layer Security (TLS)](/mem/configmgr/core/plan-design/security/enable-tls-1-2-client)
45+
* Kafka 3.2.0 includes several significant new features/improvements.
46+
* Upgraded Zookeeper to 3.6.3
47+
* Kafka Streams support
48+
* Stronger delivery guarantees for the Kafka producer enabled by default.
49+
* log4j 1.x replaced with reload4j.
50+
* Send a hint to the partition leader to recover the partition.
51+
* `JoinGroupRequest` and `LeaveGroupRequest` have a reason attached.
52+
* Added Broker count metrics8.
53+
* Mirror Maker2 improvements.
7354

55+
**HBase 2.4.11 Upgrade (Preview)**
56+
* This version has new features such as the addition of new caching mechanism types for block cache, the ability to alter `hbase:meta table` and view the `hbase:meta` table from the HBase WEB UI.
7457

75-
![Icon showing end of support with text.](media/hdinsight-release-notes/new-icon-for-end-of-support.png)
58+
**Phoenix 5.1.2 Upgrade (Preview)**
59+
* Phoenix version upgraded to 5.1.2 in this release. This upgrade includes the Phoenix Query Server. The Phoenix Query Server proxies the standard Phoenix JDBC driver and provides a backwards-compatible wire protocol to invoke that JDBC driver.
60+
61+
**Ambari CVEs**
62+
* Multiple Ambari CVEs are fixed.
7663

77-
End of support for Azure HDInsight clusters on Ubuntu 16.04 LTS from 30 November 2022. HDInsight had begun release of cluster images using Ubuntu 18.04 from June 27, 2021. We recommend our customers who are running clusters using Ubuntu 16.04 is to rebuild their clusters with the latest HDInsight images by 30 November 2022.
78-
79-
For more information on how to check Ubuntu version of cluster, see [here](https://learnubuntu.com/check-ubuntu-version)
80-
81-
1. Execute the command “lsb_release -a” in the terminal.
82-
83-
1. If the value for “Description” property in output is “Ubuntu 16.04 LTS”, then this update is applicable to the cluster.
64+
![Icon showing end of support with text.](media/hdinsight-release-notes/new-icon-for-end-of-support.png)
8465

85-
![Icon showing bug fixes with text.](media/hdinsight-release-notes/new-icon-for-bugfix.png)
66+
End of support for Azure HDInsight clusters on Spark 2.4 February 10, 2024. For more information, see [Spark versions supported in Azure HDInsight](./hdinsight-40-component-versioning.md#spark-versions-supported-in-azure-hdinsight)
8667

87-
* Support for Availability Zones selection for Kafka and HBase (write access) clusters.
68+
## Upcoming Changes
8869

89-
## Open source bug fixes
70+
* Cluster name change limitation
71+
* The max length of cluster name will be changed to 45 from 59 in Public, Mooncake and Fairfax.
72+
* Cluster permissions for secure storage
73+
* Customers can specify (during cluster creation) whether a secure channel should be used for HDInsight cluster nodes to contact the storage account.
74+
* Non-ESP ABFS clusters [Cluster Permissions for World Readable]
75+
* Plan to introduce a change in non-ESP ABFS clusters, which restricts non-Hadoop group users from executing Hadoop commands for storage operations. This change to improve cluster security posture. Customers need to plan for the updates.
76+
* Open-source upgrades
77+
* Apache Spark 3.3.0 and Hadoop 3.3.4 are under development on HDInsight 5.1 and will include several significant new features, performance and other improvements.
9078

91-
**Hive bug fixes**
79+
> [!NOTE]
80+
> We advise customers to use to latest versions of HDInsight [Images](./view-hindsight-cluster-image-version.md) as they bring in the best of open source updates, Azure updates and security fixes.
9281
93-
|Bug Fixes|Apache JIRA|
94-
|---|---|
95-
|[HIVE-26127](https://issues.apache.org/jira/browse/HIVE-26127)| INSERT OVERWRITE error - File Not Found|
96-
|[HIVE-24957](https://issues.apache.org/jira/browse/HIVE-24957)| Wrong results when subquery has COALESCE in correlation predicate|
97-
|[HIVE-24999](https://issues.apache.org/jira/browse/HIVE-24999)| HiveSubQueryRemoveRule generates invalid plan for IN subquery with multiple correlations|
98-
|[HIVE-24322](https://issues.apache.org/jira/browse/HIVE-24322)| If there's direct insert, the attempt ID has to be checked when reading the manifest fails|
99-
|[HIVE-23363](https://issues.apache.org/jira/browse/HIVE-23363)| Upgrade DataNucleus dependency to 5.2 |
100-
|[HIVE-26412](https://issues.apache.org/jira/browse/HIVE-26412)| Create interface to fetch available slots and add the default|
101-
|[HIVE-26173](https://issues.apache.org/jira/browse/HIVE-26173)| Upgrade derby to 10.14.2.0|
102-
|[HIVE-25920](https://issues.apache.org/jira/browse/HIVE-25920)| Bump Xerce2 to 2.12.2.|
103-
|[HIVE-26300](https://issues.apache.org/jira/browse/HIVE-26300)| Upgrade Jackson data bind version to 2.12.6.1+ to avoid CVE-2020-36518|
82+
### Next steps
83+
* [Azure HDInsight: Frequently asked questions](./hdinsight-faq.yml)

0 commit comments

Comments
 (0)