You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/synapse-analytics/machine-learning/setup-environment-cognitive-services.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -92,7 +92,7 @@ To get started on Azure Kubernetes Service, follow these steps:
92
92
93
93
1.[Deploy an Azure Kubernetes Service (AKS) cluster using the Azure portal](../../aks/learn/quick-kubernetes-deploy-portal.md)
94
94
95
-
1.[Install the Apache Spark 2.4.0 helm chart](https://hub.helm.sh/charts/microsoft/spark)
95
+
1.[Install the Apache Spark 2.4.0 helm chart](https://hub.helm.sh/charts/microsoft/spark) - warning: [Spark 2.4](../spark/apache-spark-24-runtime.md) is retired and out of the support.
96
96
97
97
1.[Install an Azure AI container using Helm](../../ai-services/computer-vision/deploy-computer-vision-on-premises.md)
Copy file name to clipboardExpand all lines: articles/synapse-analytics/overview-faq.yml
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -138,7 +138,7 @@ sections:
138
138
- question: |
139
139
What versions of Spark are available?
140
140
answer: |
141
-
As of May 2021, Azure Synapse Apache Spark fully supports Spark 2.4 and Spark 3.1. As of April 2022, Spark 3.2 is in preview. For a full list of core components and currently supported versions see [Apache Spark version support](./spark/apache-spark-version-support.md).
141
+
As of September 2023, Azure Synapse Apache Spark fully supports Spark 3.3. For a full list of core components and currently supported versions see [Apache Spark version support](./spark/apache-spark-version-support.md).
142
142
143
143
- question: |
144
144
Is there an equivalent of DButils in Azure Synapse Spark?
# Azure Synapse Runtime for Apache Spark 2.4 (EOLA)
13
+
# Azure Synapse Runtime for Apache Spark 2.4 (unsupported)
14
14
15
15
Azure Synapse Analytics supports multiple runtimes for Apache Spark. This document will cover the runtime components and versions for the Azure Synapse Runtime for Apache Spark 2.4.
16
16
17
-
> [!IMPORTANT]
18
-
> * End of life announced (EOLA) for Azure Synapse Runtime for Apache Spark 2.4 has been announced July 29, 2022.
19
-
> * In accordance with the Synapse runtime for Apache Spark lifecycle policy, Azure Synapse runtime for Apache Spark 2.4 will be retired and disabled as of September 29, 2023. After the EOL date, the retired runtimes are unavailable for new Spark pools and existing workflows can't execute. Metadata will temporarily remain in the Synapse workspace.
20
-
> * We recommend that you upgrade your Apache Spark 2.4 workloads to version 3.3 at your earliest convenience.
17
+
> [!WARNING]
18
+
> End of Support Notification for Azure Synapse Runtime for Apache Spark 2.4
19
+
> * Effective September 29, 2023, the Azure Synapse will discontinue official support for Spark 2.4 Runtimes.
20
+
> * Post September 29, we will not be addressing any support tickets related to Spark 2.4. There will be no release pipeline in place for bug or security fixes for Spark 2.4. Utilizing Spark 2.4 post the support cutoff date is undertaken at one's own risk. We strongly discourage its continued use due to potential security and functionality concerns.
21
+
> * Recognizing that certain customers may need additional time to transition to a higher runtime version, we are temporarily extending the usage option for Spark 2.4, but we will not provide any official support for it.
22
+
> * We strongly advise to proactively upgrade their workloads to a more recent version of the runtime (e.g., [Azure Synapse Runtime for Apache Spark 3.3 (GA)](./apache-spark-33-runtime.md)).
@@ -161,7 +161,7 @@ If the underlying data of your Hive tables are stored in Azure Blob storage acco
161
161
3. Provide **Name** of the linked service. Record the name of the linked service, this info will be used in Spark configuration shortly.
162
162
4. Select the Azure Blob Storage account. Make sure Authentication method is **Account key**. Currently Spark pool can only access Blob Storage account via account key.
163
163
5.**Test connection** and click **Create**.
164
-
6. After creating the linked service to Blob Storage account, when you run Spark queries, make sure you run below Spark code in the notebook to get access to the the Blob Storage account for the Spark session. Learn more about why you need to do this [here](./apache-spark-secure-credentials-with-tokenlibrary.md).
164
+
6. After creating the linked service to Blob Storage account, when you run Spark queries, make sure you run below Spark code in the notebook to get access to the Blob Storage account for the Spark session. Learn more about why you need to do this [here](./apache-spark-secure-credentials-with-tokenlibrary.md).
165
165
166
166
```python
167
167
%%pyspark
@@ -190,7 +190,7 @@ After setting up storage connections, you can query the existing tables in the H
190
190
No credentials found for account xxxxx.blob.core.windows.net in the configuration, and its container xxxxx is not accessible using anonymous credentials. Please check if the container exists first. If it is not publicly available, you have to provide account credentials.
191
191
```
192
192
193
-
When use key authentication to your storage account via linked service, you need to take an extra step to get the token for Spark session. Run below code to configure your Spark session before running the query. Learn more about why you need to do this here.
193
+
When using key authentication to your storage account via linked service, you need to take an extra step to get the token for Spark session. Run below code to configure your Spark session before running the query. Learn more about why you need to do this here.
194
194
195
195
```python
196
196
%%pyspark
@@ -254,4 +254,4 @@ You can easily fix this issue by appending `/usr/hdp/current/hadoop-client/*` to
Copy file name to clipboardExpand all lines: articles/synapse-analytics/spark/apache-spark-intelligent-cache-concept.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -93,7 +93,7 @@ You won't see the benefit of this feature if:
93
93
94
94
* Your workload requires large amounts of shuffle, then disabling the Intelligent Cache will free up available space to prevent your job from failing due to insufficient storage space.
95
95
96
-
* You're using a Spark 2.4 pool, you'll need to upgrade your pool to the latest version of Spark.
96
+
* You're using a Spark 3.1 pool, you'll need to upgrade your pool to the latest version of Spark.
Copy file name to clipboardExpand all lines: articles/synapse-analytics/spark/apache-spark-performance-hyperspace.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -31,7 +31,7 @@ This document is also available in notebook form, for [Python](https://github.co
31
31
## Setup
32
32
33
33
>[!Note]
34
-
> Hyperspace is supported in Azure Synapse Runtime for Apache Spark 2.4 (EOLA), Azure Synapse Runtime for Apache Spark 3.1 (EOLA), and Azure Synapse Runtime for Apache Spark 3.2 (EOLA). However, it should be noted that Hyperspace is not supported in Azure Synapse Runtime for Apache Spark 3.3 (GA).
34
+
> Hyperspace is supported in Azure Synapse Runtime for Apache Spark 3.1 (EOLA), and Azure Synapse Runtime for Apache Spark 3.2 (EOLA). However, it should be noted that Hyperspace is not supported in Azure Synapse Runtime for Apache Spark 3.3 (GA).
35
35
36
36
To begin with, start a new Spark session. Since this document is a tutorial merely to illustrate what Hyperspace can offer, you will make a configuration change that allows us to highlight what Hyperspace is doing on small datasets.
Copy file name to clipboardExpand all lines: articles/synapse-analytics/spark/apache-spark-version-support.md
+30-24Lines changed: 30 additions & 24 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
---
2
2
title: Apache Spark version support
3
3
description: Supported versions of Spark, Scala, Python, .NET
4
-
author: eskot
4
+
author: ekote
5
5
ms.service: synapse-analytics
6
6
ms.topic: reference
7
7
ms.subservice: spark
@@ -13,14 +13,39 @@ ms.reviewer: eskot
13
13
14
14
# Azure Synapse runtimes
15
15
16
-
Apache Spark pools in Azure Synapse use runtimes to tie together essential component versions such as Azure Synapse optimizations, packages, and connectors with a specific Apache Spark version. Each runtime will be upgraded periodically to include new improvements, features, and patches.
17
-
18
-
When you create a serverless Apache Spark pool, you will have the option to select the corresponding Apache Spark version. Based on this, the pool will come pre-installed with the associated runtime components and packages. The runtimes have the following advantages:
19
-
16
+
Apache Spark pools in Azure Synapse use runtimes to tie together essential component versions such as Azure Synapse optimizations, packages, and connectors with a specific Apache Spark version. Each runtime will be upgraded periodically to include new improvements, features, and patches. When you create a serverless Apache Spark pool, you will have the option to select the corresponding Apache Spark version. Based on this, the pool will come pre-installed with the associated runtime components and packages. The runtimes have the following advantages:
20
17
- Faster session startup times
21
18
- Tested compatibility with specific Apache Spark versions
22
19
- Access to popular, compatible connectors and open-source packages
23
20
21
+
22
+
## Supported Azure Synapse runtime releases
23
+
24
+
> [!WARNING]
25
+
> End of Support Notification for Azure Synapse Runtime for Apache Spark 2.4
26
+
> * Effective September 29, 2023, the Azure Synapse will discontinue official support for Spark 2.4 Runtimes.
27
+
> * Post September 29, we will not be addressing any support tickets related to Spark 2.4. There will be no release pipeline in place for bug or security fixes for Spark 2.4. Utilizing Spark 2.4 post the support cutoff date is undertaken at one's own risk. We strongly discourage its continued use due to potential security and functionality concerns.
28
+
> * Recognizing that certain customers may need additional time to transition to a higher runtime version, we are temporarily extending the usage option for Spark 2.4, but we will not provide any official support for it.
29
+
> * We strongly advise to proactively upgrade their workloads to a more recent version of the runtime (e.g., [Azure Synapse Runtime for Apache Spark 3.3 (GA)](./apache-spark-33-runtime.md)).
30
+
31
+
The following table lists the runtime name, Apache Spark version, and release date for supported Azure Synapse Runtime releases.
32
+
33
+
| Runtime name | Release date | Release stage | End of life announcement date | End of life effective date |
|[Azure Synapse Runtime for Apache Spark 3.3](./apache-spark-33-runtime.md)| Nov 17, 2022 | GA (as of Feb 23, 2023) | Nov 17, 2023 | Nov 17, 2024 |
36
+
|[Azure Synapse Runtime for Apache Spark 3.2](./apache-spark-32-runtime.md)| July 8, 2022 |__End of Life Announced (EOLA)__| July 8, 2023 | July 8, 2024 |
37
+
|[Azure Synapse Runtime for Apache Spark 3.1](./apache-spark-3-runtime.md)| May 26, 2021 |__End of Life Announced (EOLA)__| January 26, 2023 | January 26, 2024 |
38
+
|[Azure Synapse Runtime for Apache Spark 2.4](./apache-spark-24-runtime.md)| December 15, 2020 |__End of Life (EOL)__|__July 29, 2022__|__September 29, 2023__|
39
+
40
+
## Runtime release stages
41
+
42
+
For the complete runtime for Apache Spark lifecycle and support policies, refer to [Synapse runtime for Apache Spark lifecycle and supportability](./runtime-for-apache-spark-lifecycle-and-supportability.md).
43
+
44
+
## Runtime patching
45
+
46
+
Azure Synapse runtime for Apache Spark patches are rolled out monthly containing bug, feature and security fixes to the Apache Spark core engine, language environments, connectors and libraries.
47
+
48
+
24
49
> [!NOTE]
25
50
> - Maintenance updates will be automatically applied to new sessions for a given serverless Apache Spark pool.
26
51
> - You should test and validate that your applications run properly when using new runtime versions.
@@ -41,25 +66,6 @@ When you create a serverless Apache Spark pool, you will have the option to sele
41
66
> *```org/apache/log4j/chainsaw/*```
42
67
>
43
68
> While the above classes were not used in the default Log4j configurations in Synapse, it is possible that some user application could still depend on it. If your application needs to use these classes, use Library Management to add a secure version of Log4j to the Spark Pool. __Do not use Log4j version 1.2.17__, as it would be reintroducing the vulnerabilities.
44
-
>
45
-
46
-
## Supported Azure Synapse runtime releases
47
-
The following table lists the runtime name, Apache Spark version, and release date for supported Azure Synapse Runtime releases.
48
-
49
-
| Runtime name | Release date | Release stage | End of life announcement date | End of life effective date |
|[Azure Synapse Runtime for Apache Spark 3.3](./apache-spark-33-runtime.md)| Nov 17, 2022 | GA (as of Feb 23, 2023) | Nov 17, 2023 | Nov 17, 2024 |
52
-
|[Azure Synapse Runtime for Apache Spark 3.2](./apache-spark-32-runtime.md)| July 8, 2022 |__End of Life Announced (EOLA)__| July 8, 2023 | July 8, 2024 |
53
-
|[Azure Synapse Runtime for Apache Spark 3.1](./apache-spark-3-runtime.md)| May 26, 2021 |__End of Life Announced (EOLA)__| January 26, 2023 | January 26, 2024 |
54
-
|[Azure Synapse Runtime for Apache Spark 2.4](./apache-spark-24-runtime.md)| December 15, 2020 |__End of Life Announced (EOLA)__|__July 29, 2022__|__September 29, 2023__|
55
-
56
-
## Runtime release stages
57
-
58
-
For the complete runtime for Apache Spark lifecycle and support policies, refer to [Synapse runtime for Apache Spark lifecycle and supportability](./runtime-for-apache-spark-lifecycle-and-supportability.md).
59
-
60
-
## Runtime patching
61
-
62
-
Azure Synapse runtime for Apache Spark patches are rolled out monthly containing bug, feature and security fixes to the Apache Spark core engine, language environments, connectors and libraries.
63
69
64
70
The patch policy differs based on the [runtime lifecycle stage](./runtime-for-apache-spark-lifecycle-and-supportability.md):
65
71
1. Generally Available (GA) runtime: Receive no upgrades on major versions (i.e. 3.x -> 4.x). And will upgrade a minor version (i.e. 3.x -> 3.y) as long as there are no deprecation or regression impacts.
Copy file name to clipboardExpand all lines: articles/synapse-analytics/spark/data-sources/apache-spark-kusto-connector.md
+4-5Lines changed: 4 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -12,20 +12,19 @@ author: midesa
12
12
---
13
13
14
14
# Azure Data Explorer (Kusto) connector for Apache Spark
15
-
The Azure Data Explorer (Kusto) connector for Apache Spark is designed to efficiently transfer data between Kusto clusters and Spark. This connector is available in Python, Java, and .NET. It is built in to the Azure Synapse Apache Spark 2.4 runtime (EOLA).
15
+
The Azure Data Explorer (Kusto) connector for Apache Spark is designed to efficiently transfer data between Kusto clusters and Spark. This connector is available in Python, Java, and .NET.
16
16
17
17
## Authentication
18
18
When using Azure Synapse Notebooks or Apache Spark job definitions, the authentication between systems is made seamless with the linked service. The Token Service connects with Azure Active Directory to obtain security tokens for use when accessing the Kusto cluster.
19
19
20
-
For Azure Synapse Pipelines, the authentication will use the service principal name. Currently, managed identities are not supported with the Azure Data Explorer connector.
20
+
For Azure Synapse Pipelines, the authentication uses the service principal name. Currently, managed identities aren't supported with the Azure Data Explorer connector.
21
21
22
22
## Prerequisites
23
-
-[Connect to Azure Data Explorer](../../quickstart-connect-azure-data-explorer.md): You will need to set up a Linked Service to connect to an existing Kusto cluster.
23
+
-[Connect to Azure Data Explorer](../../quickstart-connect-azure-data-explorer.md): You need to set up a Linked Service to connect to an existing Kusto cluster.
24
24
25
25
## Limitations
26
-
- The Azure Data Explorer (Kusto) connector is currently only supported on the Azure Synapse Apache Spark 2.4 runtime (EOLA).
27
26
- The Azure Data Explorer linked service can only be configured with the Service Principal Name.
28
-
- Within Azure Synapse Notebooks or Apache Spark Job Definitions, the Azure Data Explorer connector will use Azure AD pass-through to connect to the Kusto Cluster.
27
+
- Within Azure Synapse Notebooks or Apache Spark Job Definitions, the Azure Data Explorer connector uses Azure AD pass-through to connect to the Kusto Cluster.
0 commit comments