Skip to content

Commit c6043ef

Browse files
authored
Merge pull request #294636 from whhender/fixing-broken-author
Resolving authors
2 parents 7749cb6 + 73c7518 commit c6043ef

9 files changed

+41
-41
lines changed

articles/synapse-analytics/spark/apache-spark-24-runtime.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
---
22
title: Azure Synapse Runtime for Apache Spark 2.4 (unsupported)
33
description: Versions of Spark, Scala, Python, and .NET for Apache Spark 2.4.
4-
author: ekote
5-
ms.author: eskot
4+
author: ms-arali
5+
ms.author: arali
66
ms.service: azure-synapse-analytics
77
ms.topic: reference
88
ms.subservice: spark

articles/synapse-analytics/spark/apache-spark-3-runtime.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,9 @@
11
---
22
title: Azure Synapse Runtime for Apache Spark 3.1 (unsupported)
33
description: Supported versions of Spark, Scala, Python, and .NET for Apache Spark 3.1.
4-
author: ekote
5-
ms.author: eskot
6-
ms.reviewer: whhender, whhender
4+
author: ms-arali
5+
ms.author: arali
6+
ms.reviewer: whhender
77
ms.service: azure-synapse-analytics
88
ms.topic: reference
99
ms.subservice: spark
@@ -17,12 +17,12 @@ Azure Synapse Analytics supports multiple runtimes for Apache Spark. This docume
1717

1818
> [!CAUTION]
1919
> Deprecation and disablement notification for Azure Synapse Runtime for Apache Spark 3.1.
20-
>* **On August 29, 2024,** partial pools and jobs disablement will begin. We will continue with further, **full disablement by September 30, 2024.** **Immediately** migrate to higher runtime versions otherwise your jobs will stop executing.
20+
>* **On August 29, 2024,** partial pools and jobs disablement will begin. We'll continue with further, **full disablement by September 30, 2024.** **Immediately** migrate to higher runtime versions otherwise your jobs will stop executing.
2121
> * **All Spark jobs running on Azure Synapse Runtime for Apache Spark 3.1 will be fully disabled as of** **September 30, 2024.**
2222
* End of Support for Azure Synapse Runtime for Apache Spark 3.1 announced January 26, 2023.
2323
* Effective January 26, 2024, the Azure Synapse has stopped official support for Spark 3.1 Runtimes.
24-
* Post January 26, 2024, we will not be addressing any support tickets related to Spark 3.1. There will be no release pipeline in place for bug or security fixes for Spark 3.1. Utilizing Spark 3.1 post the support cutoff date is undertaken at one's own risk. We strongly discourage its continued use due to potential security and functionality concerns.
25-
* Recognizing that certain customers may need additional time to transition to a higher runtime version, we are temporarily extending the usage option for Spark 3.1, but we will not provide any official support for it.
24+
* Post January 26, 2024, we won't be addressing any support tickets related to Spark 3.1. There will be no release pipeline in place for bug or security fixes for Spark 3.1. Utilizing Spark 3.1 post the support cutoff date is undertaken at one's own risk. We strongly discourage its continued use due to potential security and functionality concerns.
25+
* Recognizing that certain customers might need more time to transition to a higher runtime version, we're temporarily extending the usage option for Spark 3.1, but we won't provide any official support for it.
2626
* **We strongly advise proactively upgrading workloads to a more recent version of the runtime (e.g., [Azure Synapse Runtime for Apache Spark 3.4 (GA)](./apache-spark-34-runtime.md))**.
2727
## Component versions
2828

articles/synapse-analytics/spark/apache-spark-32-runtime.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,9 @@
11
---
22
title: Azure Synapse Runtime for Apache Spark 3.2
33
description: Supported versions of Spark, Scala, Python, and .NET for Apache Spark 3.2.
4-
author: ekote
5-
ms.author: eskot
6-
ms.reviewer: whhender, whhender
4+
author: ms-arali
5+
ms.author: arali
6+
ms.reviewer: whhender
77
ms.service: azure-synapse-analytics
88
ms.topic: reference
99
ms.subservice: spark

articles/synapse-analytics/spark/apache-spark-33-runtime.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,9 @@
11
---
22
title: Azure Synapse Runtime for Apache Spark 3.3
33
description: New runtime is GA and ready for production workloads. Spark 3.3.1, Python 3.10, Delta Lake 2.2.
4-
author: ekote
5-
ms.author: eskot
6-
ms.reviewer: whhender, whhender
4+
author: ms-arali
5+
ms.author: arali
6+
ms.reviewer: whhender
77
ms.service: azure-synapse-analytics
88
ms.topic: reference
99
ms.subservice: spark
@@ -41,9 +41,9 @@ Azure Synapse Analytics supports multiple runtimes for Apache Spark. This docume
4141
> .NET for Apache Spark
4242
> * The [.NET for Apache Spark](https://github.com/dotnet/spark) is an open-source project under the .NET Foundation that currently requires the .NET 3.1 library, which has reached the out-of-support status. We would like to inform users of Azure Synapse Spark of the removal of the .NET for Apache Spark library in the Azure Synapse Runtime for Apache Spark version 3.3. Users may refer to the [.NET Support Policy](https://dotnet.microsoft.com/platform/support/policy/dotnet-core) for more details on this matter.
4343
>
44-
> * As a result, it will no longer be possible for users to utilize Apache Spark APIs via C# and F#, or execute C# code in notebooks within Synapse or through Apache Spark Job definitions in Synapse. It is important to note that this change affects only Azure Synapse Runtime for Apache Spark 3.3 and above.
44+
> * As a result, it will no longer be possible for users to utilize Apache Spark APIs via C# and F#, or execute C# code in notebooks within Synapse or through Apache Spark Job definitions in Synapse. It's important to note that this change affects only Azure Synapse Runtime for Apache Spark 3.3 and above.
4545
>
46-
> * We will continue to support .NET for Apache Spark in all previous versions of the Azure Synapse Runtime according to [their lifecycle stages](runtime-for-apache-spark-lifecycle-and-supportability.md). However, we do not have plans to support .NET for Apache Spark in Azure Synapse Runtime for Apache Spark 3.3 and future versions. We recommend that users with existing workloads written in C# or F# migrate to Python or Scala. Users are advised to take note of this information and plan accordingly.
46+
> * We'll continue to support .NET for Apache Spark in all previous versions of the Azure Synapse Runtime according to [their lifecycle stages](runtime-for-apache-spark-lifecycle-and-supportability.md). However, we don't have plans to support .NET for Apache Spark in Azure Synapse Runtime for Apache Spark 3.3 and future versions. We recommend that users with existing workloads written in C# or F# migrate to Python or Scala. Users are advised to take note of this information and plan accordingly.
4747
4848
## Libraries
4949
To check the libraries included in Azure Synapse Runtime for Apache Spark 3.3 for Java/Scala, Python, and R go to [Azure Synapse Runtime for Apache Spark 3.3](https://github.com/microsoft/synapse-spark-runtime/tree/main/Synapse/spark3.3)
@@ -58,4 +58,4 @@ To check the libraries included in Azure Synapse Runtime for Apache Spark 3.3 fo
5858

5959
## Migration between Apache Spark versions - support
6060

61-
For guidance on migrating from older runtime versions to Azure Synapse Runtime for Apache Spark 3.3 or 3.4 refer to [Runtime for Apache Spark Overview](./apache-spark-version-support.md#migration-between-apache-spark-versions---support).
61+
For guidance on migrating from older runtime versions to Azure Synapse Runtime for Apache Spark 3.3 or 3.4, refer to [Runtime for Apache Spark Overview](./apache-spark-version-support.md#migration-between-apache-spark-versions---support).

articles/synapse-analytics/spark/apache-spark-34-runtime.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,9 @@
11
---
22
title: Azure Synapse Runtime for Apache Spark 3.4
33
description: New runtime is in GA stage. Try it and use Spark 3.4.1, Python 3.10, Delta Lake 2.4.
4-
author: ekote
5-
ms.author: eskot
6-
ms.reviewer: whhender, whhender
4+
author: ms-arali
5+
ms.author: arali
6+
ms.reviewer: whhender
77
ms.service: azure-synapse-analytics
88
ms.topic: reference
99
ms.subservice: spark

articles/synapse-analytics/spark/apache-spark-performance-hyperspace.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,9 @@
11
---
22
title: Hyperspace indexes for Apache Spark
33
description: Performance optimization for Apache Spark using Hyperspace indexes
4-
author: ekote
5-
ms.author: eskot
6-
ms.reviewer: whhender, whhender
4+
author: ms-arali
5+
ms.author: arali
6+
ms.reviewer: whhender
77
ms.service: azure-synapse-analytics
88
ms.topic: conceptual
99
ms.subservice: spark

articles/synapse-analytics/spark/apache-spark-version-support.md

Lines changed: 9 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
---
22
title: Apache Spark version support
33
description: Supported versions of Spark, Scala, Python
4-
author: ekote
5-
ms.author: eskot
4+
author: ms-arali
5+
ms.author: arali
66
ms.reviewer: maghan, whhender, whhender
77
ms.date: 03/08/2024
88
ms.service: azure-synapse-analytics
@@ -29,7 +29,7 @@ The following table lists the runtime name, Apache Spark version, and release da
2929
| Runtime name | Release date | Release stage | End of Support announcement date | End of Support effective date |
3030
| --- | --- |------------------------------| --- | --- |
3131
| [Azure Synapse Runtime for Apache Spark 3.4](./apache-spark-34-runtime.md) | Nov 21, 2023 | GA (as of Apr 8, 2024) | Q2 2025| Q1 2026|
32-
| [Azure Synapse Runtime for Apache Spark 3.3](./apache-spark-33-runtime.md) | Nov 17, 2022 |**end of support announced**|July 12th, 2024| 3/31/2025 |
32+
| [Azure Synapse Runtime for Apache Spark 3.3](./apache-spark-33-runtime.md) | Nov 17, 2022 |**end of support announced**|July 12, 2024| 3/31/2025 |
3333
| [Azure Synapse Runtime for Apache Spark 3.2](./apache-spark-32-runtime.md) | July 8, 2022 | __deprecated and soon disabled__ | July 8, 2023 | __July 8, 2024__ |
3434
| [Azure Synapse Runtime for Apache Spark 3.1](./apache-spark-3-runtime.md) | May 26, 2021 | __deprecated and soon disabled__ | January 26, 2023 | __January 26, 2024__ |
3535
| [Azure Synapse Runtime for Apache Spark 2.4](./apache-spark-24-runtime.md) | December 15, 2020 | __deprecated and soon disabled__ | July 29, 2022 | __September 29, 2023__ |
@@ -60,7 +60,7 @@ Azure Synapse runtimes for Apache Spark patches are rolled out monthly containin
6060
> * ```org/apache/log4j/jdbc/JDBCAppender.class```
6161
> * ```org/apache/log4j/chainsaw/*```
6262
>
63-
> While the above classes were not used in the default Log4j configurations in Synapse, it is possible that some user application could still depend on it. If your application needs to use these classes, use Library Management to add a secure version of Log4j to the Spark Pool. __Do not use Log4j version 1.2.17__, as it would be reintroducing the vulnerabilities.
63+
> While the above classes weren't used in the default Log4j configurations in Synapse, it's possible that some user application could still depend on it. If your application needs to use these classes, use Library Management to add a secure version of Log4j to the Spark Pool. __Do not use Log4j version 1.2.17__, as it would be reintroducing the vulnerabilities.
6464
6565
The patch policy differs based on the [runtime lifecycle stage](./runtime-for-apache-spark-lifecycle-and-supportability.md):
6666

@@ -75,20 +75,20 @@ The patch policy differs based on the [runtime lifecycle stage](./runtime-for-ap
7575

7676
## Migration between Apache Spark versions - support
7777

78-
This guide provides a structured approach for users looking to upgrade their Azure Synapse Runtime for Apache Spark workloads from versions 2.4, 3.1, 3.2, or 3.3 to [the latest GA version, such as 3.4](./apache-spark-34-runtime.md). Upgrading to the most recent version enables users to benefit from performance enhancements, new features, and improved security measures. It is important to note that transitioning to a higher version may require adjustments to your existing Spark code due to incompatibilities or deprecated features.
78+
This guide provides a structured approach for users looking to upgrade their Azure Synapse Runtime for Apache Spark workloads from versions 2.4, 3.1, 3.2, or 3.3 to [the latest GA version, such as 3.4](./apache-spark-34-runtime.md). Upgrading to the most recent version enables users to benefit from performance enhancements, new features, and improved security measures. It's important to note that transitioning to a higher version may require adjustments to your existing Spark code due to incompatibilities or deprecated features.
7979

8080
### Step 1: Evaluate and plan
81-
- **Assess Compatibility:** Start with reviewing Apache Spark migration guides to identify any potential incompatibilities, deprecated features, and new APIs between your current Spark version (2.4, 3.1, 3.2, or 3.3) and the target version (e.g., 3.4).
81+
- **Assess Compatibility:** Start with reviewing Apache Spark migration guides to identify any potential incompatibilities, deprecated features, and new APIs between your current Spark version (2.4, 3.1, 3.2, or 3.3) and the target version (for example, 3.4).
8282
- **Analyze Codebase:** Carefully examine your Spark code to identify the use of deprecated or modified APIs. Pay particular attention to SQL queries and User Defined Functions (UDFs), which may be affected by the upgrade.
8383

8484
### Step 2: Create a new Spark pool for testing
85-
- **Create a New Pool:** In Azure Synapse, go to the Spark pools section and set up a new Spark pool. Select the target Spark version (e.g., 3.4) and configure it according to your performance requirements.
85+
- **Create a New Pool:** In Azure Synapse, go to the Spark pools section and set up a new Spark pool. Select the target Spark version (for example, 3.4) and configure it according to your performance requirements.
8686
- **Configure Spark Pool Configuration:** Ensure that all libraries and dependencies in your new Spark pool are updated or replaced to be compatible with Spark 3.4.
8787

8888
### Step 3: Migrate and test your code
8989
- **Migrate Code:** Update your code to be compliant with the new or revised APIs in Apache Spark 3.4. This involves addressing deprecated functions and adopting new features as detailed in the official Apache Spark documentation.
9090
- **Test in Development Environment:** Test your updated code within a development environment in Azure Synapse, not locally. This step is essential for identifying and fixing any issues before moving to production.
91-
- **Deploy and Monitor:** After thorough testing and validation in the development environment, deploy your application to the new Spark 3.4 pool. It is critical to monitor the application for any unexpected behaviors. Utilize the monitoring tools available in Azure Synapse to keep track of your Spark applications' performance.
91+
- **Deploy and Monitor:** After thorough testing and validation in the development environment, deploy your application to the new Spark 3.4 pool. It's critical to monitor the application for any unexpected behaviors. Utilize the monitoring tools available in Azure Synapse to keep track of your Spark applications' performance.
9292

9393
**Question:** What steps should be taken in migrating from 2.4 to 3.X?
9494

@@ -102,7 +102,7 @@ This guide provides a structured approach for users looking to upgrade their Azu
102102

103103
**Question:** Why can't I upgrade to 3.4 without recreating a new Spark pool?
104104

105-
**Answer:** This is not allowed from UX, customer can use Azure PowerShell to update Spark version. Please use "ForceApplySetting", so that any existing clusters (with old version) are decommissioned.
105+
**Answer:** This isn't allowed from UX, customer can use Azure PowerShell to update Spark version. Use "ForceApplySetting", so that any existing clusters (with old version) are decommissioned.
106106

107107
**Sample query:**
108108

articles/synapse-analytics/spark/data-sources/apache-spark-sql-connector.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
---
22
title: Azure SQL and SQL Server
33
description: This article provides information on how to use the connector for moving data between Azure MS SQL and serverless Apache Spark pools.
4-
author: ekote
5-
ms.author: eskot
4+
author: ms-arali
5+
ms.author: arali
66
ms.service: azure-synapse-analytics
77
ms.topic: overview
88
ms.subservice: spark
@@ -13,14 +13,14 @@ ms.custom: has-adal-ref
1313
# Azure SQL Database and SQL Server connector for Apache Spark
1414
The Apache Spark connector for Azure SQL Database and SQL Server enables these databases to act as input data sources and output data sinks for Apache Spark jobs. It allows you to use real-time transactional data in big data analytics and persist results for ad-hoc queries or reporting.
1515

16-
Compared to the built-in JDBC connector, this connector provides the ability to bulk insert data into SQL databases. It can outperform row-by-row insertion with 10x to 20x faster performance. The Spark connector for SQL Server and Azure SQL Database also supports Microsoft Entra [authentication](/sql/connect/spark/connector#azure-active-directory-authentication), enabling you to connect securely to your Azure SQL databases from Azure Synapse Analytics.
16+
Compared to the built-in JDBC connector, this connector provides the ability to bulk insert data into SQL databases. It can outperform row-by-row insertion with 10 to 20 times faster performance. The Spark connector for SQL Server and Azure SQL Database also supports Microsoft Entra [authentication](/sql/connect/spark/connector#azure-active-directory-authentication), enabling you to connect securely to your Azure SQL databases from Azure Synapse Analytics.
1717

1818
This article covers how to use the DataFrame API to connect to SQL databases using the MS SQL connector. This article provides detailed examples using the PySpark API. For all of the supported arguments and samples for connecting to SQL databases using the MS SQL connector, see [Azure Data SQL samples](https://github.com/microsoft/sql-server-samples#azure-data-sql-samples-repository).
1919

2020

2121

2222
## Connection details
23-
In this example, we will use the Microsoft Spark utilities to facilitate acquiring secrets from a pre-configured Key Vault. To learn more about Microsoft Spark utilities, please visit [introduction to Microsoft Spark Utilities](../microsoft-spark-utilities.md).
23+
In this example, we'll use the Microsoft Spark utilities to facilitate acquiring secrets from a preconfigured Key Vault. To learn more about Microsoft Spark utilities, visit [introduction to Microsoft Spark Utilities](../microsoft-spark-utilities.md).
2424

2525
```python
2626
# The servername is in the format "jdbc:sqlserver://<AzureSQLServerName>.database.windows.net:1433"
@@ -35,7 +35,7 @@ password = mssparkutils.credentials.getSecret('azure key vault name','secret nam
3535
```
3636

3737
> [!NOTE]
38-
> Currently, there is no linked service or Microsoft Entra pass-through support with the Azure SQL connector.
38+
> Currently, there's no linked service or Microsoft Entra pass-through support with the Azure SQL connector.
3939
4040
## Use the Azure SQL and SQL Server connector
4141

@@ -144,12 +144,12 @@ jdbc_df = spark.read \
144144
> - A required dependency must be installed in order to authenticate using Active Directory.
145145
> - The format of `user` when using ActiveDirectoryPassword should be the UPN format, for example `[email protected]`.
146146
> - For **Scala**, the `com.microsoft.aad.adal4j` artifact will need to be installed.
147-
> - For **Python**, the `adal` library will need to be installed. This is available via pip.
147+
> - For **Python**, the `adal` library will need to be installed. This is available via pip.
148148
> - Check the [sample notebooks](https://github.com/microsoft/sql-spark-connector/tree/master/samples) for examples and for latest drivers and versions, visit [Apache Spark connector: SQL Server & Azure SQL](/sql/connect/spark/connector).
149149
150150
## Support
151151

152-
The Apache Spark Connector for Azure SQL and SQL Server is an open-source project. This connector does not come with any Microsoft support. For issues with or questions about the connector, create an Issue in this project repository. The connector community is active and monitoring submissions.
152+
The Apache Spark Connector for Azure SQL and SQL Server is an open-source project. This connector doesn't come with any Microsoft support. For issues with or questions about the connector, create an Issue in this project repository. The connector community is active and monitoring submissions.
153153

154154
## Next steps
155155
- [Learn more about the SQL Server and Azure SQL connector](/sql/connect/spark/connector)

articles/synapse-analytics/spark/synapse-spark-sql-pool-import-export.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,12 @@
11
---
22
title: Azure Synapse Dedicated SQL Pool Connector for Apache Spark
33
description: Azure Synapse Dedicated SQL Pool Connector for Apache Spark to move data between the Synapse Serverless Spark Pool and the Synapse Dedicated SQL Pool.
4-
author: kalyankadiyala-Microsoft
4+
author: dawn2111
55
ms.service: azure-synapse-analytics
66
ms.topic: overview
77
ms.subservice: spark
88
ms.date: 01/22/2025
9-
ms.author: kakadiya
9+
ms.author: prdawn
1010
ms.reviewer: ktuckerdavis, aniket.adnaik
1111
---
1212
# Azure Synapse Dedicated SQL Pool Connector for Apache Spark

0 commit comments

Comments
 (0)