Skip to content

Commit 9e771ef

Browse files
Merge pull request #238834 from fbsolo-ms1/build-updates-for-YP
Requested file updates.
2 parents ea564a9 + dcc839a commit 9e771ef

7 files changed

+149
-191
lines changed

articles/machine-learning/apache-spark-azure-ml-concepts.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -14,16 +14,16 @@ ms.custom: cliv2, sdkv2, build-2023
1414
#Customer intent: As a full-stack machine learning pro, I want to use Apache Spark in Azure Machine Learning.
1515
---
1616

17-
# Apache Spark in Azure Machine Learning (preview)
17+
# Apache Spark in Azure Machine Learning
1818

19-
Azure Machine Learning integration with Azure Synapse Analytics (preview) provides easy access to distributed computation resources through the Apache Spark framework. This integration offers these Apache Spark computing experiences:
19+
Azure Machine Learning integration with Azure Synapse Analytics provides easy access to distributed computation resources through the Apache Spark framework. This integration offers these Apache Spark computing experiences:
2020

21-
- Serverless Spark compute (preview)
22-
- Attached Synapse Spark pool (preview)
21+
- Serverless Spark compute
22+
- Attached Synapse Spark pool
2323

2424
[!INCLUDE [machine-learning-preview-generic-disclaimer](../../includes/machine-learning-preview-generic-disclaimer.md)]
2525

26-
## Serverless Spark compute (preview)
26+
## Serverless Spark compute
2727

2828
With the Apache Spark framework, Azure Machine Learning serverless Spark compute is the easiest way to accomplish distributed computing tasks in the Azure Machine Learning environment. Azure Machine Learning offers a fully managed, serverless, on-demand Apache Spark compute cluster. Its users can avoid the need to create an Azure Synapse workspace and a Synapse Spark pool.
2929

@@ -118,8 +118,8 @@ To access data and other resources, a Spark job can use either a user identity p
118118
119119
## Next steps
120120

121-
- [Attach and manage a Synapse Spark pool in Azure Machine Learning (preview)](./how-to-manage-synapse-spark-pool.md)
122-
- [Interactive data wrangling with Apache Spark in Azure Machine Learning (preview)](./interactive-data-wrangling-with-apache-spark-azure-ml.md)
123-
- [Submit Spark jobs in Azure Machine Learning (preview)](./how-to-submit-spark-jobs.md)
121+
- [Attach and manage a Synapse Spark pool in Azure Machine Learning](./how-to-manage-synapse-spark-pool.md)
122+
- [Interactive data wrangling with Apache Spark in Azure Machine Learning](./interactive-data-wrangling-with-apache-spark-azure-ml.md)
123+
- [Submit Spark jobs in Azure Machine Learning](./how-to-submit-spark-jobs.md)
124124
- [Code samples for Spark jobs using the Azure Machine Learning CLI](https://github.com/Azure/azureml-examples/tree/main/cli/jobs/spark)
125125
- [Code samples for Spark jobs using the Azure Machine Learning Python SDK](https://github.com/Azure/azureml-examples/tree/main/sdk/python/jobs/spark)

articles/machine-learning/apache-spark-environment-configuration.md

Lines changed: 13 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: Apache Spark - Environment Configuration
2+
title: Apache Spark - environment configuration
33
titleSuffix: Azure Machine Learning
44
description: Learn how to configure your Apache Spark environment for interactive data wrangling
55
author: ynpandey
@@ -8,27 +8,20 @@ ms.reviewer: franksolomon
88
ms.service: machine-learning
99
ms.subservice: mldata
1010
ms.topic: how-to
11-
ms.date: 03/06/2023
11+
ms.date: 05/22/2023
1212
#Customer intent: As a Full Stack ML Pro, I want to perform interactive data wrangling in Azure Machine Learning with Apache Spark.
1313
---
1414

15-
# Quickstart: Interactive Data Wrangling with Apache Spark in Azure Machine Learning (preview)
15+
# Quickstart: Interactive Data Wrangling with Apache Spark in Azure Machine Learning
1616

17-
[!INCLUDE [preview disclaimer](../../includes/machine-learning-preview-generic-disclaimer.md)]
17+
To handle interactive Azure Machine Learning notebook data wrangling, Azure Machine Learning integration with Azure Synapse Analytics provides easy access to the Apache Spark framework. This access allows for Azure Machine Learning Notebook interactive data wrangling.
1818

19-
To handle interactive Azure Machine Learning notebook data wrangling, Azure Machine Learning integration with Azure Synapse Analytics (preview) provides easy access to the Apache Spark framework. This access allows for Azure Machine Learning Notebook interactive data wrangling.
20-
21-
In this quickstart guide, you learn how to perform interactive data wrangling using Azure Machine Learning Managed (Automatic) Synapse Spark compute, Azure Data Lake Storage (ADLS) Gen 2 storage account, and user identity passthrough.
19+
In this quickstart guide, you learn how to perform interactive data wrangling using Azure Machine Learning serverless Spark compute, Azure Data Lake Storage (ADLS) Gen 2 storage account, and user identity passthrough.
2220

2321
## Prerequisites
2422
- An Azure subscription; if you don't have an Azure subscription, [create a free account](https://azure.microsoft.com/free) before you begin.
2523
- An Azure Machine Learning workspace. See [Create workspace resources](./quickstart-create-resources.md).
2624
- An Azure Data Lake Storage (ADLS) Gen 2 storage account. See [Create an Azure Data Lake Storage (ADLS) Gen 2 storage account](../storage/blobs/create-data-lake-storage-account.md).
27-
- To enable this feature:
28-
1. Navigate to the Azure Machine Learning studio UI
29-
2. In the icon section at the top right of the screen, select **Manage preview features** (megaphone icon)
30-
3. In the **Managed preview feature** panel, toggle the **Run notebooks and jobs on managed Spark** feature to **on**
31-
:::image type="content" source="./media/apache-spark-environment-configuration/how-to-enable-managed-spark-preview.png" lightbox="media/apache-spark-environment-configuration/how-to-enable-managed-spark-preview.png" alt-text="Screenshot showing the option to enable the Managed Spark preview.":::
3225

3326
## Store Azure storage account credentials as secrets in Azure Key Vault
3427

@@ -113,20 +106,20 @@ Once the user identity has the appropriate roles assigned, data in the Azure sto
113106
114107
## Ensuring resource access for Spark jobs
115108

116-
Spark jobs can use either a managed identity or user identity passthrough to access data and other resources. The following table summarizes the different mechanisms for resource access while using Azure Machine Learning serverless Spark compute (preview) and attached Synapse Spark pool.
109+
To access data and other resources, Spark jobs can use either a managed identity or user identity passthrough. The following table summarizes the different mechanisms for resource access while using Azure Machine Learning serverless Spark compute and attached Synapse Spark pool.
117110

118111
|Spark pool|Supported identities|Default identity|
119112
| ---------- | -------------------- | ---------------- |
120-
|Serverless Spark compute (preview)|User identity and managed identity|User identity|
113+
|Serverless Spark compute|User identity and managed identity|User identity|
121114
|Attached Synapse Spark pool|User identity and managed identity|Managed identity - compute identity of the attached Synapse Spark pool|
122115

123-
If the CLI or SDK code defines an option to use managed identity, Azure Machine Learning serverless Spark compute (preview) relies on a user-assigned managed identity attached to the workspace. You can attach a user-assigned managed identity to an existing Azure Machine Learning workspace using Azure Machine Learning CLI v2, or with `ARMClient`.
116+
If the CLI or SDK code defines an option to use managed identity, Azure Machine Learning serverless Spark compute relies on a user-assigned managed identity attached to the workspace. You can attach a user-assigned managed identity to an existing Azure Machine Learning workspace using Azure Machine Learning CLI v2, or with `ARMClient`.
124117

125118
## Next steps
126119

127-
- [Apache Spark in Azure Machine Learning (preview)](./apache-spark-azure-ml-concepts.md)
128-
- [Attach and manage a Synapse Spark pool in Azure Machine Learning (preview)](./how-to-manage-synapse-spark-pool.md)
129-
- [Interactive Data Wrangling with Apache Spark in Azure Machine Learning (preview)](./interactive-data-wrangling-with-apache-spark-azure-ml.md)
130-
- [Submit Spark jobs in Azure Machine Learning (preview)](./how-to-submit-spark-jobs.md)
120+
- [Apache Spark in Azure Machine Learning](./apache-spark-azure-ml-concepts.md)
121+
- [Attach and manage a Synapse Spark pool in Azure Machine Learning](./how-to-manage-synapse-spark-pool.md)
122+
- [Interactive Data Wrangling with Apache Spark in Azure Machine Learning](./interactive-data-wrangling-with-apache-spark-azure-ml.md)
123+
- [Submit Spark jobs in Azure Machine Learning](./how-to-submit-spark-jobs.md)
131124
- [Code samples for Spark jobs using Azure Machine Learning CLI](https://github.com/Azure/azureml-examples/tree/main/cli/jobs/spark)
132-
- [Code samples for Spark jobs using Azure Machine Learning Python SDK](https://github.com/Azure/azureml-examples/tree/main/sdk/python/jobs/spark)
125+
- [Code samples for Spark jobs using Azure Machine Learning Python SDK](https://github.com/Azure/azureml-examples/tree/main/sdk/python/jobs/spark)

articles/machine-learning/how-to-manage-synapse-spark-pool.md

Lines changed: 23 additions & 30 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: Attach and manage a Synapse Spark pool in Azure Machine Learning (preview)
2+
title: Attach and manage a Synapse Spark pool in Azure Machine Learning
33
titleSuffix: Azure Machine Learning
44
description: Learn how to attach and manage Spark pools with Azure Synapse
55
author: fbsolo-ms1
@@ -8,15 +8,13 @@ ms.reviewer: franksolomon
88
ms.service: machine-learning
99
ms.subservice: mldata
1010
ms.topic: how-to
11-
ms.date: 12/01/2022
11+
ms.date: 05/22/2023
1212
ms.custom: template-how-to, devx-track-azurecli
1313
---
1414

15-
# Attach and manage a Synapse Spark pool in Azure Machine Learning (preview)
15+
# Attach and manage a Synapse Spark pool in Azure Machine Learning
1616

17-
[!INCLUDE [preview disclaimer](../../includes/machine-learning-preview-generic-disclaimer.md)]
18-
19-
In this article, you will learn how to attach a [Synapse Spark Pool](../synapse-analytics/spark/apache-spark-concepts.md#spark-pools) in Azure Machine Learning. You can attach a Synapse Spark Pool in Azure Machine Learning in one of these ways:
17+
In this article, you'll learn how to attach a [Synapse Spark Pool](../synapse-analytics/spark/apache-spark-concepts.md#spark-pools) in Azure Machine Learning. You can attach a Synapse Spark Pool in Azure Machine Learning in one of these ways:
2018

2119
- Using Azure Machine Learning studio UI
2220
- Using Azure Machine Learning CLI
@@ -28,11 +26,6 @@ In this article, you will learn how to attach a [Synapse Spark Pool](../synapse-
2826
- An Azure Machine Learning workspace. See [Create workspace resources](./quickstart-create-resources.md).
2927
- [Create an Azure Synapse Analytics workspace in Azure portal](../synapse-analytics/quickstart-create-workspace.md).
3028
- [Create an Apache Spark pool using the Azure portal](../synapse-analytics/quickstart-create-apache-spark-pool-portal.md).
31-
- To enable this feature:
32-
1. Navigate to Azure Machine Learning studio UI.
33-
2. Select **Manage preview features** (megaphone icon) among the icons on the top right side of the screen.
34-
3. In **Managed preview feature** panel, toggle on **Run notebooks and jobs on managed Spark** feature.
35-
:::image type="content" source="media/how-to-manage-synapse-spark-pool/how_to_enable_managed_spark_preview.png" alt-text="Screenshot showing option for enabling Managed Spark preview.":::
3629

3730
# [CLI](#tab/cli)
3831
[!INCLUDE [cli v2](../../includes/machine-learning-cli-v2.md)]
@@ -59,30 +52,30 @@ Azure Machine Learning provides multiple options for attaching and managing a Sy
5952

6053
# [Studio UI](#tab/studio-ui)
6154

62-
To attach a Synapse Spark Pool using the Studio Compute tab:
55+
To attach a Synapse Spark Pool using the Studio Compute tab:
6356

6457
:::image type="content" source="media/how-to-manage-synapse-spark-pool/synapse_compute_synapse_spark_pool.png" alt-text="Screenshot showing creation of a new Synapse Spark Pool.":::
6558

6659
1. In the **Manage** section of the left pane, select **Compute**.
6760
1. Select **Attached computes**.
6861
1. On the **Attached computes** screen, select **New**, to see the options for attaching different types of computes.
69-
1. Select **Synapse Spark pool (preview)**.
62+
2. Select **Synapse Spark pool**.
7063

71-
The **Attach Synapse Spark pool (preview)** panel will open on the right side of the screen. In this panel:
64+
The **Attach Synapse Spark pool** panel will open on the right side of the screen. In this panel:
7265

73-
1. Enter a **Name**, which will refer to the attached Synapse Spark Pool inside the Azure Machine Learning.
66+
1. Enter a **Name**, which refers to the attached Synapse Spark Pool inside the Azure Machine Learning.
7467

75-
1. Select an Azure **Subscription** from the dropdown menu.
68+
2. Select an Azure **Subscription** from the dropdown menu.
7669

77-
1. Select a **Synapse workspace** from the dropdown menu.
70+
3. Select a **Synapse workspace** from the dropdown menu.
7871

79-
1. Select a **Spark Pool** from the dropdown menu.
72+
4. Select a **Spark Pool** from the dropdown menu.
8073

81-
1. Toggle the **Assign a managed identity** option, to enable it.
74+
5. Toggle the **Assign a managed identity** option, to enable it.
8275

83-
1. Select a managed **Identity type** to use with this attached Synapse Spark Pool.
76+
6. Select a managed **Identity type** to use with this attached Synapse Spark Pool.
8477

85-
1. Select **Update**, to complete the Synapse Spark Pool attach process.
78+
7. Select **Update**, to complete the Synapse Spark Pool attach process.
8679

8780
# [CLI](#tab/cli)
8881

@@ -181,7 +174,7 @@ Class SynapseSparkCompute: This is an experimental class, and may change at any
181174
}
182175
```
183176

184-
If the attached Synapse Spark pool, with the name specified in the YAML specification file, already exists in the workspace, then `az ml compute attach` command execution will update the existing pool with the information provided in the YAML specification file. You can update the
177+
If the attached Synapse Spark pool, with the name specified in the YAML specification file, already exists in the workspace, then `az ml compute attach` command execution updates the existing pool with the information provided in the YAML specification file. You can update the
185178

186179
- identity type
187180
- user assigned identities
@@ -270,7 +263,7 @@ This sample shows the expected output of the above command:
270263

271264
[!INCLUDE [sdk v2](../../includes/machine-learning-sdk-v2.md)]
272265

273-
Azure Machine Learning Python SDK (preview) provides convenient functions for attaching and managing Synapse Spark pool, using Python code in Azure Machine Learning Notebooks.
266+
Azure Machine Learning Python SDK provides convenient functions for attaching and managing Synapse Spark pool, using Python code in Azure Machine Learning Notebooks.
274267

275268
To attach a Synapse Compute using Python SDK, first create an instance of [azure.ai.ml.MLClient class](/python/api/azure-ai-ml/azure.ai.ml.mlclient). This provides convenient functions for interaction with Azure Machine Learning services. The following code sample uses `azure.identity.DefaultAzureCredential` for connecting to a workspace in resource group of a specified Azure subscription. In the following code sample, define the `SynapseSparkCompute` with the parameters:
276269
- `name` - user-defined name of the new attached Synapse Spark pool.
@@ -393,7 +386,7 @@ To ensure that the attached Synapse Spark Pool works properly, assign the [Admin
393386

394387
1. In **Role** dropdown menu, select **Synapse Administrator**.
395388

396-
1. In the **Select user** search box, start typing the name of your Azure Machine Learning Workspace. It will show you a list of attached Synapse Spark pools. Select your desired Synapse Spark pool from the list.
389+
1. In the **Select user** search box, start typing the name of your Azure Machine Learning Workspace. It shows you a list of attached Synapse Spark pools. Select your desired Synapse Spark pool from the list.
397390

398391
1. Select **Apply**.
399392

@@ -422,7 +415,7 @@ To update managed identity for the attached Synapse Spark pool:
422415
1. To assign a user-assigned managed identity:
423416
1. Select **User-assigned** as the **Identity type**.
424417
1. Select an Azure **Subscription** from the dropdown menu.
425-
1. Type the first few letters of the name of user-assigned managed identity in the box showing text **Search by name**. A list with matching user-assigned managed identity names will appear. Select the user-assigned managed identity you want from the list. You can select multiple user-assigned managed identities, and assign them to the attached Synapse Spark pool.
418+
1. Type the first few letters of the name of user-assigned managed identity in the box showing text **Search by name**. A list with matching user-assigned managed identity names appears. Select the user-assigned managed identity you want from the list. You can select multiple user-assigned managed identities, and assign them to the attached Synapse Spark pool.
426419
1. Select **Update**.
427420

428421
# [CLI](#tab/cli)
@@ -616,7 +609,7 @@ Are you sure you want to perform this operation? (y/n): y
616609

617610
[!INCLUDE [sdk v2](../../includes/machine-learning-sdk-v2.md)]
618611

619-
We will use an `MLClient.compute.begin_delete()` function call. Pass the `name` of the attached Synapse Spark pool, along with the action `Detach`, to the function. This code snippet detaches a Synapse Spark pool from an Azure Machine Learning workspace:
612+
We'll use an `MLClient.compute.begin_delete()` function call. Pass the `name` of the attached Synapse Spark pool, along with the action `Detach`, to the function. This code snippet detaches a Synapse Spark pool from an Azure Machine Learning workspace:
620613

621614
```python
622615
# import required libraries
@@ -637,12 +630,12 @@ ml_client.compute.begin_delete(name=synapse_name, action="Detach")
637630
```
638631
---
639632

640-
## Managed Synapse Spark Pool in Azure Machine Learning
633+
## Serverless Spark compute in Azure Machine Learning
641634

642-
Some user scenarios may require access to a Synapse Spark Pool, during an Azure Machine Learning job submission, without a need to attach a Spark pool. The Azure Synapse Analytics integration with Azure Machine Learning (preview) also provides a serverless Spark compute (preview) experience that allows access to a Spark pool in a job, without a need to attach the compute to a workspace first. [Learn more about the serverless Spark compute (preview) experience](interactive-data-wrangling-with-apache-spark-azure-ml.md).
635+
Some user scenarios may require access to a serverless Spark compute, during an Azure Machine Learning job submission, without a need to attach a Spark pool. The Azure Synapse Analytics integration with Azure Machine Learning also provides a serverless Spark compute experience. This allows access to a Spark compute in a job, without a need to attach the compute to a workspace first. [Learn more about the serverless Spark compute experience](interactive-data-wrangling-with-apache-spark-azure-ml.md).
643636

644637
## Next steps
645638

646-
- [Interactive Data Wrangling with Apache Spark in Azure Machine Learning (preview)](./interactive-data-wrangling-with-apache-spark-azure-ml.md)
639+
- [Interactive Data Wrangling with Apache Spark in Azure Machine Learning](./interactive-data-wrangling-with-apache-spark-azure-ml.md)
647640

648-
- [Submit Spark jobs in Azure Machine Learning (preview)](./how-to-submit-spark-jobs.md)
641+
- [Submit Spark jobs in Azure Machine Learning](./how-to-submit-spark-jobs.md)

0 commit comments

Comments
 (0)