Skip to content

Commit d6203ad

Browse files
authored
Fixing formatting
1 parent bd382ca commit d6203ad

File tree

1 file changed

+18
-18
lines changed

1 file changed

+18
-18
lines changed

articles/machine-learning/apache-spark-azure-ml-concepts.md

Lines changed: 18 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ ms.custom: cliv2, sdkv2
1717
# Apache Spark in Azure Machine Learning (preview)
1818
The Azure Machine Learning integration with Azure Synapse Analytics (preview) provides easy access to distributed computing, using the Apache Spark framework. This integration offers these Apache Spark computing experiences:
1919
- Managed (Automatic) Spark compute
20-
2. Attached Synapse Spark pool
20+
- Attached Synapse Spark pool
2121

2222
## Managed (Automatic) Spark compute
2323
Azure Machine Learning Managed (Automatic) Spark compute is the easiest way to execute distributed computing tasks in the Azure Machine Learning environment, using the Apache Spark framework. Azure Machine Learning users can use a fully managed, serverless, on-demand Apache Spark compute cluster. Those users can avoid the need to create an Azure Synapse Workspace and an Azure Synapse Spark pool. Users can define the resources, including
@@ -34,22 +34,22 @@ to access the Managed (Automatic) Spark compute in Azure Machine Learning Notebo
3434
### Some points to consider
3535
Managed (Automatic) Spark compute works well for most user scenarios that require quick access to distributed computing using Apache Spark. To make an informed decision, however, users should consider the advantages and disadvantages of this approach.
3636

37-
> ### Advantages
38-
>
39-
> - No dependencies on other Azure resources to be created for Apache Spark
40-
> - No permissions required in the subscription to create Synapse-related resources
41-
> - No need for SQL pool quota
42-
43-
> ### Disadvantages
44-
>
45-
> - Persistent Hive metastore is missing. Therefore, Managed (Automatic) Spark compute only supports in-memory Spark SQL
46-
> - No available tables or databases
47-
> - Missing Purview integration
48-
> - Linked Services not available
49-
> - Fewer Data sources/connectors
50-
> - Missing pool-level configuration
51-
> - Missing pool-level library management
52-
> - Partial support for `mssparkutils`
37+
### Advantages
38+
39+
- No dependencies on other Azure resources to be created for Apache Spark
40+
- No permissions required in the subscription to create Synapse-related resources
41+
- No need for SQL pool quota
42+
43+
### Disadvantages
44+
45+
- Persistent Hive metastore is missing. Therefore, Managed (Automatic) Spark compute only supports in-memory Spark SQL
46+
- No available tables or databases
47+
- Missing Purview integration
48+
- Linked Services not available
49+
- Fewer Data sources/connectors
50+
- Missing pool-level configuration
51+
- Missing pool-level library management
52+
- Partial support for `mssparkutils`
5353

5454
### Network configuration
5555
As of January 2023, the Managed (Automatic) Spark compute doesn't support managed VNet or private endpoint creation to Azure Synapse.
@@ -101,4 +101,4 @@ This [quickstart guide](./quickstart-spark-jobs.md) describes how to start using
101101
- [Interactive Data Wrangling with Apache Spark in Azure Machine Learning (preview)](./interactive-data-wrangling-with-apache-spark-azure-ml.md)
102102
- [Submit Spark jobs in Azure Machine Learning (preview)](./how-to-submit-spark-jobs.md)
103103
- [Code samples for Spark jobs using Azure Machine Learning CLI](https://github.com/Azure/azureml-examples/tree/main/cli/jobs/spark)
104-
- [Code samples for Spark jobs using Azure Machine Learning Python SDK](https://github.com/Azure/azureml-examples/tree/main/sdk/python/jobs/spark)
104+
- [Code samples for Spark jobs using Azure Machine Learning Python SDK](https://github.com/Azure/azureml-examples/tree/main/sdk/python/jobs/spark)

0 commit comments

Comments
 (0)