Skip to content

Commit fa67284

Browse files
committed
moving and renaming spack notebook quickstart
1 parent 0316334 commit fa67284

File tree

3 files changed

+28
-15
lines changed

3 files changed

+28
-15
lines changed

.openpublishing.redirection.json

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -17828,6 +17828,11 @@
1782817828
"redirect_url": "/azure/synapse-analytics/sql-data-warehouse/performance-tuning-ordered-cci",
1782917829
"redirect_document_id": true
1783017830
},
17831+
{
17832+
"source_path": "articles/synapse-analytics/apache-spark-notebook-create-spark-use-sql.md",
17833+
"redirect_url": "/azure/synapse-analytics/quickstart-apache-spark-notebook",
17834+
"redirect_document_id": true
17835+
},
1783117836
{
1783217837
"source_path": "articles/sql-database/sql-database-auditing-get-started.md",
1783317838
"redirect_url": "/azure/sql-database/sql-database-auditing",

articles/synapse-analytics/spark/apache-spark-notebook-create-spark-use-sql.md renamed to articles/synapse-analytics/quickstart-apache-spark-notebook.md

Lines changed: 21 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
---
2-
title: 'Quickstart: Create an Apache Spark pool (preview) in Azure Synapse Analytics'
2+
title: 'Quickstart: Create an Apache Spark notebook'
33
description: This quickstart shows how to use the web tools to create an Apache Spark pool (preview) in Azure Synapse Analytics, and run a Spark SQL query.
44
services: synapse-analytics
55
author: euangMS
@@ -11,25 +11,33 @@ ms.topic: quickstart
1111
ms.date: 04/15/2020
1212
---
1313

14-
# Quickstart: Create an Apache Spark pool (preview) in Synapse Analytics using web tools
14+
# Quickstart: Create an Apache Spark pool (preview) in Azure Synapse Analytics using web tools
1515

16-
In this quickstart, you learn how to create an Apache Spark pool (preview) in Azure Synapse Analytics using web tools. You then learn to connect to the Apache Spark pool and run Spark SQL queries against files and tables. Apache Spark enables fast data analytics and cluster computing using in-memory processing. For information on Spark on Synapse Analytics, see [Overview: Apache Spark on Azure Synapse Analytics](apache-spark-overview.md).
16+
In this quickstart, you learn how to create an Apache Spark pool (preview) in Azure Synapse using web tools. You then learn to connect to the Apache Spark pool and run Spark SQL queries against files and tables. Apache Spark enables fast data analytics and cluster computing using in-memory processing. For information on Spark in Azure Synapse, see [Overview: Apache Spark on Azure Synapse](apache-spark-overview.md).
1717

1818
> [!IMPORTANT]
1919
> Billing for Spark instances is prorated per minute, whether you are using them or not. Be sure to shutdown your Spark instance after you have finished using it, or set a short timeout. For more information, see the **Clean up resources** section of this article.
2020
21-
If you don't have an Azure subscription, [create a free account](https://azure.microsoft.com/free/) before you begin.
21+
If you don't have an Azure subscription, [create a free account before you begin](https:/azure.microsoft.com/free/).
22+
23+
## Prerequisites
24+
25+
- Azure subscription - [create one for free](https:/azure.microsoft.com/free/)
26+
- [Synapse Analytics workspace](quickstart-create-workspace.md)
27+
- [Apache Spark pool](quickstart-create-apache-spark-pool.md)
2228

23-
## Create an Apache Spark pool
29+
## Sign in to the Azure portal
2430

25-
This article shows you how to create a new Apache Spark pool using web tools.
31+
Sign in to the [Azure portal](https:/portal.azure.com/)
32+
33+
If you don't have an Azure subscription, [create a free account](https://azure.microsoft.com/free/) before you begin.
2634

2735
## Create a notebook
2836

2937
A notebook is an interactive environment that supports various programming languages. The notebook allows you to interact with your data, combine code with markdown, text and perform simple visualizations.
3038

31-
1. From the Azure portal view for the Synapse Analytics workspace you want to use, select **Launch Synapse Studio**.
32-
2. Once Synapse Analytics Studio has launched, select **Develop**. Then, hover over the **Notebooks** entry. Select the ellipsis (**...**).
39+
1. From the Azure portal view for the Azure Synapse workspace you want to use, select **Launch Synapse Studio**.
40+
2. Once Synapse Studio has launched, select **Develop**. Then, hover over the **Notebooks** entry. Select the ellipsis (**...**).
3341
3. From there, select **New notebook**. A new notebook is created and opened with an automatically generated name.
3442
![New notebook](./media/apache-spark-notebook-create-spark-use-sql/spark-get-started-new-notebook.png "New notebook")
3543

@@ -85,7 +93,7 @@ SQL (Structured Query Language) is the most common and widely used language for
8593
SHOW TABLES
8694
```
8795

88-
When you use a Notebook with your Synapse Analytics Apache Spark pool, you get a preset `sqlContext` that you can use to run queries using Spark SQL. `%%sql` tells the notebook to use the preset `sqlContext` to run the query. The query retrieves the top 10 rows from a system table that comes with all Synapse Analytics Apache Spark pools by default.
96+
When you use a Notebook with your Azure Synapse Apache Spark pool, you get a preset `sqlContext` that you can use to run queries using Spark SQL. `%%sql` tells the notebook to use the preset `sqlContext` to run the query. The query retrieves the top 10 rows from a system table that comes with all Azure Synapse Apache Spark pools by default.
8997

9098
2. Run another query to see the data in `demo_df`.
9199

@@ -98,7 +106,7 @@ SQL (Structured Query Language) is the most common and widely used language for
98106

99107
By default the results view shows a grid, but there is a view switcher underneath the grid that allows the view to switch between grid and graph views.
100108

101-
![Query output in Synapse Analytics Spark](./media/apache-spark-notebook-create-spark-use-sql/spark-get-started-query.png "Query output in Synapse Analytics Spark")
109+
![Query output in Azure Synapse Spark](./media/apache-spark-notebook-create-spark-use-sql/spark-get-started-query.png "Query output in Azure Synapse Spark")
102110

103111
3. In the **View** switcher, select **Chart**
104112
4. Select the **View options** icon from the far right-hand side.
@@ -108,7 +116,7 @@ SQL (Structured Query Language) is the most common and widely used language for
108116
8. In the **Aggregation** field, select to "AVG".
109117
9. Select **Apply**.
110118

111-
![Chart output in Synapse Analytics Spark](./media/apache-spark-notebook-create-spark-use-sql/spark-get-started-query-chart-output.png "Chart output in Synapse Analytics Spark")
119+
![Chart output in Azure Synapse Spark](./media/apache-spark-notebook-create-spark-use-sql/spark-get-started-query-chart-output.png "Chart output in Azure Synapse Spark")
112120

113121
10. It is possible to get the same experience of running SQL but without having to switch languages. You can do this by replacing the SQL cell above with this PySpark cell, the output experience is the same because the **display** command is used:
114122

@@ -120,13 +128,13 @@ SQL (Structured Query Language) is the most common and widely used language for
120128

121129
## Clean up resources
122130

123-
Synapse Analytics saves your data in Azure Data Lake Storage. You can safely let a Spark instance shut down when it is not in use. You are charged for a Synapse Analytics Apache Spark pool as long as it is running, even when it is not in use. Since the charges for the pool are many times more than the charges for storage, it makes economic sense to let Spark instances shut down when they are not in use.
131+
Azure Synapse saves your data in Azure Data Lake Storage. You can safely let a Spark instance shut down when it is not in use. You are charged for an Azure Synapse Apache Spark pool as long as it is running, even when it is not in use. Since the charges for the pool are many times more than the charges for storage, it makes economic sense to let Spark instances shut down when they are not in use.
124132

125133
To ensure the Spark instance is shut down, end any connected sessions(notebooks). The pool shuts down when the **idle time** specified in the Apache Spark pool is reached. You can also select **end session** from the status bar at the bottom of the notebook.
126134

127135
## Next steps
128136

129-
In this quickstart, you learned how to create a Synapse Analytics Apache Spark pool and run a basic Spark SQL query.
137+
In this quickstart, you learned how to create a Azure Synapse Apache Spark pool and run a basic Spark SQL query.
130138

131139
- [Azure Synapse Analytics](../overview-what-is.md)
132140
- [.NET for Apache Spark documentation](/dotnet/spark?toc=/azure/synapse-analytics/toc.json&bc=/azure/synapse-analytics/breadcrumb/toc.json)

articles/synapse-analytics/toc.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -20,8 +20,8 @@
2020
href: quickstart-synapse-studio.md
2121
- name: Create pools
2222
items:
23-
- name: Create Apache Spark pool using web tools
24-
href: ./spark/apache-spark-notebook-create-spark-use-sql.md
23+
- name: Create Apache Spark notebook using web tools
24+
href: quickstart-apache-spark-notebook.md
2525
- name: Create Apache Spark pool using Azure portal
2626
href: quickstart-create-apache-spark-pool.md
2727
- name: Create SQL pool

0 commit comments

Comments
 (0)