You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/synapse-analytics/get-started.md
+19-19Lines changed: 19 additions & 19 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -53,7 +53,7 @@ This tutorial will guide you through all the basic steps needed to setup and use
53
53
|**Region**|Match the region of the storage account||
54
54
|||
55
55
56
-
* Under **Select Data Lake Storage Gen 2** select the account and container you previously created
56
+
* Under **Select Data Lake Storage Gen 2**, select the account and container you previously created
57
57
58
58
> [!NOTE]
59
59
> We refer to the storage account chosen hereas the "primary" storage account of the Synapse workspace. This account
@@ -66,7 +66,7 @@ This tutorial will guide you through all the basic steps needed to setup and use
66
66
67
67
This may have already been done for you. In any case, you should verify.
68
68
69
-
* Open the [Azure portal](https://portal.azure.com) open the primary storage account chosen for your workspace.
69
+
* Open the [Azure portal](https://portal.azure.com)and open the primary storage account chosen for your workspace.
70
70
* Select **Access control (IAM)** from the left navigation. Then assign the following roles or ensure they are already assigned.
71
71
* Assign the workspace identity to the **Storage Blob Data Contributor** role on the storage account. The workspace identity has the same name as the workspace. In this document, the workspace name is `myworkspace` so the workspace identity is `myworkspaced`
72
72
* Select **Save**.
@@ -75,7 +75,7 @@ This may have already been done for you. In any case, you should verify.
75
75
76
76
Once your Synapse workspace is created, you have two ways to open Synapse Studio:
77
77
* Open your Synapse workspace in the [Azure portal](https://portal.azure.com) and at the top of the **Overview** section select **Launch Synapse Studio**
78
-
* Directly go to https://web.azuresynapse.net and login to your workspace.
78
+
* Directly go to https://web.azuresynapse.net and sign in to your workspace.
79
79
80
80
## Create a SQL pool
81
81
@@ -212,7 +212,7 @@ We have data available in a table in `SQLDB1`. Now we load it into a Spark datab
212
212
213
213
With notebooks you can control how render charts. The following
214
214
code shows a simple example using the popular libraries `matplotlib` and `seaborn`. It will
215
-
render the same kind od line chart you saw when running the SQL queries earlier.
215
+
render the same kind of line chart you saw when running the SQL queries earlier.
216
216
217
217
```py
218
218
%%pyspark
@@ -230,10 +230,10 @@ matplotlib.pyplot.show()
230
230
## Load data from a Spark table into a SQL pool table
231
231
232
232
Earlier we copied data from a SQL pool table `SQLDB1.dbo.Trip` into a Spark table `nyctaxi.trip`. Then, using
233
-
Spark, we aggregated the data into the the Spark table `nyctaxi.passengercountstats`. Now we will copy the data
233
+
Spark, we aggregated the data into the Spark table `nyctaxi.passengercountstats`. Now we will copy the data
234
234
from `nyctaxi.passengercountstats` into a SQL pool table called `SQLDB1.dbo.PassengerCountStats`.
235
235
236
-
Run the cell below in your notebook. It will copy the aggregated Spark table back into
236
+
Run the cell below in your notebook. It will copy the aggregated Spark table back into
237
237
the SQL pool table.
238
238
239
239
```scala
@@ -263,11 +263,11 @@ You can orchestrate a wide variety of tasks in Azure Synapse. In this section, y
263
263
*Select**+** then select**Pipeline**. A new pipeline will be created.
264
264
* Navigate to the Develop hub and find the notebook you previously created.
265
265
* Drag that notebook into the pipeline.
266
-
*In the pipeline select**Add trigger > New/edit**.
266
+
*In the pipeline,select**Add trigger > New/edit**.
267
267
*In**Choose trigger**select**New**, and then in recurrence set the trigger to run every 1 hour.
268
268
*Select**OK**.
269
269
*Select**Publish All**and the pipeline will run every hour.
270
-
* If you want to make the pipeline run now without waiting for the next hour select**Add trigger > New/edit**.
270
+
* If you want to make the pipeline run now without waiting for the next hour,select**Add trigger > New/edit**.
271
271
272
272
## Working with data in a storage account
273
273
@@ -278,7 +278,7 @@ So far, we've covered scenarios were data resided in databases in the workspace.
278
278
279
279
### Creating CSV and Parquet files in your Storage account
280
280
281
-
Run the the following code in a notebook. It creates a CSV file and a parquet file in the storage account
281
+
Run the following code in a notebook. It creates a CSV file and a parquet file in the storage account
* In the script the **Attach to** field will be set to **SQL on-demand**.
318
+
* In the script, the **Attach to** field will be set to **SQL on-demand**.
319
319
* Run the script.
320
320
321
321
## Visualize data with Power BI
322
322
323
-
From the NYX taxi data, we created arregated datasets in two tables:
323
+
From the NYX taxi data, we created aggregated datasets in two tables:
324
324
*`nyctaxi.passengercountstats`
325
325
*`SQLDB1.dbo.PassengerCountStats`
326
326
327
-
You can link a Power BI workspace to you Synapse workspace. This allows you to easily get data into your PowerBI worksapce and you can edit your PowerBI reports directly in your Synapse workspace.
327
+
You can link a Power BI workspace to your Synapse workspace. This allows you to easily get data into your Power BI workspace and you can edit your Power BI reports directly in your Synapse workspace.
328
328
329
329
### Create a Power BI Workspace
330
330
331
331
* Log into [powerbi.microsoft.com](https://powerbi.microsoft.com/).
332
332
* Create a new Power BI workspace called `NYCTaxiWorkspace1`.
333
333
334
-
### Link your Synapse Workspace to your new PowerBI workspace
334
+
### Link your Synapse Workspace to your new Power BI workspace
335
335
336
336
* In Synapse Studio, navigate to the **Manage > Linked Services**.
337
337
* Select **+ New**and select **Connect to Power BI**andset these fields:
@@ -370,16 +370,16 @@ You can link a Power BI workspace to you Synapse workspace. This allows you to e
370
370
### Configure authentication for your dataset
371
371
372
372
* Open [powerbi.microsoft.com](https://powerbi.microsoft.com/) and**Sign in**
373
-
* At the left, under **Workspaces** select the the `NYCTaxiWorkspace1` workspace.
373
+
* At the left, under **Workspaces** select the `NYCTaxiWorkspace1` workspace.
374
374
* Inside that workspace you should see a dataset called `Passenger Analysis`and a report called `Passenger Analysis`.
375
375
* Hover over the `PassengerAnalysis` dataset and select the icon with the three dots and select **Settings**.
376
376
* In **Data source credentials**set the **Authentication method** to **OAuth2**and select **Sign in**.
377
377
378
-
### Edit a report report in Synapse Studio
378
+
### Edit a report in Synapse Studio
379
379
380
380
* Go back to Synapse Studio and select **Close and refresh**
381
-
* Navigate to the **Devlop** hub
382
-
* Hover over **powerBI**and click on the three Refresh the **Power BI reports** node.
381
+
* Navigate to the **Develop** hub
382
+
* Hover over **PowerBI**and click on the refresh the **Power BI reports** node.
383
383
* Now under the **Power BI** you should see:
384
384
* Under **NYCTaxiWorkspace1 > Power BI datasets**, a new dataset called **PassengerAnalysis**.
385
385
* Under **NYCTaxiWorkspace1 > Power BI reports**, a new report called **PassengerAnalysis**.
@@ -388,7 +388,7 @@ You can link a Power BI workspace to you Synapse workspace. This allows you to e
388
388
389
389
## Monitor activities
390
390
391
-
* In Synapse Studio, Navigate to the monitor hub.
391
+
* In Synapse Studio, navigate to the monitor hub.
392
392
* In this location you can see a history of all the activities taking place in the workspace and which ones are active now.
393
393
* Explore the **Pipeline runs**, **Apache Spark applications**, and**SQL requests**and you can see what you've already done in the workspace.
0 commit comments