Commit 9e6b0ba

edit from Saveen
1 parent 553a805 commit 9e6b0ba

1 file changed, +19 -19 lines changed

articles/synapse-analytics/get-started.md

Lines changed: 19 additions & 19 deletions
@@ -53,7 +53,7 @@ This tutorial will guide you through all the basic steps needed to setup and use
 |**Region**|Match the region of the storage account||
 |||
 
-* Under **Select Data Lake Storage Gen 2** select the account and container you previously created
+* Under **Select Data Lake Storage Gen 2**, select the account and container you previously created
 
 > [!NOTE]
 > We refer to the storage account chosen hereas the "primary" storage account of the Synapse workspace. This account
@@ -66,7 +66,7 @@ This tutorial will guide you through all the basic steps needed to setup and use
 
 This may have already been done for you. In any case, you should verify.
 
-* Open the [Azure portal](https://portal.azure.com) open the primary storage account chosen for your workspace.
+* Open the [Azure portal](https://portal.azure.com) and open the primary storage account chosen for your workspace.
 * Select **Access control (IAM)** from the left navigation. Then assign the following roles or ensure they are already assigned.
 * Assign the workspace identity to the **Storage Blob Data Contributor** role on the storage account. The workspace identity has the same name as the workspace. In this document, the workspace name is `myworkspace` so the workspace identity is `myworkspaced`
 * Select **Save**.
@@ -75,7 +75,7 @@ This may have already been done for you. In any case, you should verify.
 
 Once your Synapse workspace is created, you have two ways to open Synapse Studio:
 * Open your Synapse workspace in the [Azure portal](https://portal.azure.com) and at the top of the **Overview** section select **Launch Synapse Studio**
-* Directly go to https://web.azuresynapse.net and login to your workspace.
+* Directly go to https://web.azuresynapse.net and sign in to your workspace.
 
 ## Create a SQL pool
 
@@ -212,7 +212,7 @@ We have data available in a table in `SQLDB1`. Now we load it into a Spark datab
 
 With notebooks you can control how render charts. The following
 code shows a simple example using the popular libraries `matplotlib` and `seaborn`. It will
-render the same kind od line chart you saw when running the SQL queries earlier.
+render the same kind of line chart you saw when running the SQL queries earlier.
 
 ```py
 %%pyspark
@@ -230,10 +230,10 @@ matplotlib.pyplot.show()
 ## Load data from a Spark table into a SQL pool table
 
 Earlier we copied data from a SQL pool table `SQLDB1.dbo.Trip` into a Spark table `nyctaxi.trip`. Then, using
-Spark, we aggregated the data into the the Spark table `nyctaxi.passengercountstats`. Now we will copy the data
+Spark, we aggregated the data into the Spark table `nyctaxi.passengercountstats`. Now we will copy the data
 from `nyctaxi.passengercountstats` into a SQL pool table called `SQLDB1.dbo.PassengerCountStats`.
 
-Run the cell below in your notebook. It will copy the aggregated Spark table back into
+Run the cell below in your notebook. It will copy the aggregated Spark table back into
 the SQL pool table.
 
 ```scala
@@ -263,11 +263,11 @@ You can orchestrate a wide variety of tasks in Azure Synapse. In this section, y
 * Select **+** then select **Pipeline**. A new pipeline will be created.
 * Navigate to the Develop hub and find the notebook you previously created.
 * Drag that notebook into the pipeline.
-* In the pipeline select **Add trigger > New/edit**.
+* In the pipeline, select **Add trigger > New/edit**.
 * In **Choose trigger** select **New**, and then in recurrence set the trigger to run every 1 hour.
 * Select **OK**.
 * Select **Publish All** and the pipeline will run every hour.
-* If you want to make the pipeline run now without waiting for the next hour select **Add trigger > New/edit**.
+* If you want to make the pipeline run now without waiting for the next hour, select **Add trigger > New/edit**.
 
 ## Working with data in a storage account
 
@@ -278,7 +278,7 @@ So far, we've covered scenarios were data resided in databases in the workspace.
 
 ### Creating CSV and Parquet files in your Storage account
 
-Run the the following code in a notebook. It creates a CSV file and a parquet file in the storage account
+Run the following code in a notebook. It creates a CSV file and a parquet file in the storage account
 
 ```py
 %%pyspark
@@ -294,7 +294,7 @@ df.write.mode("overwrite").parquet("/NYCTaxi/PassengerCountStats.parquet")
 * Select **Linked**
 * Navigate to **Storage accounts > myworkspace (Primary - contosolake)**
 * Select **users (Primary)"**
-* You should see a folder called `NYCTaxi' and inside . Inside you should see two folders 'PassengerCountStats.csv' and 'PassengerCountStats.parquet'.
+* You should see a folder called `NYCTaxi'. Inside you should see two folders 'PassengerCountStats.csv' and 'PassengerCountStats.parquet'.
 * Navigate into the `PassengerCountStats.parquet' folder.
 * Right-click on the parquet file inside, and select **new notebook**, it will create a notebook with a cell like this:
 
@@ -315,23 +315,23 @@ df.write.mode("overwrite").parquet("/NYCTaxi/PassengerCountStats.parquet")
 ) AS [r];
 ```
 
-* In the script the **Attach to** field will be set to **SQL on-demand**.
+* In the script, the **Attach to** field will be set to **SQL on-demand**.
 * Run the script.
 
 ## Visualize data with Power BI
 
-From the NYX taxi data, we created arregated datasets in two tables:
+From the NYX taxi data, we created aggregated datasets in two tables:
 * `nyctaxi.passengercountstats`
 * `SQLDB1.dbo.PassengerCountStats`
 
-You can link a Power BI workspace to you Synapse workspace. This allows you to easily get data into your PowerBI worksapce and you can edit your PowerBI reports directly in your Synapse workspace.
+You can link a Power BI workspace to your Synapse workspace. This allows you to easily get data into your Power BI workspace and you can edit your Power BI reports directly in your Synapse workspace.
 
 ### Create a Power BI Workspace
 
 * Log into [powerbi.microsoft.com](https://powerbi.microsoft.com/).
 * Create a new Power BI workspace called `NYCTaxiWorkspace1`.
 
-### Link your Synapse Workspace to your new PowerBI workspace
+### Link your Synapse Workspace to your new Power BI workspace
 
 * In Synapse Studio, navigate to the **Manage > Linked Services**.
 * Select **+ New** and select **Connect to Power BI** and set these fields:
@@ -370,16 +370,16 @@ You can link a Power BI workspace to you Synapse workspace. This allows you to e
 ### Configure authentication for your dataset
 
 * Open [powerbi.microsoft.com](https://powerbi.microsoft.com/) and **Sign in**
-* At the left, under **Workspaces** select the the `NYCTaxiWorkspace1` workspace.
+* At the left, under **Workspaces** select the `NYCTaxiWorkspace1` workspace.
 * Inside that workspace you should see a dataset called `Passenger Analysis` and a report called `Passenger Analysis`.
 * Hover over the `PassengerAnalysis` dataset and select the icon with the three dots and select **Settings**.
 * In **Data source credentials** set the **Authentication method** to **OAuth2** and select **Sign in**.
 
-### Edit a report report in Synapse Studio
+### Edit a report in Synapse Studio
 
 * Go back to Synapse Studio and select **Close and refresh**
-* Navigate to the **Devlop** hub
-* Hover over **power BI** and click on the three Refresh the **Power BI reports** node.
+* Navigate to the **Develop** hub
+* Hover over **Power BI** and click on the refresh the **Power BI reports** node.
 * Now under the **Power BI** you should see:
 * Under **NYCTaxiWorkspace1 > Power BI datasets**, a new dataset called **PassengerAnalysis**.
 * Under **NYCTaxiWorkspace1 > Power BI reports**, a new report called **PassengerAnalysis**.
@@ -388,7 +388,7 @@ You can link a Power BI workspace to you Synapse workspace. This allows you to e
 
 ## Monitor activities
 
-* In Synapse Studio, Navigate to the monitor hub.
+* In Synapse Studio, navigate to the monitor hub.
 * In this location you can see a history of all the activities taking place in the workspace and which ones are active now.
 * Explore the **Pipeline runs**, **Apache Spark applications**, and **SQL requests** and you can see what you've already done in the workspace.
 