You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/hdinsight/spark/apache-spark-use-bi-tools.md
+19-19Lines changed: 19 additions & 19 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,7 +4,7 @@ description: Tutorial - Use Microsoft Power BI to visualize Apache Spark data st
4
4
ms.service: hdinsight
5
5
ms.topic: tutorial
6
6
ms.custom: hdinsightactive, mvc
7
-
ms.date: 05/26/2023
7
+
ms.date: 04/25/2024
8
8
#Customer intent: As a developer new to Apache Spark and to Apache Spark in Azure HDInsight, I want to learn how to virtualize Spark data in BI tools.
9
9
---
10
10
@@ -39,7 +39,7 @@ The [Jupyter Notebook](https://jupyter.org/) that you created in the [previous t
39
39
40
40
The output looks like:
41
41
42
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-show-tables.png" alt-text="Show tables in Spark." border="true":::
42
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-show-tables.png" alt-text="Screenshot showing tables in Spark." border="true":::
43
43
44
44
If you closed the notebook before starting this tutorial, `hvactemptable` is cleaned up, so it's not included in the output. Only Hive tables that are stored in the metastore (indicated by **False** under the **isTemporary** column) can be accessed from the BI tools. In this tutorial, you connect to the **hvac** table that you created.
45
45
@@ -52,7 +52,7 @@ The [Jupyter Notebook](https://jupyter.org/) that you created in the [previous t
52
52
53
53
The output looks like:
54
54
55
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-select-limit.png" alt-text="Show rows from hvac table in Spark." border="true":::
55
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-select-limit.png" alt-text="Screenshot showing rows from hvac table in Spark." border="true":::
56
56
57
57
3. From the **File** menu on the notebook, select **Close and Halt**. Shut down the notebook to release the resources.
58
58
@@ -68,11 +68,11 @@ The first steps in working with Spark are to connect to the cluster in Power BI
68
68
69
69
2. From the **Home** tab, navigate to **Get Data** > **More..**.
70
70
71
-
:::image type="content" source="./media/apache-spark-use-bi-tools/hdinsight-spark-power-bi-desktop-get-data.png " alt-text="Get data into Power BI Desktop from HDInsight Apache Spark." border="true":::
71
+
:::image type="content" source="./media/apache-spark-use-bi-tools/hdinsight-spark-power-bi-desktop-get-data.png " alt-text="Screenshot showing get data into Power BI Desktop from HDInsight Apache Spark." border="true":::
72
72
73
73
3. Enter `Spark` in the search box, select **Azure HDInsight Spark**, and then select **Connect**.
74
74
75
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-import-data-power-bi.png " alt-text="Get data into Power BI from Apache Spark BI." border="true":::
75
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-import-data-power-bi.png " alt-text="Screenshot showing get data into Power BI from Apache Spark BI." border="true":::
76
76
77
77
4. Enter your cluster URL (in the form `mysparkcluster.azurehdinsight.net`) in the **Server** text box.
78
78
@@ -84,7 +84,7 @@ The first steps in working with Spark are to connect to the cluster in Power BI
84
84
85
85
7. Select the `hvac` table, wait to see a preview of the data, and then select **Load**.
86
86
87
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-select-table.png " alt-text="Spark cluster user name and password." border="true":::
87
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-select-table.png " alt-text="Screenshot showing Spark cluster user name and password." border="true":::
88
88
89
89
Power BI Desktop has the information it needs to connect to the Spark cluster and load data from the `hvac` table. The table and its columns are displayed in the **Fields** pane.
90
90
@@ -94,21 +94,21 @@ The first steps in working with Spark are to connect to the cluster in Power BI
94
94
95
95
2. Drag the **BuildingID** field to **Axis**, and drag the **ActualTemp** and **TargetTemp** fields to **Value**.
96
96
97
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-add-value-columns.png " alt-text="add value columns." border="true":::
97
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-add-value-columns.png " alt-text="Screenshot showing add value columns." border="true":::
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-area-graph-sum.png " alt-text="Screenshot showing area graph sum." border="true":::
102
102
103
103
By default the visualization shows the sum for **ActualTemp** and **TargetTemp**. Select the down arrow next to **ActualTemp** and **TragetTemp** in the Visualizations pane, you can see **Sum** is selected.
104
104
105
105
3. Select the down arrows next to **ActualTemp** and **TragetTemp** in the Visualizations pane, select **Average** to get an average of actual and target temperatures for each building.
106
106
107
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-average-of-values.png " alt-text="average of values." border="true":::
107
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-average-of-values.png " alt-text="Screenshot showing average of values." border="true":::
108
108
109
109
Your data visualization shall be similar to the one in the screenshot. Move your cursor over the visualization to get tool tips with relevant data.
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-area-graph.png " alt-text="Screenshot showing area graph" border="true":::
112
112
113
113
9. Navigate to **File** > **Save**, enter the name `BuildingTemperature` for the file, then select **Save**.
114
114
@@ -120,43 +120,43 @@ The Power BI service allows you to share reports and dashboards across your orga
120
120
121
121
1. From the **Home** tab, select **Publish**.
122
122
123
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-publish.png " alt-text="Publish from Power BI Desktop." border="true"::: Desktop" border="true":::
123
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-publish.png " alt-text="Screenshot showing publish from Power BI Desktop." border="true":::
124
124
125
125
1. Select a workspace to publish your dataset and report to, then select **Select**. In the following image, the default **My Workspace** is selected.
126
126
127
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-select-workspace.png " alt-text="Select workspace to publish dataset and report to." border="true":::
127
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-select-workspace.png " alt-text="Screenshot showing select workspace to publish dataset and report to." border="true":::
128
128
129
129
1. After the publishing is succeeded, select **Open 'BuildingTemperature.pbix' in Power BI**.
130
130
131
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-publish-success.png " alt-text="Publish success, click to enter credentials." border="true":::
131
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-publish-success.png " alt-text="Screenshot showing publish success, click to enter credentials." border="true":::
132
132
133
133
1. In the Power BI service, select **Enter credentials**.
134
134
135
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-enter-credentials.png " alt-text="Enter credentials in Power BI service." border="true":::" border="true":::
135
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-enter-credentials.png " alt-text="Screenshot showing how to enter credentials in Power BI service." border="true":::
136
136
137
137
1. Select **Edit credentials**.
138
138
139
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-edit-credentials.png " alt-text="Edit credentials in Power BI service." border="true":::
139
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-edit-credentials.png " alt-text="Screenshot showing Edit credentials in Power BI service." border="true":::
140
140
141
141
1. Enter the HDInsight login account information, and then select **Sign in**. The default account name is *admin*.
142
142
143
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-sign-in.png " alt-text="Sign in to Spark cluster." border="true":::Spark cluster" border="true":::
143
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-sign-in.png " alt-text="Screenshot showing Sign in to Spark cluster." border="true"
144
144
145
145
1. In the left pane, go to **Workspaces** > **My Workspace** > **REPORTS**, then select **BuildingTemperature**.
146
146
147
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-service-left-pane.png " alt-text="Report listed under reports in left pane." border="true":::
147
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-service-left-pane.png " alt-text="Screenshot showing Report listed under reports in left pane." border="true":::
148
148
149
149
You should also see **BuildingTemperature** listed under **DATASETS** in the left pane.
150
150
151
151
The visual you created in Power BI Desktop is now available in the Power BI service.
152
152
153
153
1. Hover your cursor over the visualization, and then select the pin icon on the upper right corner.
154
154
155
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-service-report.png " alt-text="Report in the Power BI service." border="true":::
155
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-service-report.png " alt-text="Screenshot showing report in the Power BI service." border="true":::
156
156
157
157
1. Select "New dashboard", enter the name `Building temperature`, then select **Pin**.
158
158
159
-
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-pin-dashboard.png " alt-text="Pin to new dashboard." border="true"::: to new dashboard" border="true":::
159
+
:::image type="content" source="./media/apache-spark-use-bi-tools/apache-spark-bi-pin-dashboard.png " alt-text="Screenshot showing pin to new dashboard." border="true":::
0 commit comments