You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/storage/blobs/storage-blob-inventory-report-analytics.md
+40-13Lines changed: 40 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -47,9 +47,9 @@ You might have to wait up to 24 hours after enabling inventory reports for your
47
47
48
48
2. In the Synapse workspace, assign the **Contributor** role to your user identity. See [Azure RBAC: Owner role for the workspace](../../synapse-analytics/get-started-add-admin.md#azure-rbac-owner-role-for-the-workspace).
49
49
50
-
3. Give the Synapse workspace permission to access the inventory reports in your storage account by navigating to your **inventory report account**, and then assigning the **Storage Blob Data Contributor** role to the system managed identity of the workspace. See [Assign Azure roles using the Azure portal](../../role-based-access-control/role-assignments-portal.md).
50
+
3. Give the Synapse workspace permission to access the inventory reports in your storage account by navigating to your inventory report account, and then assigning the **Storage Blob Data Contributor** role to the system managed identity of the workspace. See [Assign Azure roles using the Azure portal](../../role-based-access-control/role-assignments-portal.md).
51
51
52
-
4. Navigate to **primary storage account** and assign the **Blob Storage Contributor** role to your user identity.
52
+
4. Navigate to primary storage account and assign the **Blob Storage Contributor** role to your user identity.
53
53
54
54
## Set up Synapse Studio
55
55
@@ -77,7 +77,7 @@ In this section, you'll generate statistical data that you'll visualize in a rep
77
77
78
78
- Set `accessKey` to the account key of the inventory report account.
79
79
80
-
3. Upload the **BlobInventoryStorageAccountConfiguration.json** file to the container in your primary storage account that you specified when you created the Synapse workspace.
80
+
3. Upload this file to the container in your primary storage account that you specified when you created the Synapse workspace.
81
81
82
82
#### Import the sample PySpark Notebook
83
83
@@ -112,16 +112,24 @@ In this section, you'll generate statistical data that you'll visualize in a rep
112
112
113
113
#### Run the PySpark notebook
114
114
115
-
In the PySpark notebook, select **Run all**.
115
+
1.In the PySpark notebook, select **Run all**.
116
116
117
-
It will take a few minutes to start the Spark session and another few minutes to process the inventory reports. The first run could take a while if there are numerous inventory reports to process. Subsequent runs will only process the new inventory reports created since the last run.
117
+
It will take a few minutes to start the Spark session and another few minutes to process the inventory reports. The first run could take a while if there are numerous inventory reports to process. Subsequent runs will only process the new inventory reports created since the last run.
118
118
119
-
> [!NOTE]
120
-
> If you make any changes to the notebook will the notebook is running, make sure to publish those changes by using the **Publish** button.
119
+
> [!NOTE]
120
+
> If you make any changes to the notebook will the notebook is running, make sure to publish those changes by using the **Publish** button.
121
+
122
+
2. Verify that the notebook ran successfully by selecting the **Data** tab.
123
+
124
+
A database named **reportdata** should appear in the **Workspace** tab of the **Data** pane.
121
125
126
+
> [!div class="mx-imgBorder"]
127
+
> 
128
+
129
+
122
130
## Visualize the data
123
131
124
-
1. Download the [ReportAnalysis.pbit](https://github.com/microsoft/Blob-Inventory-Report-Analytics/blob/main/src/ReportAnalysis.pbit) sample file.
132
+
1. Download the [ReportAnalysis.pbit](https://github.com/microsoft/Blob-Inventory-Report-Analytics/blob/main/src/ReportAnalysis.pbit) sample report file.
125
133
126
134
2. Open Power BI Desktop. For installation guidance, see [Get Power BI Desktop](/power-bi/fundamentals/desktop-get-the-desktop).
127
135
@@ -132,7 +140,7 @@ It will take a few minutes to start the Spark session and another few minutes to
132
140
> [!div class="mx-imgBorder"]
133
141
> 
134
142
135
-
4. Browse to the location of the **ReportAnalysis.ipynb** file that you downloaded, and then select **Open**.
143
+
4. Browse to the location of the **ReportAnalysis.pbit** file that you downloaded, and then select **Open**.
136
144
137
145
A dialog box appears which asks you to provide the name of the Synapse workspace and the data base name.
138
146
@@ -141,13 +149,32 @@ It will take a few minutes to start the Spark session and another few minutes to
A report appears which provides visualizations of the data retrieved by the notebook.
152
+
A report appears which provides visualizations of the data retrieved by the notebook. The following images show the types of the charts and graphs that appear in this report.
145
153
154
+
> [!div class="mx-imgBorder"]
155
+
> 
146
156
147
-
## Common errors
157
+
> [!div class="mx-imgBorder"]
158
+
> 
148
159
149
-
Figure out how best to introduce these. Maybe just link to the sample.
160
+
> [!div class="mx-imgBorder"]
161
+
> 
150
162
151
163
## Next steps
152
164
153
-
Need next steps.
165
+
- Learn about ways to analyze individual containers in your storage account. See these articles:
166
+
167
+
[Calculate blob count and total size per container using Azure Storage inventory](calculate-blob-count-size.md)
168
+
169
+
[How to calculate Container Level Statistics in Azure Blob Storage with Azure Databricks](https://techcommunity.microsoft.com/t5/azure-paas-blog/how-to-calculate-container-level-statistics-in-azure-blob/ba-p/3614650)
170
+
171
+
- Learn about ways to optimize your costs based on the analysis of your blobs and containers. See these articles:
172
+
173
+
[Plan and manage costs for Azure Blob Storage](storage-plan-manage-costs.md)
174
+
175
+
[Estimate the cost of archiving data](archive-cost-estimation.md)
176
+
177
+
[Optimize costs by automatically managing the data lifecycle](lifecycle-management-overview.md)
0 commit comments