You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/purview/data-stewardship.md
+30-65Lines changed: 30 additions & 65 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,12 +7,12 @@ ms.service: purview
7
7
ms.subservice: purview-insights
8
8
ms.custom: event-tier1-build-2022
9
9
ms.topic: how-to
10
-
ms.date: 05/16/2022
10
+
ms.date: 04/25/2023
11
11
---
12
12
13
13
# Get insights into data stewardship from Microsoft Purview
14
14
15
-
As described in the [insights concepts](concept-insights.md), data stewardship is report that is part of the "Health" section of the Data Estate Insights App. This report offers a one-stop shop experience for data, governance, and quality focused users like chief data officers and data stewards to get actionable insights into key areas of gap in their data estate, for better governance.
15
+
As described in the [insights concepts](concept-insights.md), the data stewardship report is part of the "Health" section of the Data Estate Insights App. This report offers a one-stop shop experience for data, governance, and quality focused users like chief data officers and data stewards to get actionable insights into key areas of gap in their data estate.
16
16
17
17
In this guide, you'll learn how to:
18
18
@@ -28,17 +28,15 @@ Before getting started with Microsoft Purview Data Estate Insights, make sure th
28
28
29
29
* Set up and completed a scan your storage source.
30
30
31
+
*[Enable and schedule your data estate insights reports](how-to-schedule-data-estate-insights.md).
32
+
31
33
For more information to create and complete a scan, see [the manage data sources in Microsoft Purview article](manage-data-sources.md).
32
34
33
35
## Understand your data estate and catalog health in Data Estate Insights
34
36
35
37
In Microsoft Purview Data Estate Insights, you can get an overview of all assets inventoried in the Data Map, and any key gaps that can be closed by governance stakeholders, for better governance of the data estate.
36
38
37
-
1. Navigate to your Microsoft Purview account in the Azure portal.
38
-
39
-
1. On the **Overview** page, in the **Get Started** section, select the **Open Microsoft Purview governance portal** tile.
40
-
41
-
:::image type="content" source="./media/data-stewardship/portal-access.png" alt-text="Screenshot of Microsoft Purview account in Azure portal with the Microsoft Purview governance portal button highlighted.":::
39
+
1. Access the [Microsoft Purview Governance Portal](https://web.purview.azure.com/) and open your Microsoft Purview account.
42
40
43
41
1. On the Microsoft Purview **Home** page, select **Data Estate Insights** on the left menu.
44
42
@@ -48,64 +46,57 @@ In Microsoft Purview Data Estate Insights, you can get an overview of all assets
48
46
49
47
:::image type="content" source="./media/data-stewardship/data-stewardship-table-of-contents.png" alt-text="Screenshot of the Microsoft Purview governance portal Data Estate Insights menu with Data Stewardship highlighted under the Health section.":::
50
48
49
+
## View data stewardship dashboard
51
50
52
-
### View data stewardship dashboard
51
+
The dashboard is purpose-built for the governance and quality focused users, like data stewards and chief data officers, to understand the data estate health of their organization. The dashboard shows high level KPIs that need to reduce governance risks:
53
52
54
-
The dashboard is purpose-built for the governance and quality focused users, like data stewards and chief data officers, to understand the data estate health and catalog adoption health of their organization. The dashboard shows high level KPIs that need to reduce governance risks:
55
-
56
-
***Asset curation**: All data assets are categorized into three buckets - "Fully curated", "Partially curated" and "Not curated", based on certain attributes of assets being present. An asset is "Fully curated" if it has at least one classification tag, an assigned Data Owner and a description. If any of these attributes is missing, but not all, then the asset is categorized as "Partially curated" and if all of them are missing, then it's "Not curated".
57
-
***Asset data ownership**: Assets that have the owner attribute within "Contacts" tab as blank are categorized as "No owner", else it's categorized as "Owner assigned".
58
-
***Catalog usage and adoption**: This KPI shows a sum of monthly active users of the catalog across different pages.
53
+
***Asset curation**: All data assets are categorized into three buckets - "Fully curated", "Partially curated" and "Not curated", based on certain attributes of assets being present. An asset is "Fully curated" if it has at least one classification tag, an assigned Data Owner and a description. If any of these attributes is missing, but not all, then the asset is categorized as "Partially curated" and if all of them are missing, then it's "Not curated".
54
+
***Asset data ownership**: Assets that have the owner attribute within "Contacts" tab as blank are categorized as "No owner", else it's categorized as "Owner assigned".
59
55
60
56
:::image type="content" source="./media/data-stewardship/kpis-small.png" alt-text="Screenshot of the data stewardship insights summary graphs, showing the three main KPI charts." lightbox="media/data-stewardship/data-stewardship-kpis-large.png":::
61
-
62
-
63
-
As users look at the main dashboard layout, it's divided into two tabs - [**Data estate**](#data-estate) and [**Catalog adoption**](#catalog-adoption).
64
-
65
-
#### Data estate
66
57
67
-
This section of **data stewardship** gives governance and quality focused users, like data stewards and chief data officers, an overview of their data estate, as well as running trends.
58
+
### Data estate health
68
59
69
-
:::image type="content" source="./media/data-stewardship/data-estate-small.png" alt-text="Screenshot of the data stewardship insights summary graphs, with the layout tabs highlighted in the middle of the page and data estate selected." lightbox="media/data-stewardship/data-estate-large.png":::
70
-
71
-
##### Data estate health
72
-
Data estate health is a scorecard view that helps management and governance focused users, like chief data officers, understand critical governance metrics that can be looked at by collection hierarchy.
60
+
Data estate health is a scorecard view that helps management and governance focused users, like chief data officers, understand critical governance metrics that can be looked at by collection hierarchy.
73
61
74
62
:::image type="content" source="./media/data-stewardship/data-estate-health-small.png" alt-text="Screenshot of the data stewardship data estate health table in the middle of the dashboard." lightbox="media/data-stewardship/data-estate-health-large.png":::
75
63
76
64
You can view the following metrics:
77
-
***Total asset**: Count of assets by collection drill-down
65
+
***Assets**: Count of assets by collection drill-down
78
66
***With sensitive classifications**: Count of assets with any system classification applied
79
67
***Fully curated assets**: Count of assets that have a data owner, at least one classification and a description.
80
-
***Owners assigned**: Count of assets with data owner assigned on them
68
+
***Owner assigned**: Count of assets with data owner assigned on them
81
69
***No classifications**: Count of assets with no classification tag
82
-
***Net new assets**: Count of new assets pushed in the Data Map in the last 30 days
83
-
***Deleted assets**: Count of deleted assets from the Data Map in the last 30 days
70
+
***Out of date**: Percentage of assets that have not been updated in over 365 days.
71
+
***New**: Count of new assets pushed in the Data Map in the last 30 days
72
+
***Updated**: Count of assets updated in the Data Map in the last 30 days
73
+
***Deleted**: Count of deleted assets from the Data Map in the last 30 days
84
74
85
-
You can also drill down by collection paths. As you hover on each column name, it provides description of the column and takes you to the detailed graph for further drill-down.
75
+
You can also drill down by collection paths. As you hover on each column name, it provides description of the column, provides recommended percentage ranges, and takes you to the detailed graph for further drill-down.
86
76
87
77
:::image type="content" source="./media/data-stewardship/hover-menu.png" alt-text="Screenshot of the data stewardship data estate health table, with the fully curated column hovered over. A summary is show, and the view more in Stewardship insights option is selected.":::
88
78
89
79
:::image type="content" source="./media/data-stewardship/detailed-view.png" alt-text="Screenshot of the asset curation detailed view, as shown after selecting the view more in stewardship insights option is selected.":::
90
80
91
-
##### Asset curation
92
-
All data assets are categorized into three buckets - ***"Fully curated"***, ***"Partially curated"*** and ***"Not curated"***, based on whether assets have been given certain attributes.
81
+
### Asset curation
82
+
83
+
All data assets are categorized into three buckets - ***Fully curated***, ***Partially curated*** and ***Not curated***, based on whether assets have been given certain attributes.
93
84
94
85
:::image type="content" source="./media/data-stewardship/asset-curation-small.png" alt-text="Screenshot of the data stewardship insights health dashboard, with the asset curation bar chart highlighted." lightbox="media/data-stewardship/asset-curation-large.png":::
95
86
96
-
An asset is ***"Fully curated"*** if it has at least one classification tag, an assigned data owner, and a description.
87
+
An asset is ***Fully curated*** if it has at least one classification tag, an assigned data owner, and a description.
97
88
98
-
If any of these attributes is missing, but not all, then the asset is categorized as ***"Partially curated"***. If all of them are missing, then it's listed as ***"Not curated"***.
89
+
If any of these attributes is missing, but not all, then the asset is categorized as ***Partially curated***. If all of them are missing, then it's listed as ***Not curated***.
99
90
100
91
You can drill down by collection hierarchy.
101
92
102
93
:::image type="content" source="./media/data-stewardship/asset-curation-collection-filter.png" alt-text="Screenshot of the data stewardship asset curation chart, with the collection filter opened to show all available collections.":::
103
94
104
-
For further information about which assets aren't fully curated, you can select ***"View details"*** link that will take you into the deeper view.
95
+
For further information about which assets aren't fully curated, you can select **View details** link that will take you into the deeper view.
105
96
106
97
:::image type="content" source="./media/data-stewardship/asset-curation-view-details.png" alt-text="Screenshot of the data stewardship asset curation chart, with the view details button highlighted below the chart.":::
107
98
108
-
In the ***"View details"*** page, if you select a specific collection, it will list all assets with attribute values or blanks, that make up the ***"fully curated"*** assets.
99
+
In the **View details** page, if you select a specific collection, it will list all assets with attribute values or blanks, that make up the ***fully curated*** assets.
109
100
110
101
:::image type="content" source="./media/data-stewardship/asset-curation-select-collection.png" alt-text="Screenshot of the asset curation detailed view, shown after selecting View Details beneath the asset curation chart.":::
111
102
@@ -115,6 +106,7 @@ First, it tells you what was the ***classification source***, if the asset is cl
115
106
116
107
Second, if an asset is unclassified, it tells us why it's not classified, in the column ***Reasons for unclassified***.
117
108
Currently, Data estate insights can tell one of the following reasons:
109
+
118
110
* No match found
119
111
* Low confidence score
120
112
* Not applicable
@@ -125,45 +117,18 @@ You can select any asset and add missing attributes, without leaving the **Data
125
117
126
118
:::image type="content" source="./media/data-stewardship/edit-asset.png" alt-text="Screenshot of the asset list page, with an asset selected and the edit menu open.":::
127
119
128
-
#####Trends and gap analysis
120
+
### Trends and gap analysis
129
121
130
122
This graph shows how the assets and key metrics have been trending over:
123
+
131
124
* Last 30 days: The graph takes last run of the day or recording of the last run across days as a data point.
132
125
* Last six weeks: The graph takes last run of the week where week ends on Sunday. If there was no run on Sunday, then it takes the last recorded run.
133
126
* Last 12 months: The graph takes last run of the month.
134
127
* Last four quarters: The graph takes last run of the calendar quarter.
135
128
136
129
:::image type="content" source="./media/data-stewardship/trends-and-gap-analysis-small.png" alt-text="Screenshot of the data stewardship insights summary graphs, with data estate selected, showing the trends and gap analysis graph at the bottom of the page." lightbox="media/data-stewardship/trends-and-gap-analysis-large.png":::
137
130
138
-
#### Catalog adoption
139
-
140
-
This tab of the **data stewardship** insight gives management focused users like, chief data officers, a view of what is activity is happening in the catalog. The hypothesis is, the more activity on the catalog, the better usage, hence the better are the chances of governance program to have a high return on investment.
141
-
142
-
:::image type="content" source="./media/data-stewardship/catalog-adoption-small.png" alt-text="Screenshot of the data stewardship insights summary graphs, with the layout tabs highlighted in the middle of the page and catalog adoption selected." lightbox="media/data-stewardship/catalog-adoption-large.png":::
143
-
144
-
##### Active users trend by catalog features
145
-
146
-
Active users trend by area of the catalog, and the graph focuses on activities in **search and browse**, and **asset edits**.
147
-
148
-
If there are active users of search and browse, meaning the user has typed a search keyword and hit enter, or selected browse by assets, we count it as an active user of "search and browse".
149
-
150
-
If a user has edited an asset by selecting "save" after making changes, we consider that user as an active user of "asset edits".
151
-
152
-
:::image type="content" source="./media/data-stewardship/active-users-trend-small.png" alt-text="Screenshot of the data stewardship insights summary graphs, with the active users trend graph highlighted." lightbox="media/data-stewardship/active-users-trend-large.png":::
153
-
154
-
##### Most viewed assets in last 30 days
155
-
156
-
You can see the most viewed assets in the catalog, their current curation level, and number of views. This list is currently limited to five items.
157
-
158
-
:::image type="content" source="./media/data-stewardship/most-viewed-assets-small.png" alt-text="Screenshot of the data stewardship insights summary graphs, with the most viewed assets table highlighted.":::
159
-
160
-
##### Most searched keywords in last 30 days
161
-
162
-
You can view count of top five searches with a result returned. The table also shows what key words were searched without any results in the catalog.
163
-
164
-
:::image type="content" source="./media/data-stewardship/top-searched-keywords-small.png" alt-text="Screenshot of the data stewardship insights summary graphs, with the most searched keywords table highlighted.":::
165
-
166
131
## Next steps
167
132
168
-
Learn more about Microsoft Purview Data estate insights through:
169
-
*[Concepts](concept-insights.md)
133
+
Learn more about Microsoft Purview Data Estate Insights through:
0 commit comments