Commit f055166

Merge pull request #220771 from whhender/purview-freshness-ankitgup
Purview freshness ankitgup
2 parents 0d70aac + 1b47fc2 commit f055166

12 files changed: +215 -112 lines changed
Lines changed: 40 additions & 78 deletions
@@ -1,112 +1,74 @@
11
---
2-
title: Apply classifications on assets
3-
description: This document describes how to apply classifications on assets.
4-
author: SunetraVirdi
5-
ms.author: suvirdi
2+
title: Automatically apply classifications on assets
3+
description: This document describes how to automatically apply classifications on assets.
4+
author: ankitscribbles
5+
ms.author: ankitgup
66
ms.service: purview
77
ms.subservice: purview-data-map
88
ms.topic: how-to
9-
ms.date: 09/27/2021
9+
ms.date: 12/30/2022
1010
---
11-
# Apply classifications on assets in Microsoft Purview
11+
# Automatically apply classifications on assets in Microsoft Purview
1212

13-
This article discusses how to apply classifications on assets.
13+
After data sources are [registered](manage-data-sources.md#register-a-new-source) in the Microsoft Purview Data Map, the next step is to [scan](concept-scans-and-ingestion.md) the data sources. The scanning process establishes a connection to the data source, captures technical metadata, and can automatically classify data using either the [supported system classifications](supported-classifications.md) or [rules for your custom classifications](create-a-custom-classification-and-classification-rule.md#custom-classification-rules). For example, if you have a file named *multiple.docx* and it has a National ID number in its content, during the scanning process Microsoft Purview adds the classification **EU National Identification Number** to the file asset's detail page.
1414

15-
## Introduction
15+
These [classifications](concept-classification.md) help you and your team identify the kinds of data you have across your data estate, for example, whether files or tables contain credit card numbers or addresses. You can then more easily search for certain kinds of information, like customer IDs, or prioritize security for sensitive data types.
1616

17-
Classifications can be system or custom types. System classifications are present in Microsoft Purview by default. Custom classifications can be created based on a regular expression pattern and keyword lists. Classifications can be applied to assets either automatically via scanning or manually.
17+
Classifications can be automatically applied on file and column assets during scanning.
1818

19-
This document explains how to apply classifications to your data.
19+
In this article we'll discuss:
2020

21-
## Prerequisites
21+
- [How Microsoft Purview classifies data](#how-microsoft-purview-classifies-assets)
22+
- [The steps to automatically apply classifications](#automatically-apply-classifications)
2223

23-
- Create custom classifications based on your need.
24-
- Set up scan on your data sources.
24+
## How Microsoft Purview classifies assets
2525

26-
## Apply classifications
27-
In Microsoft Purview, you can apply system or custom classifications on a file, table, or column asset. This article describes the steps to manually apply classifications on your assets.
26+
When a data source is scanned, Microsoft Purview compares data in the asset to a list of possible classifications called a [scan rule set](create-a-scan-rule-set.md).
2827

29-
### Apply classification to a file asset
30-
Microsoft Purview can scan and automatically classify documents. For example, if you have a file named *multiple.docx* and it has a National ID number in its content, Microsoft Purview adds the classification **EU National Identification Number** to the file asset's detail page.
28+
[System scan rule sets](create-a-scan-rule-set.md#system-scan-rule-sets) are already available for each data source and contain every currently available system classification for that data source. Alternatively, you can [create a custom scan rule set](create-a-scan-rule-set.md) to make a list of classifications tailored to your data set.
3129

32-
In some scenarios, you might want to manually add classifications to your file asset or if you have multiple files that are grouped into a resource set, add classifications at the resource set level.
30+
Making a custom rule set for your data can be a good idea if your data is limited to specific kinds of information or regions, because comparing your data to fewer classification types speeds up the scanning process. For example, if your dataset only contains European data, you could create a custom scan rule set that excludes identification for other regions.
3331

34-
Follow these steps to add a custom or system classification to a partition resource set:
32+
You might also make a custom rule set if you've created [custom classifications](create-a-custom-classification-and-classification-rule.md#steps-to-create-a-custom-classification) and [classification rules](create-a-custom-classification-and-classification-rule.md#custom-classification-rules), so that your custom classifications can be automatically applied during scanning.
3533

36-
1. Search or browse the partition and navigate to the asset detail page.
34+
For more information about the available system classifications and how your data is classified, see the [system classifications page](supported-classifications.md).
3735

38-
:::image type="content" source="./media/apply-classifications/asset-detail-page.png" alt-text="Screenshot showing the asset detail page.":::
36+
## Automatically apply classifications
3937

40-
1. On the **Overview** tab, view the **Classifications** section to see if there are any existing classifications. Select **Edit**.
38+
>[!NOTE]
39+
>Table assets are not automatically assigned classifications, because the classifications are assigned to their columns, but you can [manually apply classifications to table assets](manually-apply-classifications.md#manually-apply-classification-to-a-table-asset).
4140
42-
1. From the **Classifications** drop-down list, select the specific classifications you're interested in. For example, **Credit Card Number**, which is a system classification and **CustomerAccountID**, which is a custom classification.
41+
After data sources are [registered](manage-data-sources.md#register-a-new-source), you can automatically classify data in that source's data assets by running a [scan](concept-scans-and-ingestion.md).
4342

44-
:::image type="content" source="./media/apply-classifications/select-classifications.png" alt-text="Screenshot showing how to select classifications to add to an asset.":::
43+
1. Check the **Scan** section of the [source article](microsoft-purview-connector-overview.md) for your data source to confirm any prerequisites or authentication are set up and ready for a scan.
4544

46-
1. Select **Save**
45+
1. Search the Microsoft Purview Data Map for the registered source that has the data assets (files and columns) you want to classify.
4746

48-
1. On the **Overview** tab, confirm that the classifications you selected appear under the **Classifications** section.
47+
1. Select the **New Scan** icon under the resource.
4948

50-
:::image type="content" source="./media/apply-classifications/confirm-classifications.png" alt-text="Screenshot showing how to confirm classifications were added to an asset.":::
49+
:::image type="content" source="./media/apply-classifications/new-scan.png" alt-text="Screenshot of the Microsoft Purview Data Map, with the new scan button selected under a registered source.":::
5150

52-
### Apply classification to a table asset
51+
>[!TIP]
52+
>If you don't see the **New Scan** button, you may not have the correct permissions. To run a scan, you'll need at least [data source administrator permissions](catalog-permissions.md) on the collection where the source is registered.
5353
54-
When Microsoft Purview scans your data sources, it doesn't automatically assign classifications to table assets. If you want your table asset to have a classification, you must add it manually.
54+
1. Select your credential and authenticate with your source. (For more information about authenticating with your source, see the **prerequisite** and **scan** sections of the [source article](microsoft-purview-connector-overview.md) for your specific source.) Select **Continue**.
5555

56-
To add a classification to a table asset:
56+
1. If necessary, select the assets in the source you want to scan. You can scan all assets, or a subset of folders, files, or tables depending on the source.
5757

58-
1. Find a table asset that you're interested in. For example, **Customer** table.
58+
1. Select your scan rule set. You'll see a list of available scan rule sets and can select one, or you can create a new scan rule set using the **New scan rule set** button at the top. The scan rule set determines which classifications are compared against and applied to your data. For more information, see [how Microsoft Purview classifies assets](#how-microsoft-purview-classifies-assets).
5959

60-
1. Confirm that no classifications are assigned to the table. Select **Edit**
60+
:::image type="content" source="./media/apply-classifications/select-scan-rule-set.png" alt-text="Screenshot of the scan rule set page of the scan menu, with the new scan rule set and existing scan rule set buttons highlighted.":::
6161

62-
:::image type="content" source="./media/apply-classifications/select-edit-from-table-asset.png" alt-text="Screenshot showing how to view and edit the classifications of a table asset.":::
62+
>[!TIP]
63+
>For more information about the options available when creating a scan rule set, start at step 4 of these [steps to create a scan rule set](create-a-scan-rule-set.md#steps-to-create-a-scan-rule-set).
6364
64-
1. From the **Classifications** drop-down list, select one or more classifications. This example uses a custom classification named **CustomerInfo**, but you can select any classifications for this step.
65+
1. Schedule your scan.
6566

66-
:::image type="content" source="./media/apply-classifications/select-classifications-in-table.png" alt-text="Screenshot showing how to select classifications to add to a table asset.":::
67+
1. Save and run your scan. Applicable classifications in your scan rule set will be automatically applied to the assets you scan. You'll be able to view and manage them once the scan is complete.
6768

68-
1. Select **Save** to save the classifications.
69-
70-
1. On the **Overview** page, verify that Microsoft Purview added your new classifications.
71-
72-
:::image type="content" source="./media/apply-classifications/verify-classifications-added-to-table.png" alt-text="Screenshot showing how to verify that classifications were added to a table asset.":::
73-
74-
### Add classification to a column asset
75-
76-
Microsoft Purview automatically scans and adds classifications to all column assets. However, if you want to change the classification, you can do so at the column level.
77-
78-
To add a classification to a column:
79-
80-
1. Find and select the column asset, and then select **Edit** from the **Overview** tab.
81-
82-
1. Select the **Schema** tab
83-
84-
:::image type="content" source="./media/apply-classifications/edit-column-schema.png" alt-text="Screenshot showing how to edit the schema of a column.":::
85-
86-
1. Identify the columns you're interested in and select **Add a classification**. This example adds a **Common Passwords** classification to the **PasswordHash** column.
87-
88-
:::image type="content" source="./media/apply-classifications/add-classification-to-column.png" alt-text="Screenshot showing how to add a classification to a column.":::
89-
90-
1. Select **Save**
91-
92-
1. Select the **Schema** tab and confirm that the classification has been added to the column.
93-
94-
:::image type="content" source="./media/apply-classifications/confirm-classification-added.png" alt-text="Screenshot showing how to confirm that a classification was added to a column schema.":::
95-
96-
## View classification details
97-
Microsoft Purview captures important details like who applied a classification and when it was applied. To view the details, hover over the classification to reveal the Classification details card. The classification details card shows the following information:
98-
- Classification name - Name of the classification applied on the asset or column.
99-
- Applied by - Who applied the classification. Possible values are scan and user name.
100-
- Applied time - Local timestamp when the classification was applied via scan or manually.
101-
- Classification type - System or custom.
102-
103-
Users with *Data Curator* role will see additional details for classifications that were applied automatically via scan. These details will include sample count that the scanner read to classify the data and distinct data count in the sample that the scanner found.
104-
105-
:::image type="content" source="./media/apply-classifications/view-classification-detail.png" alt-text="Screenshot showing how to view classification detail.":::
106-
107-
## Impact of rescanning on existing classifications
108-
109-
Classifications are applied the first time, based on sample set check on your data and matching it against the set regex pattern. At the time of rescan, if new classifications apply, the column gets additional classifications on it. Existing classifications stay on the column, and must be removed manually.
69+
[!INCLUDE [classification-details](includes/classification-details.md)]
11070

11171
## Next steps
112-
To learn how to create a custom classification, see [Create a custom classification](create-a-custom-classification-and-classification-rule.md).
72+
73+
- To learn how to create a custom classification, see [create a custom classification](create-a-custom-classification-and-classification-rule.md).
74+
- To learn about how to manually apply classifications, see [manually apply classifications](manually-apply-classifications.md).
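
The "How Microsoft Purview classifies assets" section added in this file describes a scan comparing asset data to the classifications in a scan rule set, and notes that a smaller custom rule set means fewer comparisons. The following is a minimal sketch of that idea only, not Microsoft Purview's implementation or API. The rule names other than **Credit Card Number**, the regex patterns, the `region` field, and the helper functions `build_rule_set` and `classify` are all assumptions made for illustration.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class ClassificationRule:
    """Hypothetical stand-in for one classification in a scan rule set."""
    name: str
    pattern: str  # simplified illustrative regex, not a real Purview pattern
    region: str   # assumed tag used here to build a narrower custom rule set


# Assumed "system" rules for illustration only.
SYSTEM_RULES = [
    ClassificationRule("Credit Card Number", r"\b(?:\d{4}[ -]?){3}\d{4}\b", "global"),
    ClassificationRule("Example US ID Number", r"\b\d{3}-\d{2}-\d{4}\b", "us"),
    ClassificationRule("Example EU Phone Number", r"\+3\d[ \d]{8,12}", "eu"),
]


def build_rule_set(rules, regions):
    """Return a custom rule set limited to the given regions."""
    return [rule for rule in rules if rule.region in regions]


def classify(samples, rule_set):
    """Return the names of every rule that matches at least one sample value."""
    matched = []
    for rule in rule_set:
        regex = re.compile(rule.pattern)
        if any(regex.search(value) for value in samples):
            matched.append(rule.name)
    return matched


# Sampled values from a hypothetical column.
column_samples = ["4111 1111 1111 1111", "123-45-6789", "n/a"]

# Scanning with every rule checks three patterns.
all_hits = classify(column_samples, SYSTEM_RULES)

# A custom rule set that excludes non-European rules checks only two.
eu_only = build_rule_set(SYSTEM_RULES, {"global", "eu"})
eu_hits = classify(column_samples, eu_only)

print(all_hits)
print(eu_hits)
```

In this sketch the narrower rule set does less work per sample, but it also means the US-style identifier in the data is no longer flagged, which is the trade-off to weigh when excluding classifications from a custom scan rule set.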

articles/purview/catalog-permissions.md

Lines changed: 4 additions & 3 deletions
@@ -6,7 +6,7 @@ ms.author: viseshag
66
ms.service: purview
77
ms.custom: event-tier1-build-2022
88
ms.topic: conceptual
9-
ms.date: 06/17/2022
9+
ms.date: 12/19/2022
1010
---
1111

1212
# Access control in the Microsoft Purview governance portal
@@ -42,8 +42,9 @@ The Microsoft Purview governance portal uses a set of predefined roles to contro
4242
|User Scenario|Appropriate Role(s)|
4343
|-------------|-----------------|
4444
|I just need to find assets, I don't want to edit anything|Data reader|
45-
|I need to edit information about assets, assign classifications, associate them with glossary entries, and so on.|Data curator|
46-
|I need to edit the glossary or set up new classification definitions|Data curator|
45+
|I need to edit and manage information about assets|Data curator|
46+
|I want to create custom classifications|Data curator **or** data source administrator|
47+
|I need to edit the business glossary |Data curator|
4748
|I need to view Data Estate Insights to understand the governance posture of my data estate|Data curator|
4849
|My application's Service Principal needs to push data to the Microsoft Purview Data Map|Data curator|
4950
|I need to set up scans via the Microsoft Purview governance portal|Data curator on the collection **or** data curator **and** data source administrator where the source is registered.|

articles/purview/concept-best-practices-classification.md

Lines changed: 2 additions & 1 deletion
@@ -155,6 +155,7 @@ Here are some considerations to bear in mind as you're defining classifications:
155155

156156
## Next steps
157157

158-
- [Apply system classification](./apply-classifications.md)
158+
- [Automatically apply classifications](./apply-classifications.md)
159+
- [Manually apply classifications](./manually-apply-classifications.md)
159160
- [Create custom classification](./create-a-custom-classification-and-classification-rule.md)
160161

articles/purview/concept-classification.md

Lines changed: 2 additions & 1 deletion
@@ -49,5 +49,6 @@ Custom classification rules can be based on a *regular expression* pattern or *d
4949

5050
* [Read about classification best practices](concept-best-practices-classification.md)
5151
* [Create custom classifications](create-a-custom-classification-and-classification-rule.md)
52-
* [Apply classifications](apply-classifications.md)
52+
* [Automatically apply classifications](apply-classifications.md)
53+
* [Manually apply classifications](manually-apply-classifications.md)
5354
* [Use the Microsoft Purview governance portal](use-azure-purview-studio.md)
