Commit 7e1291b
Merge pull request #227721 from linda33wj/purview
Update resource set and PBI/Synapse connector docs
2 parents 1527421 + 11652c5 commit 7e1291b

5 files changed

+14
-27
lines changed

articles/purview/catalog-managed-vnet.md

Lines changed: 2 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ ms.author: zeinam
66
ms.service: purview
77
ms.subservice: purview-data-catalog
88
ms.topic: how-to
9-
ms.date: 01/31/2023
9+
ms.date: 02/17/2023
1010
ms.custom: references_regions
1111
# Customer intent: As a Microsoft Purview admin, I want to set up Managed Virtual Network and managed private endpoints for my Microsoft Purview account.
1212
---
@@ -51,12 +51,7 @@ Currently, the following data sources are supported to have a managed private en
5151
- Azure SQL Managed Instance
5252
- Azure Synapse Analytics
5353

54-
Additionally, you can deploy managed private endpoints for your Azure Key Vault resources if you need to run scans using any authentication options rather than Managed Identities, such as SQL Authentication or Account Key.
55-
56-
> [!IMPORTANT]
57-
> If you are planning to scan Azure Synapse workspaces using Managed Virtual Network, you are also required to [configure Azure Synapse workspace firewall access](register-scan-synapse-workspace.md#set-up-azure-synapse-workspace-firewall-access) to enable **Allow Azure services and resources to access this workspace**. Currently, we do not support setting up scans for an Azure Synapse workspace from the Microsoft Purview governance portal, if you cannot enable **Allow Azure services and resources to access this workspace** on your Azure Synapse workspaces. If you cannot enable the firewall:
58-
> - You can use [Microsoft Purview REST API - Scans - Create Or Update](/rest/api/purview/scanningdataplane/scans/create-or-update/) to create a new scan for your Synapse workspaces including dedicated and serverless pools.
59-
> - You must use **SQL Authentication** as authentication mechanism.
54+
Additionally, you can deploy managed private endpoints for your Azure Key Vault resources if you need to run scans using any authentication options rather than Managed Identities, such as SQL Authentication or Account Key.
6055

6156
### Managed Virtual Network
6257

articles/purview/concept-asset-normalization.md

Lines changed: 1 addition & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ ms.author: nayenama
66
ms.service: purview
77
ms.subservice: purview-data-catalog
88
ms.topic: conceptual
9-
ms.date: 02/24/2022
9+
ms.date: 02/17/2023
1010
ms.custom: ignite-fall-2021
1111
---
1212

@@ -79,13 +79,6 @@ Before: `https://myAccount.file.core.windows.net//myshare/folderA////folderB/`
7979

8080
After: `https://myaccount.file.core.windows.net/myshare/folderA/folderB/`
8181

82-
### Lowercase ADF sections
83-
Applies to: Azure Data Factory
84-
85-
Before: `/subscriptions/01234567-abcd-9876-0000-ba9876543210/resourceGroups/fooBar/providers/Microsoft.DataFactory/factories/fooFactory/pipelines/barPipeline/activities/barFoo`
86-
87-
After: `/subscriptions/01234567-abcd-9876-0000-ba9876543210/resourceGroups/foobar/providers/microsoft.datafactory/factories/foofactory/pipelines/barpipeline/activities/barfoo`
88-
8982
### Convert to ADL scheme
9083
Applies to: Azure Data Lake Storage Gen1
9184

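Reviewer note on the `concept-asset-normalization.md` hunk above: the retained Before/After pair (`https://myAccount.file.core.windows.net//myshare/folderA////folderB/` becoming `https://myaccount.file.core.windows.net/myshare/folderA/folderB/`) can be illustrated with a minimal sketch. The rule's exact definition is not visible in this hunk, so the behavior below (lowercase the host, collapse repeated slashes in the path, keep path casing) is inferred from that single example. The function name `normalize_asset_url` is hypothetical and is not part of any Microsoft Purview API.

```python
import re
from urllib.parse import urlsplit, urlunsplit


def normalize_asset_url(url: str) -> str:
    """Illustrative only: mimic the Before/After pair shown in the diff.

    Assumption (inferred from the example, not from Purview documentation):
    the host is lowercased and runs of '/' in the path collapse to one,
    while the casing of path segments is preserved.
    """
    parts = urlsplit(url)
    host = parts.netloc.lower()
    # Collapse two or more consecutive slashes in the path into one.
    path = re.sub(r"/{2,}", "/", parts.path)
    return urlunsplit((parts.scheme, host, path, parts.query, parts.fragment))


before = "https://myAccount.file.core.windows.net//myshare/folderA////folderB/"
print(normalize_asset_url(before))
```

The removed "Lowercase ADF sections" rule lowercased the entire resource ID, which differs from this sketch: here only the host changes case.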
articles/purview/concept-resource-sets.md

Lines changed: 8 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ ms.author: nayenama
66
ms.service: purview
77
ms.subservice: purview-data-catalog
88
ms.topic: conceptual
9-
ms.date: 01/23/2023
9+
ms.date: 02/17/2023
1010
---
1111

1212
# Understanding resource sets
@@ -15,7 +15,7 @@ This article helps you understand how Microsoft Purview uses resource sets to ma
1515

1616
## Background info
1717

18-
At-scale data processing systems typically store a single table in storage as multiple files. In the Microsoft Purview data catalog, this concept is represented by using resource sets. A resource set is a single object in the catalog that represents a large number of assets in storage.
18+
At-scale data processing systems typically store a single table in storage as multiple files. In the Microsoft Purview Data Catalog, this concept is represented by using resource sets. A resource set is a single object in the catalog that represents a large number of assets in storage.
1919

2020
For example, suppose your Spark cluster has persisted a DataFrame into an Azure Data Lake Storage (ADLS) Gen2 data source. Although in Spark the table looks like a single logical resource, on the disk there are likely thousands of Parquet files, each of which represents a partition of the total DataFrame's contents. IoT data and web log data have the same challenge. Imagine you have a sensor that outputs log files several times a second. It won't take long until you have hundreds of thousands of log files from that single sensor.
2121

@@ -45,9 +45,9 @@ When Microsoft Purview detects resources that it thinks are part of a resource s
4545

4646
## Advanced resource sets
4747

48-
Microsoft Purview can customize and further enrich your resource set assets through the **Advanced Resource Sets** capability. Advanced resource sets allows Microsoft Purview to understand the underlying partitions of data ingested and enables the creation of [resource set pattern rules](how-to-resource-set-pattern-rules.md) that customize how Microsoft Purview groups resource sets during scanning.
48+
Microsoft Purview can customize and further enrich your resource set assets through the **Advanced Resource Sets** capability. Advanced resource sets allow Microsoft Purview to understand the underlying partitions of data ingested and enables the creation of [resource set pattern rules](how-to-resource-set-pattern-rules.md) that customize how Microsoft Purview groups resource sets during scanning.
4949

50-
When Advanced Resource Sets are enabled, Microsoft Purview run extra aggregations to compute the following information about resource set assets:
50+
When Advanced Resource Sets are enabled, Microsoft Purview runs extra aggregations to compute the following information about resource set assets:
5151

5252
- A sample path from a file that comprises the resource set.
5353
- A partition count that shows how many files make up the resource set.
@@ -59,14 +59,14 @@ These properties can be found on the asset details page of the resource set.
5959

6060
### Turning on advanced resource sets
6161

62-
Advanced resource sets is off by default in all new Microsoft Purview instances. Advanced resource sets can be enabled from **Account information** in the management hub. Only those users who are added to the Data Curator role at root collection, can manage Advanced Resource Sets settings.
62+
Advanced resource sets are off by default in all new Microsoft Purview instances. Advanced resource sets can be enabled from **Account information** in the management hub. Only those users who are added to the Data Curator role at root collection, can manage Advanced Resource Sets settings.
6363

6464
:::image type="content" source="media/concept-resource-sets/advanced-resource-set-toggle.png" alt-text="Turn on Advanced resource set." border="true":::
6565

6666
After enabling advanced resource sets, the additional enrichments will occur on all newly ingested assets. The Microsoft Purview team recommends waiting an hour before scanning in new data lake data after toggling on the feature.
6767

6868
> [!IMPORTANT]
69-
> Enabling advanced resource sets will impact the refresh rate of asset and classification insights. When advanced resource sets is on, asset and classification insights will only update twice a day.
69+
> Enabling advanced resource sets will impact the refresh rate of asset and classification insights. When advanced resource sets are on, asset and classification insights will only update twice a day.
7070
7171
## Built-in resource set patterns
7272

@@ -89,7 +89,7 @@ Microsoft Purview supports the following resource set patterns. These patterns c
8989
| Date(yyyy/mm/dd)InPath | {Year}/{Month}/{Day} | Year/month/day pattern spanning multiple folders |
9090

9191

92-
## How resource sets are displayed in the Microsoft Purview data catalog
92+
## How resource sets are displayed in the Microsoft Purview Data Catalog
9393

9494
When Microsoft Purview matches a group of assets into a resource set, it attempts to extract the most useful information to use as a display name in the catalog. Some examples of the default naming convention applied:
9595

@@ -119,12 +119,11 @@ When scanning a storage account, Microsoft Purview uses a set of defined pattern
119119
- Putting an asset into the wrong resource set
120120
- Incorrectly marking an asset as not being a resource set
121121

122-
To customize or override how Microsoft Purview detects which assets are grouped as resource sets and how they are displayed within the catalog, you can define pattern rules in the management center. For step-by-step instructions and syntax, please see [resource set pattern rules](how-to-resource-set-pattern-rules.md).
122+
To customize or override how Microsoft Purview detects which assets are grouped as resource sets and how they're displayed within the catalog, you can define pattern rules in the management center. For step-by-step instructions and syntax, see [resource set pattern rules](how-to-resource-set-pattern-rules.md).
123123

124124
## Known limitations with resource sets
125125

126126
- By default, resource set assets will only be deleted by a scan if [Advanced Resource sets](#advanced-resource-sets) are enabled. If this capability is off, resource set assets can only be deleted manually or via API.
127-
- Currently, resource set assets will apply the first schema and classification discovered by the scan. Subsequent scans won't update the schema.
128127

129128
## Next steps
130129

articles/purview/register-scan-power-bi-tenant-cross-tenant.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -160,7 +160,7 @@ Use either of the following deployment checklists during the setup, or for troub
160160
- `*.analysis.windows.net`
161161

162162
1. Network connectivity from the self-hosted runtime to Microsoft services is enabled.
163-
1. [JDK 8 or later](https://www.oracle.com/java/technologies/javase-jdk11-downloads.html) is installed.
163+
1. [JDK 8 or later](https://www.oracle.com/java/technologies/javase-jdk11-downloads.html) is installed. Restart the machine after you newly install the JDK for it to take effect.
164164
1. In Power BI tenant, In Azure Active Directory create a security group.
165165
1. In Power BI tenant, from Azure Active Directory tenant, make sure [Service Principal is member of the new security group](#authenticate-to-power-bi-tenant).
166166
1. On the Power BI Tenant Admin portal, validate if [Allow service principals to use read-only Power BI admin APIs](#associate-the-security-group-with-power-bi-tenant) is enabled for the new security group.

articles/purview/register-scan-power-bi-tenant.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -138,7 +138,7 @@ Use any of the following deployment checklists during the setup or for troublesh
138138
- `*.analysis.windows.net`
139139

140140
3. Network connectivity from Self-hosted runtime to Microsoft services is enabled.
141-
4. [JDK 8 or later](https://www.oracle.com/java/technologies/javase-jdk11-downloads.html) is installed.
141+
4. [JDK 8 or later](https://www.oracle.com/java/technologies/javase-jdk11-downloads.html) is installed. Restart the machine after you newly install the JDK for it to take effect.
142142

143143
1. In Azure Active Directory tenant, create a security group.
144144

@@ -171,7 +171,7 @@ Use any of the following deployment checklists during the setup or for troublesh
171171

172172
1. Validate Self-hosted runtime settings:
173173
1. Latest version of [Self-hosted runtime](https://www.microsoft.com/download/details.aspx?id=39717) is installed on the VM.
174-
2. [JDK 8 or later](https://www.oracle.com/java/technologies/javase-jdk11-downloads.html) is installed.
174+
2. [JDK 8 or later](https://www.oracle.com/java/technologies/javase-jdk11-downloads.html) is installed. Restart the machine after you newly install the JDK for it to take effect.
175175

176176
1. Validate App registration settings to make sure:
177177
1. App registration exists in your Azure Active Directory tenant.
