Commit 399b166

Merge pull request #245259 from linda33wj/main
Update SHIR & connector articles
2 parents bc36bfb + 24c7e9a commit 399b166

3 files changed

+19
-10
lines changed

articles/purview/manage-integration-runtimes.md

Lines changed: 3 additions & 2 deletions
@@ -7,7 +7,7 @@ ms.service: purview
 ms.subservice: purview-data-map
 ms.custom: devx-track-extended-java
 ms.topic: how-to
-ms.date: 05/08/2023
+ms.date: 07/18/2023
 ---
 
 # Create and manage a self-hosted integration runtime
@@ -147,7 +147,8 @@ Make sure the account has the permission of Log-on as a service. Otherwise self-
 You can associate a self-hosted integration runtime with multiple on-premises machines or virtual machines in Azure. These machines are called nodes. You can have up to four nodes associated with a self-hosted integration runtime. The benefits of having multiple nodes are:
 
 - Higher availability of the self-hosted integration runtime so that it's no longer the single point of failure for scan. This availability helps ensure continuity when you use up to four nodes.
-- Run more concurrent scans. Each self-hosted integration runtime can empower many scans at the same time, auto determined based on the machine's CPU/memory. You can install more nodes if you have more concurrency need. Each scan will be executed on one of the nodes. Having more nodes doesn't improve the performance of a single scan execution.
+- Run more concurrent scans. Each self-hosted integration runtime can empower many scan runs at the same time, auto determined based on the machine's CPU/memory. You can install more nodes if you have more concurrency need.
+- When scanning sources like Azure Blob, Azure Data Lake Storage Gen1, Azure Data Lake Storage Gen2 and Azure Files, each scan run can leverage all those nodes to boost the scan performance. For other sources, scan will be executed on one of the nodes.
 
 You can associate multiple nodes by installing the self-hosted integration runtime software from [Download Center](https://www.microsoft.com/download/details.aspx?id=39717). Then, register it by using the same authentication key.
 

articles/purview/register-scan-power-bi-tenant.md

Lines changed: 2 additions & 2 deletions
@@ -6,7 +6,7 @@ ms.author: jingwang
 ms.service: purview
 ms.subservice: purview-data-map
 ms.topic: how-to
-ms.date: 06/08/2023
+ms.date: 07/17/2023
 ms.custom: template-how-to, ignite-fall-2021
 ---
 
@@ -188,7 +188,7 @@ Use any of the following deployment checklists during the setup or for troublesh
 3. Under **Authentication**, **Allow public client flows** is enabled.
 
 2. Review network configuration and validate if:
-1. A [private endpoint for Power BI tenant](/power-bi/enterprise/service-security-private-links) is deployed.
+1. If your Power BI doesn't allow public access, make sure [private endpoint for Power BI tenant](/power-bi/enterprise/service-security-private-links) is deployed.
 2. All required [private endpoints for Microsoft Purview](./catalog-private-link-end-to-end.md) are deployed.
 3. Network connectivity from Self-hosted runtime to Power BI tenant is enabled. The following endpoints must be reachable from self-hosted runtime VM:
 - `*.powerbi.com`

articles/purview/register-scan-snowflake.md

Lines changed: 14 additions & 6 deletions
@@ -6,7 +6,7 @@ ms.author: jingwang
 ms.service: purview
 ms.subservice: purview-data-map
 ms.topic: how-to
-ms.date: 06/12/2023
+ms.date: 07/17/2023
 ms.custom: template-how-to
 ---
 
@@ -37,13 +37,18 @@ When scanning Snowflake source, Microsoft Purview supports:
 - Tasks
 - Sequences
 
-- Fetching static lineage on assets relationships among tables, views, and streams.
+- Fetching static lineage on assets relationships among tables, views, streams, and stored procedures.
 
-When setting up scan, you can choose to scan one or more Snowflake database(s) entirely, or further scope the scan to a subset of schemas matching the given name(s) or name pattern(s).
+For stored procedures, you can choose the level of details to extract on [scan settings](#scan). Stored procedure lineage is supported for Snowflake Scripting (SQL) and JavaScript languages, and generated based on the procedure definition.
+
+When setting up scan, you can choose to scan one or more Snowflake database(s) entirely based on the given name(s) or name pattern(s), or further scope the scan to a subset of schemas matching the given name(s) or name pattern(s).
 
 ### Known limitations
 
-When object is deleted from the data source, currently the subsequent scan won't automatically remove the corresponding asset in Microsoft Purview.
+- When object is deleted from the data source, currently the subsequent scan won't automatically remove the corresponding asset in Microsoft Purview.
+- Stored procedure lineage is not supported for the following patterns:
+- Stored procedure defined in Java, Python and Scala languages.
+- Stored procedure using SQL [EXECUTE IMMEDIATE](https://docs.snowflake.com/en/sql-reference/sql/execute-immediate) with static SQL query as variable.
 
 ## Prerequisites
 
@@ -145,7 +150,7 @@ On the **Register sources (Snowflake)** screen, follow these steps:
 
 1. Enter a **Name** that the data source will be listed within the Catalog.
 
-1. Enter the **server** URL used to connect to the Snowflake account in the form of `<account_identifier>.snowflakecomputing.com`, for example, `xy12345.east-us-2.azure.snowflakecomputing.com`. Learn more about Snowflake [account identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier.html#).
+1. Enter the **server** URL used to connect to the Snowflake account in the form of `<account_identifier>.snowflakecomputing.com`, for example, `orgname-accountname.snowflakecomputing.com`. Learn more about Snowflake [account identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier.html#).
 
 1. Select a collection or create a new one (Optional)
 
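
Review note on the hunk above: both the old and the new example value follow the `<account_identifier>.snowflakecomputing.com` form. A minimal sketch of a check for that form is shown here. The helper name `account_identifier` and the regular expression are assumptions for illustration only; they are not part of Microsoft Purview or any Snowflake tooling, and the pattern is deliberately loose (it accepts any dot-separated run of letters, digits, `_` and `-` before the fixed suffix).

```python
import re
from typing import Optional

# Hypothetical pattern: one or more dot-separated labels followed by the
# fixed ".snowflakecomputing.com" suffix. No scheme, port, or path allowed.
_SERVER_RE = re.compile(
    r"^(?P<account>[A-Za-z0-9_-]+(?:\.[A-Za-z0-9_-]+)*)\.snowflakecomputing\.com$",
    re.IGNORECASE,
)


def account_identifier(server: str) -> Optional[str]:
    """Return the account identifier part of a server value, or None
    if the value is not of the form <account_identifier>.snowflakecomputing.com."""
    match = _SERVER_RE.match(server.strip())
    return match.group("account") if match else None


if __name__ == "__main__":
    # Both example values from the diff are accepted by this sketch.
    print(account_identifier("orgname-accountname.snowflakecomputing.com"))
    print(account_identifier("xy12345.east-us-2.azure.snowflakecomputing.com"))
```

A value that includes a scheme such as `https://` is rejected by this sketch, which matches the host-only form the article shows.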

@@ -208,11 +213,14 @@ To create and run a new scan, follow these steps:
 
 1. **Stored procedure details**: Controls the number of details imported from stored procedures:
 
-- Signature: The name and parameters of stored procedures.
+- Signature (default): The name and parameters of stored procedures.
 - Code, signature: The name, parameters and code of stored procedures.
 - Lineage, code, signature: The name, parameters and code of stored procedures, and the data lineage derived from the code.
 - None: Stored procedure details aren't included.
 
+> [!Note]
+> If you use Self-hosted Integration Runtime for scan, customized setting other than the default Signature is supported since version 5.30.8541.1. The earlier versions always extract the name and parameters of stored procedures.
+
 1. **Maximum memory available** (applicable when using self-hosted integration runtime): Maximum memory (in GB) available on customer's VM to be used by scanning processes. It's dependent on the size of Snowflake source to be scanned.
 
 > [!Note]
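
Review note on the hunk above: the added note ties non-default **Stored procedure details** settings to self-hosted integration runtime version 5.30.8541.1 or later. A minimal sketch of that comparison follows, assuming the installed version is available as a dotted string of integers. The function names are hypothetical and this is not an API of the integration runtime; the point is only that the parts must be compared numerically rather than as text.

```python
from typing import Tuple

# Minimum version quoted in the note added by this commit.
MIN_CUSTOM_SP_DETAILS = "5.30.8541.1"


def parse_version(version: str) -> Tuple[int, ...]:
    """Split a dotted version string into a tuple of integers."""
    return tuple(int(part) for part in version.strip().split("."))


def supports_custom_sp_details(installed: str) -> bool:
    """True if the installed version is at or above the minimum in the note.

    Tuples compare element by element, so 5.30 sorts after 5.9,
    which a plain string comparison would get wrong.
    """
    return parse_version(installed) >= parse_version(MIN_CUSTOM_SP_DETAILS)


if __name__ == "__main__":
    for candidate in ("5.29.8528.1", "5.30.8541.1", "5.31.0.0"):
        print(candidate, supports_custom_sp_details(candidate))
```

Per the note, a runtime below the minimum always extracts the name and parameters of stored procedures, regardless of the setting chosen.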
