Skip to content

Commit df1a88a

Browse files
committed
Tweak Databricks docs
1 parent 7247fc6 commit df1a88a

File tree

2 files changed

+5
-4
lines changed

2 files changed

+5
-4
lines changed

articles/purview/catalog-lineage-user-guide.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ author: linda33wj
55
ms.author: jingwang
66
ms.service: purview
77
ms.topic: conceptual
8-
ms.date: 09/20/2022
8+
ms.date: 01/09/2023
99
---
1010
# Microsoft Purview Data Catalog lineage user guide
1111

@@ -44,6 +44,7 @@ Databases & storage solutions such as Oracle, Teradata, and SAP have query engin
4444

4545
|**Category**| **Data source** |
4646
|---|---|
47+
|Azure| [Azure Databricks](register-scan-azure-databricks.md)
4748
|Database| [Cassandra](register-scan-cassandra-source.md)|
4849
|| [Db2](register-scan-db2.md) |
4950
|| [Google BigQuery](register-scan-google-bigquery-source.md)|

articles/purview/register-scan-azure-databricks.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,7 @@ When scanning Azure Databricks source, Microsoft Purview supports:
3838

3939
This connector brings metadata from Databricks metastore. Comparing to scan via [Hive Metastore connector](register-scan-hive-metastore) in case you use it to scan Azure Databricks earlier:
4040

41-
- You can directly set up scan for Azure Databricks workspaces without direct HMS access. It uses Databricks personal access token for authentication and connect to a cluster to perform scan.
41+
- You can directly set up scan for Azure Databricks workspaces without direct HMS access. It uses Databricks personal access token for authentication and connects to a cluster to perform scan.
4242
- The Databricks workspace info is captured.
4343
- The relationship between tables and storage assets is captured.
4444

@@ -117,7 +117,7 @@ Use the following steps to scan Azure Databricks to automatically identify asset
117117

118118
1. **Cluster ID**: Specify the cluster ID that Microsoft Purview will connect to and perform the scan. You can find it in Azure Databricks workspace -> Compute -> your cluster -> Tags -> Automatically added tags -> `ClusterId`.
119119

120-
1. **Mount points**: Provide the mount point and Azure Storage source location string when you have external storage manually mounted to Databricks. Use the format `/mnt/<path>=abfss://<container>@<adls_gen2_storage_account>.dfs.core.windows.net/;/mnt/<path>=wasbs://<container>@<blob_storage_account>.blob.core.windows.net` It will be used to capture the relationship between tables and the corresponding storage assets in Microsoft Purview. This setting is optional, if it's not specified, such relationship will not be retrieved.
120+
1. **Mount points**: Provide the mount point and Azure Storage source location string when you have external storage manually mounted to Databricks. Use the format `/mnt/<path>=abfss://<container>@<adls_gen2_storage_account>.dfs.core.windows.net/;/mnt/<path>=wasbs://<container>@<blob_storage_account>.blob.core.windows.net` It will be used to capture the relationship between tables and the corresponding storage assets in Microsoft Purview. This setting is optional, if it's not specified, such relationship won't be retrieved.
121121

122122
You can get the list of mount points in your Databricks workspace by running the following Python command in a notebook:
123123

@@ -170,7 +170,7 @@ From the Databricks workspace asset, you can find the associated Hive Metastore
170170
171171
Refer to the [supported capabilities](#supported-capabilities) section on the supported Azure Databricks scenarios. For more information about lineage in general, see [data lineage](concept-data-lineage.md) and [lineage user guide](catalog-lineage-user-guide.md).
172172
173-
Go to the Hive table/view asset -> lineage tab, you can see the asset relationship when applicable. For relationship between table and external storage assets, you will see Hive Table asset and the storage asset are directly connected bi-directionally, as they mutually impact each other.
173+
Go to the Hive table/view asset -> lineage tab, you can see the asset relationship when applicable. For relationship between table and external storage assets, you'll see Hive Table asset and the storage asset are directly connected bi-directionally, as they mutually impact each other.
174174
175175
:::image type="content" source="media/register-scan-azure-databricks/lineage.png" alt-text="Screenshot that shows Azure Databricks lineage example." border="true":::
176176

0 commit comments

Comments
 (0)