Skip to content

Commit 800feef

Browse files
amee-sumoJV0812yleiferman
authored
Databricks Audit Source (C2C collection) (#5895)
* Databricks Audit Source (C2C collection) * added links and icon * Update databricks-audit-source.md * minor edits * Update docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/databricks-audit-source.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/databricks-audit-source.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/databricks-audit-source.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/databricks-audit-source.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/databricks-audit-source.md Co-authored-by: Jagadisha V <[email protected]> * Update databricks-audit-source.md * Update databricks-audit-source.md --------- Co-authored-by: Jagadisha V <[email protected]> Co-authored-by: Yair Leiferman <[email protected]>
1 parent ecfac96 commit 800feef

File tree

9 files changed

+166
-1
lines changed

9 files changed

+166
-1
lines changed
Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,12 @@
1+
---
2+
title: Databricks Audit Source (Collection)
3+
image: https://help.sumologic.com/img/sumo-square.png
4+
keywords:
5+
- c2c
6+
- databricks-audit-source
7+
hide_table_of_contents: true
8+
---
9+
10+
import useBaseUrl from '@docusaurus/useBaseUrl';
11+
12+
We're excited to announce the release of our new cloud-to-cloud source for Databricks Audit. This source securely collects audit logs from the Databricks platform and sends them to Sumo Logic for streamlined analysis. [Learn more](/docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/databricks-audit-source).

cid-redirects.json

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2948,6 +2948,7 @@
29482948
"/docs/integrations/microsoft-azure/microsoft-defender-for-identity/": "/docs/integrations/microsoft-azure/azure-security-microsoft-defender-for-identity",
29492949
"/cid/1112": "/docs/integrations/saas-cloud/carbon-black-inventory/",
29502950
"/cid/1111": "/docs/integrations/microsoft-azure/azure-open-ai",
2951+
"/cid/1113": "/docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/databricks-audit-source/",
29512952
"/Cloud_SIEM_Enterprise": "/docs/cse",
29522953
"/Cloud_SIEM_Enterprise/Administration": "/docs/cse/administration",
29532954
"/Cloud_SIEM_Enterprise/Administration/Cloud_SIEM_Enterprise_Feature_Update_(2022)": "/docs/cse/administration",

docs/integrations/product-list/product-list-a-l.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -186,6 +186,7 @@ For descriptions of the different types of integrations Sumo Logic offers, see [
186186
| :-- | :-- | :-- |
187187
| <img src={useBaseUrl('img/platform-services/automation-service/app-central/logos/darkowl.png')} alt="Thumbnail icon" width="100"/> | [DarkOwl](https://www.darkowl.com/) | Automation integration: [DarkOwl](/docs/platform-services/automation-service/app-central/integrations/darkowl/) |
188188
| <img src={useBaseUrl('img/platform-services/automation-service/app-central/logos/darktrace.png')} alt="Thumbnail icon" width="100"/> | [Darktrace](https://darktrace.com/) | Automation integration: [Darktrace](/docs/platform-services/automation-service/app-central/integrations/darktrace/) <br/>Cloud SIEM integration: [Darktrace](https://github.com/SumoLogic/cloud-siem-content-catalog/blob/master/vendors/91f4544f-6118-4bdc-8b30-01f045d20e4c.md) |
189+
| <img src={useBaseUrl('img/collector/databricks-icon.png')} alt="Thumbnail icon" width="80"/> | [Databricks](https://www.databricks.com/) | Collector: [Databricks Audit Source](/docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/databricks-audit-source/) |
189190
| <img src={useBaseUrl('https://upload.wikimedia.org/wikipedia/en/7/7e/Datadog_logo.svg')} alt="Thumbnail icon" width="50"/> | [Datadog](https://www.datadoghq.com/) | App: [Datadog](/docs/integrations/saas-cloud/datadog/) <br/>Webhook: [Webhook Connection for Datadog](/docs/alerts/webhook-connections/datadog/) |
190191
| <img src={useBaseUrl('img/integrations/misc/dataminr-logo.svg')} alt="Thumbnail icon" width="100"/> | [Dataminr](https://www.dataminr.com/) | Cloud SIEM integration: [Dataminr](https://github.com/SumoLogic/cloud-siem-content-catalog/blob/master/vendors/0019f757-3674-4688-9d6c-063366cfcfa9.md) <br/>Partner integration: [Dataminr Pulse for Sumo Logic](https://github.com/SumoLogic/sumologic-public-partner-apps/tree/master/DataminrPulse) |
191192
| <img src={useBaseUrl('/img/platform-services/automation-service/app-central/logos/datto.png')} alt="Thumbnail icon" width="50"/> | [Datto](https://www.datto.com/) | Automated integration: [Datto RMM](/docs/platform-services/automation-service/app-central/integrations/datto-rmm/) <br/>Cloud SIEM integration: [Datto](https://github.com/SumoLogic/cloud-siem-content-catalog/blob/master/vendors/1E4721C4-DC98-456D-B0DF-80365924683A.md) |
Lines changed: 109 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,109 @@
1+
---
2+
id: databricks-audit-source
3+
title: Databricks Audit Source
4+
sidebar_label: Databricks Audit
5+
tags:
6+
- cloud-to-cloud
7+
- databricks-audit-logs
8+
description: Learn how to collect Databricks Audit logs and send them to Sumo Logic for monitoring, auditing, and compliance.
9+
---
10+
11+
import ForwardToSiem from '/docs/reuse/forward-to-siem.md';
12+
import useBaseUrl from '@docusaurus/useBaseUrl';
13+
14+
<img src={useBaseUrl('img/collector/databricks-icon.png')} alt="icon" width="80"/>
15+
16+
The Sumo Logic source for Databricks enables you to collect audit logs from Databricks into Sumo Logic. This integration helps you to capture structured records of user and system activities within the Databricks workspace, including SQL queries, job executions, cluster events, and workspace changes. These logs facilitate auditing, security monitoring, and regulatory compliance.
17+
18+
## Data collected
19+
20+
| Polling Interval | Data |
21+
| :--- | :--- |
22+
| 5 minutes | [Audit Logs](https://docs.databricks.com/api/workspace/statementexecution/executestatement) |
23+
24+
## Setup
25+
26+
### Vendor configuration
27+
28+
The Databricks Audit source requires you to provide the Databricks Audit Base URL (API Gateway URL), Warehouse ID, Client ID, and Client Secret to configure the source. Follow the below steps to generate the required values:
29+
30+
#### Base URL
31+
32+
Follow the below steps to get the Base URL for user configuration:
33+
1. [Sign in](https://accounts.cloud.databricks.com/login?tuuid=ee23ab7a-a2b9-4512-8973-fc787eb3c01e) to your **Databricks** workspace.
34+
2. The Base URL is visible in your browser’s address bar after login. For example, `https://YOUR-INSTANCE.databricks.com`.
35+
36+
#### Warehouse ID
37+
38+
The Warehouse ID is required to query Databricks SQL and fetch audit logs. Follow the below steps to get the Warehouse ID for user configuration:
39+
1. [Sign in](https://accounts.cloud.databricks.com/login?tuuid=ee23ab7a-a2b9-4512-8973-fc787eb3c01e) to your **Databricks** workspace.
40+
2. Navigate to **SQL Warehouses** in the sidebar.
41+
3. Click the warehouse name you want to use.
42+
4. On the warehouse **details page**, select the **Properties** tab.
43+
5. Locate and copy the **Warehouse ID**. For example, `bd4dc8ef7e54782c`.
44+
45+
#### Client ID and Client Secret
46+
47+
To generate the Client ID and Client Secret, refer to the [Create an OAuth Secret](https://docs.databricks.com/aws/en/dev-tools/auth/oauth-m2m?language=Environment#-step-1-create-an-oauth-secret) section.
48+
49+
### Source configuration
50+
51+
When you create a Databricks Audit Source, you add it to a Hosted Collector. Before creating the Source, identify the Hosted Collector you want to use or create a new Hosted Collector. For instructions, see [Configure a Hosted Collector and Source](/docs/send-data/hosted-collectors/configure-hosted-collector).
52+
53+
To configure Databricks Audit Source:
54+
1. [**New UI**](/docs/get-started/sumo-logic-ui). In the Sumo Logic main menu select **Data Management**, and then under **Data Collection** select **Collection**. You can also click the **Go To...** menu at the top of the screen and select **Collection**.<br/>[**Classic UI**](/docs/get-started/sumo-logic-ui-classic). In the main Sumo Logic menu, select **Manage Data > Collection > Collection**.
55+
1. On the Collectors page, click **Add Source** next to a Hosted Collector.
56+
1. Search for and select the **Databricks Audit** icon.
57+
1. Enter a **Name** to display for the Source in Sumo Logic. The description is optional.
58+
1. (Optional) For **Source Category**, enter any string to tag the output collected from the Source. Category metadata is stored in a searchable field called `_sourceCategory`.
59+
1. (Optional) **Fields**. Click the **+Add Field** link to define the fields you want to associate. Each field needs a name (key) and value.
60+
* <img src={useBaseUrl('img/reuse/green-check-circle.png')} alt="green check circle.png" width="20"/> A green circle with a check mark is shown when the field exists and is enabled in the Fields table schema.
61+
* <img src={useBaseUrl('img/reuse/orange-exclamation-point.png')} alt="orange exclamation point.png" width="20"/> An orange triangle with an exclamation point is shown when the field doesn't exist in the Fields table schema. In this case, you'll see an option to automatically add or enable the nonexistent fields to the Fields table schema. If a field is sent to Sumo Logic that does not exist in the Fields schema it is ignored, known as dropped.
62+
1. Enter the [Base URL](#base-url) of your account.
63+
1. Enter the [Warehouse ID](#warehouse-id) collected from the [vendor configuration](#vendor-configuration] to fetch audit logs.
64+
1. Enter the [Client ID and Client Secret](#client-id-and-client-secret) collected from the [vendor configuration](#vendor-configuration] to authorize access to your Databricks resources.
65+
1. The **Polling Interval** is set for 5 minutes by default. You can adjust it based on your needs.
66+
1. When you are finished configuring the Source, click **Save**.
67+
68+
## JSON schema
69+
70+
Sources can be configured using UTF-8 encoded JSON files with the Collector Management API. See [Use JSON to Configure Sources](/docs/send-data/use-json-configure-sources) for details. 
71+
72+
| Parameter | Type | Value | Required | Description |
73+
|:--|:--|:--|:--|:--|
74+
| schemaRef | JSON Object | `{"type":"Databricks Audit Logs"}` | Yes | Define the specific schema type. |
75+
| sourceType | String | `"Universal"` | Yes | Type of source. |
76+
| config | JSON Object | [Configuration object](#configuration-object) | Yes | Source type specific values. |
77+
78+
### Configuration Object
79+
80+
| Parameter | Type | Required | Default | Description | Example |
81+
|:--|:--|:--|:--|:--|:--|
82+
| name | String | Yes | `null` | Type a desired name of the source. The name must be unique per Collector. This value is assigned to the [metadata](/docs/search/get-started-with-search/search-basics/built-in-metadata) field `_source`. | `"mySource"` |
83+
| description | String | No | `null` | Type a description of the source. | `"Testing source"`
84+
| category | String | No | `null` | Type a category of the source. This value is assigned to the [metadata](/docs/search/get-started-with-search/search-basics/built-in-metadata) field `_sourceCategory`. See [best practices](/docs/send-data/best-practices) for details. | `"mySource/test"`
85+
| fields | JSON Object | No | `null` | JSON map of key-value fields (metadata) to apply to the Collector or Source. Use the boolean field `_siemForward` to enable forwarding to SIEM.|`{"_siemForward": false, "fieldA": "valueA"}` |
86+
| baseURL | String | Yes | `null` | Base URL of the Databricks workspace. For example, `https://<workspace-name>.databricks.com`. | |
87+
| warehouseID | String | Yes | `null` | Unique identifier of the SQL Warehouse within the Databricks workspace, used to query and fetch audit logs. For example, `bd4dc8ef7e54782c`. | |
88+
| clientID | String | Yes | `null` | Client ID of the account. | |
89+
| clientSecret | String | Yes | `null` | Client Secret of the account | |
90+
| pollingIntervalMin | Integer | No | 5 minutes | Time interval after which the source will check for new data. <br/>Minimum: 5 minutes <br/>Maximum: 24 hours | |
91+
92+
### JSON example
93+
94+
```json reference
95+
https://github.com/SumoLogic/sumologic-documentation/blob/main/static/files/c2c/databricks-audit/example.json
96+
```
97+
98+
### Terraform example
99+
100+
```sh reference
101+
https://github.com/SumoLogic/sumologic-documentation/blob/main/static/files/c2c/databricks-audit/example.tf
102+
```
103+
104+
105+
## FAQ
106+
107+
:::info
108+
Click [here](/docs/c2c/info) for more information about Cloud-to-Cloud sources.
109+
:::

docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/index.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -272,7 +272,13 @@ In this section, we'll introduce the following concepts:
272272
</div>
273273
<div className="box smallbox card">
274274
<div className="container">
275-
<a href={useBaseUrl('docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/digital-guardian-source')}><img src={useBaseUrl('img/send-data/digital-guardian-logo.svg')} alt="Thumbnail icon" width="45"/><h4>Digital Guardian</h4></a>
275+
<a href="/docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/databricks-audit-source"><img src={useBaseUrl('img/collector/databricks-icon.png')} alt="icon" width="80"/><h4>Databricks Audit</h4></a>
276+
<p>Track user and system activities to support governance and compliance using Databricks Audit logs.</p>
277+
</div>
278+
</div>
279+
<div className="box smallbox card">
280+
<div className="container">
281+
<a href="/docs/send-data/hosted-collectors/cloud-to-cloud-integration-framework/digital-guardian-source"><img src={useBaseUrl('img/send-data/digital-guardian-logo.svg')} alt="Thumbnail icon" width="45"/><h4>Digital Guardian</h4></a>
276282
<p>Learn how to collect export data from the Digital Guardian and send it to Sumo Logic.</p>
277283
</div>
278284
</div>

sidebars.ts

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -463,6 +463,7 @@ module.exports = {
463463
'send-data/hosted-collectors/cloud-to-cloud-integration-framework/cyberark-source',
464464
'send-data/hosted-collectors/cloud-to-cloud-integration-framework/cyberark-audit-source',
465465
'send-data/hosted-collectors/cloud-to-cloud-integration-framework/cybereason-source',
466+
'send-data/hosted-collectors/cloud-to-cloud-integration-framework/databricks-audit-source',
466467
'send-data/hosted-collectors/cloud-to-cloud-integration-framework/digital-guardian-source',
467468
'send-data/hosted-collectors/cloud-to-cloud-integration-framework/docusign-source',
468469
'send-data/hosted-collectors/cloud-to-cloud-integration-framework/dropbox-source',
Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
{
2+
"api.version": "v1",
3+
"source": {
4+
"config": {
5+
"name": "Databricks Audit Logs",
6+
"baseURL": "https://<workspace-name>.databricks.com",
7+
"warehouseId": "bf7gs36gfr54392c",
8+
"clientId": "bf7gs36gfs-r54392c-lkdsdas",
9+
"clientSecret": "wdwxxxxxqwedwedxxxxxqewdxxxxxxqwedxxxx",
10+
"pollingInterval": "5m"
11+
},
12+
"schemaRef": {
13+
"type": "Databricks Audit Logs"
14+
},
15+
"sourceType": "Universal"
16+
}
17+
}
Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
resource "sumologic_cloud_to_cloud_source" "databricks-audit-source" {
2+
collector_id = sumologic_collector.collector.id
3+
schema_ref = {
4+
type = "Databricks Audit Source"
5+
}
6+
config = jsonencode({
7+
"name": "Databricks Audit Logs",
8+
"baseURL": "https://<workspace-name>.databricks.com",
9+
"warehouseId": "bf7gs36gfr54392c",
10+
"clientId": "bf7gs36gfs-r54392c-lkdsdas",
11+
"clientSecret": "wdwxxxxxqwedwedxxxxxqewdxxxxxxqwedxxxx",
12+
"pollingInterval": "5m"
13+
})
14+
}
15+
resource "sumologic_collector" "collector" {
16+
name = "my-collector"
17+
description = "Just testing this"
18+
}
21.8 KB
Loading

0 commit comments

Comments
 (0)