Veracity DataWorkbench Python


DataWorkbench

What is it?

Veracity DataWorkbench is a Python SDK designed to bridge your Databricks environment with Veracity Data Workbench. It simplifies access to data cataloging, lineage tracking, and APIs.

Table of Contents

Features
Installation
How to use it
Configuration
Examples
API Reference
Contributing
License

Features

DataCatalogue: Register and manage datasets in the Veracity Data Workbench Data Catalogue.

Installation

This package is pre-installed in Veracity-hosted Databricks environments (if analytics features are enabled).

To install the latest version locally:

pip install https://github.com/veracity/DataWorkbench/releases/latest/download/dataworkbench-1.0-py3-none-any.whl
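To confirm that the wheel landed in the active Python environment, a quick check along these lines can help. This is a minimal sketch: the helper name `is_installed` is illustrative and not part of the SDK, and it only tests whether the `dataworkbench` import name used in this README can be found.

```python
import importlib.util


def is_installed(package: str) -> bool:
    """Return True if the import system can locate the top-level `package`."""
    return importlib.util.find_spec(package) is not None


if __name__ == "__main__":
    # "dataworkbench" is the import name used in this README.
    print("dataworkbench installed:", is_installed("dataworkbench"))
```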

Make sure you have the required credentials and environment variables set when running outside Databricks.

How to use it

In Veracity-hosted Databricks, the SDK is ready to use:

import dataworkbench

To use it on your local machine, you must first set the environment variables that the SDK needs to connect to the Veracity Data Workbench API.
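This README does not list the variable names, so the sketch below uses placeholder names only. It shows one way to fail early when local configuration is incomplete. Both `EXAMPLE_REQUIRED_VARS` and `missing_variables` are hypothetical and not part of the SDK; substitute the names your Veracity Data Workbench setup actually requires.

```python
import os
from typing import Iterable, List, Mapping, Optional

# Placeholder names for illustration only; they are NOT the SDK's real
# variable names. Replace them with the ones your setup requires.
EXAMPLE_REQUIRED_VARS = (
    "EXAMPLE_API_URL",
    "EXAMPLE_CLIENT_ID",
    "EXAMPLE_CLIENT_SECRET",
)


def missing_variables(
    required: Iterable[str],
    env: Optional[Mapping[str, str]] = None,
) -> List[str]:
    """Return the names in `required` that are unset or empty in `env`."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]


if __name__ == "__main__":
    missing = missing_variables(EXAMPLE_REQUIRED_VARS)
    if missing:
        print("Set these variables before using the SDK:", ", ".join(missing))
    else:
        print("All example variables are set.")
```

Passing `env` explicitly keeps the check easy to test without touching the real process environment.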

Basic Example

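from dataworkbench import DataCatalogue

# `spark` is the SparkSession that Databricks notebooks provide automatically.
df = spark.createDataFrame([("a", 1), ("b", 2), ("c", 3)], ["letter", "number"])

datacatalogue = DataCatalogue()  # Naming subject to change
# Register the DataFrame as a dataset with a name, a description, and tags.
datacatalogue.save(df, "Dataset Name", "Description", tags={"environment": ["test"]})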

Examples

Saving a Spark DataFrame to the Data Catalogue

from dataworkbench import DataCatalogue

df = spark.createDataFrame([("a", 1), ("b", 2), ("c", 3)], ["letter", "number"])

datacatalogue = DataCatalogue()  # Naming subject to change
datacatalogue.save(df, "Dataset Name", "Description", tags={"environment": ["test"]})

API Reference

DataCatalogue

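save(df, name, description=None, tags=None): Saves a Spark DataFrame to the Data Workbench Data Catalogue. In the examples in this README, tags is a dictionary that maps each tag name to a list of string values.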
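The examples in this README pass `tags` as a dictionary whose values are lists of strings. Assuming `save` expects that shape (inferred from the examples, not confirmed by the API reference), a small helper can tidy user-supplied tags before the call. The name `normalize_tags` is hypothetical and not part of the SDK.

```python
from typing import Dict, List, Mapping, Optional, Sequence, Union

TagValue = Union[str, Sequence[str]]


def normalize_tags(tags: Optional[Mapping[str, TagValue]]) -> Dict[str, List[str]]:
    """Return tags as {key: [values, ...]}, the shape used in the README examples."""
    normalized: Dict[str, List[str]] = {}
    for key, value in (tags or {}).items():
        if not isinstance(key, str):
            raise TypeError(f"tag key must be a string, got {type(key).__name__}")
        # Wrap a lone string so that "test" becomes ["test"].
        values = [value] if isinstance(value, str) else list(value)
        if not all(isinstance(v, str) for v in values):
            raise TypeError(f"tag values for {key!r} must be strings")
        normalized[key] = values
    return normalized
```

With this helper, a call could look like `datacatalogue.save(df, "Dataset Name", "Description", tags=normalize_tags({"environment": "test"}))`.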

License

Dataworkbench is licensed under WHICH LICENSE.
