Skip to content

Conversation

@shabbirk-db
Copy link

Changes made:

  • Fixed _resources to create and write to Volumes instead of dbfs.
  • Commented out autoloader in the notebook
  • Created streaming_table ingestion layer directly in dbt for the three streams, reading from Volumes.
  • This will create full lineage in UC from Volumes all the way to Gold layer.

To do:

  • Update documentation to reflect the changes
  • Re-spec cluster to be UC native when created.
  • Potentially generate the data as part of the dbdemos installation so we can skip the data-ingestion notebook entirely and make a more streamlined experience (e.g. dbt + ML only)
  • Update the ML model to read from the new tables and make it UC friendly if needed

shabbirk-db and others added 10 commits April 29, 2024 07:28
Commented out autoloader (this is now a streaming table in dbt)
Fixed _resources to create and write to Volumes instead of dbfs.

To do:
* Update documentation to reflect the changes
* Re-spec cluster to be UC native when created.
* Potentially generate the data as part of the dbdemos installation so we can skip the data-ingestion notebook entirely and make a more streamlined experience (e.g. dbt + ML only)
Adding Volumes and UC compatibility
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant