[dbt-athena] Migration from Glue to dbt Python models: Lake Formation compatibility #974
moritzsanne
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Background
We're currently using Airflow for orchestration and dbt for SQL-based transformations. Some of our transformations require
custom code written in Python, which we've been running via AWS Glue. We're looking to consolidate our workflow by migrating these Python transformations to dbt Python models.
Issue
While exploring dbt Python models for Athena (per the documentation), we've encountered a potential blocker: Python models require "a Spark-enabled workgroup created in Athena," but Amazon Athena for Apache Spark doesn't support AWS Lake Formation.
This is problematic as we're currently migrating from IAM-based access control to Lake Formation-based access control for our data lake.
Questions
We're looking for ideas and guidance on:
Any insights, examples, or alternative approaches would be greatly appreciated. We're trying to determine the best path forward that balances our needs for Python transformations with our Lake Formation governance requirements.
Beta Was this translation helpful? Give feedback.
All reactions