feat(glue): add JDBC upstream lineage for Glue jobs by alokr-dhub · Pull Request #16505 · datahub-project/datahub

alokr-dhub · 2026-03-10T05:20:20Z

Summary

- Glue jobs that read from or write to JDBC sources (Postgres, MySQL, MariaDB, Redshift, Oracle, SQL Server) now produce lineage edges in DataHub. Previously these nodes fell through to the "unsupported connector" path and were silently skipped.
- Added `JDBC_PLATFORM_MAP` to map JDBC protocol names to DataHub platform names, and `JDBC_DEFAULT_SCHEMA` to inject the correct default schema (`public` for Postgres/Redshift, `dbo` for SQL Server) when `dbtable` has no schema prefix.
- The dataset URN is constructed as `database.schema.table` to match what the native source connectors (e.g. postgres, mysql) produce, enabling lineage stitching without additional configuration.
- No new dataset MCEs are emitted for JDBC nodes; the datasets are expected to already exist from a separate source connector ingestion run.
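
For reviewers who want a concrete picture of the naming rule described above, here is a minimal sketch. It is not the code from `glue.py`: the dictionary contents, the function name `build_jdbc_dataset_urn`, and its parameters are assumptions made for illustration, since this page only names `JDBC_PLATFORM_MAP` and `JDBC_DEFAULT_SCHEMA` without showing their values. The sketch also assumes the protocol and database name have already been extracted from the JDBC connection.

```python
from typing import Dict, Optional

# Assumed contents: the PR names this constant, but its real entries are not shown here.
JDBC_PLATFORM_MAP: Dict[str, str] = {
    "postgresql": "postgres",
    "mysql": "mysql",
    "mariadb": "mariadb",
    "redshift": "redshift",
    "oracle": "oracle",
    "sqlserver": "mssql",
}

# Assumed contents: default schema to inject when dbtable has no schema prefix.
JDBC_DEFAULT_SCHEMA: Dict[str, str] = {
    "postgres": "public",
    "redshift": "public",
    "mssql": "dbo",
}


def build_jdbc_dataset_urn(
    protocol: str, database: str, dbtable: str, env: str = "PROD"
) -> Optional[str]:
    """Hypothetical helper: build a dataset URN for a JDBC node, or None if unsupported."""
    platform = JDBC_PLATFORM_MAP.get(protocol.lower())
    if platform is None:
        # Unknown protocol: the caller would skip lineage for this node.
        return None

    if "." in dbtable:
        # dbtable already carries a schema prefix, e.g. "audit.events".
        qualified = dbtable
    else:
        # Inject the platform's default schema when one is defined.
        default_schema = JDBC_DEFAULT_SCHEMA.get(platform)
        qualified = f"{default_schema}.{dbtable}" if default_schema else dbtable

    name = f"{database}.{qualified}"
    return f"urn:li:dataset:(urn:li:dataPlatform:{platform},{name},{env})"


print(build_jdbc_dataset_urn("postgresql", "sales", "orders"))
```

In this sketch a platform with no entry in the default-schema table (MySQL, for example) falls back to `database.table`; whether the merged code behaves the same way should be checked against the diff.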

The PR conforms to DataHub's Contributing Guideline (particularly PR Title Format)
Links to related issues (if applicable)
Tests for the changes have been added/updated (if applicable)
Docs related to the changes have been added/updated (if applicable). If a new feature has been added a Usage Guide has been added for the same.
For any breaking change/potential downtime/deprecation/big changes an entry has been made in Updating DataHub

codecov · 2026-03-10T05:24:10Z

Codecov Report

❌ Patch coverage is 85.81560% with 20 lines in your changes missing coverage. Please review.
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
...ingestion/src/datahub/ingestion/source/aws/glue.py	85.81%	20 Missing ⚠️

📢 Thoughts on this report? Let us know!

rajatoss · 2026-03-10T06:49:41Z

Connector Tests Results

All connector tests passed for commit b215298

View full test logs →

To skip connector tests, add the skip-connector-tests label (org members only).

Autogenerated by the connector-tests CI pipeline.

gabe-lyons · 2026-03-10T17:11:21Z

Linear: ING-1866

Thanks for your contribution! We have created an internal ticket to track this PR. A member of the core DataHub team will be assigned to review it within the next few business days - you will get a follow-up comment once a reviewer is assigned.

github-actions · 2026-03-10T17:15:07Z

Linear: ING-1869

Thanks for your contribution! We have created an internal ticket to track this PR. A member of the core DataHub team will be assigned to review it within the next few business days - you will get a follow-up comment once a reviewer is assigned.

github-actions · 2026-03-10T20:11:48Z

Your PR has been assigned to @treff7es (tamas) for review (ING-1866).

alokr-dhub · 2026-03-11T14:07:43Z

Marking this as a draft for now to handle any upcoming edge cases.

github-actions · 2026-03-16T10:54:53Z

Linear: ING-1932

alwaysmeticulous · 2026-03-24T19:51:56Z

🔴 Meticulous spotted visual differences in 35 of 1809 screens tested: view and approve differences detected.

Meticulous evaluated ~8 hours of user flows against your PR.

Last updated for commit bc78c5a (fix: review comments). This comment will update as new commits are pushed.

codecov · 2026-03-24T19:58:59Z

Bundle Report

Changes will increase total bundle size by 17.67kB (0.08%) ⬆️. This is within the configured threshold ✅

Detailed changes

Bundle name	Size	Change
datahub-react-web-esm	22.7MB	17.67kB (0.08%) ⬆️

Affected Assets, Files, and Routes:

view changes for bundle: datahub-react-web-esm

Assets Changed:

Asset Name	Size Change	Total Size	Change (%)
`assets/index-*.js`	4.54kB	12.45MB	0.04%
`assets/fabriclogo-*.svg` (New)	8.86kB	8.86kB	100.0% 🚀
`assets/fabricdatafactorylogo-*.svg` (New)	4.27kB	4.27kB	100.0% 🚀

github-actions bot added the ingestion PR or Issue related to the ingestion of metadata label Mar 10, 2026

github-actions bot deployed to datahub-wheels (Preview) March 10, 2026 05:22 View deployment

vercel bot deployed to Preview March 10, 2026 05:33 View deployment

github-actions bot deployed to datahub-wheels (Preview) March 10, 2026 06:17 View deployment

vercel bot deployed to Preview March 10, 2026 06:30 View deployment

alokr-dhub marked this pull request as ready for review March 10, 2026 07:11

maggiehays added the pending-submitter-response Issue/request has been reviewed but requires a response from the submitter label Mar 10, 2026

alokr-dhub marked this pull request as draft March 10, 2026 17:14

alokr-dhub marked this pull request as ready for review March 10, 2026 17:14

github-actions bot requested a review from treff7es March 10, 2026 20:11

github-actions bot deployed to datahub-wheels (Preview) March 11, 2026 13:58 View deployment

alokr-dhub marked this pull request as draft March 11, 2026 14:07

github-actions bot deployed to datahub-wheels (Preview) March 11, 2026 14:11 View deployment

vercel bot deployed to Preview March 11, 2026 14:26 View deployment

ligfx reviewed Mar 11, 2026

github-actions bot deployed to datahub-wheels (Preview) March 12, 2026 11:24 View deployment

vercel bot deployed to Preview March 12, 2026 11:39 View deployment

github-actions bot deployed to datahub-wheels (Preview) March 16, 2026 09:13 View deployment

vercel bot deployed to Preview March 16, 2026 09:27 View deployment

vercel bot deployed to Preview March 16, 2026 10:09 View deployment

alokr-dhub marked this pull request as ready for review March 16, 2026 10:54

github-actions bot deployed to datahub-wheels (Preview) March 16, 2026 11:01 View deployment

vercel bot deployed to Preview March 16, 2026 11:15 View deployment

github-actions bot deployed to datahub-wheels (Preview) March 24, 2026 12:02 View deployment

vercel bot deployed to Preview March 24, 2026 12:17 View deployment

alokr-dhub added 2 commits March 25, 2026 01:10

fix: added support for target platform instance mappling

ed4462e

fix: updated docs

a168feb

github-actions bot deployed to datahub-wheels (Preview) March 24, 2026 19:50 View deployment

github-actions bot deployed to datahub-project-web-react (Preview) March 24, 2026 19:51 View deployment

vercel bot deployed to Preview March 24, 2026 20:04 View deployment

Merge branch 'master' into feature/support-glue-job-lineage-for-upstr…

1f4b9c0

…eam-jdbc-connectors

github-actions bot deployed to datahub-wheels (Preview) March 25, 2026 04:58 View deployment

vercel bot deployed to Preview March 25, 2026 05:12 View deployment

treff7es reviewed Mar 27, 2026

maggiehays added pending-submitter-response Issue/request has been reviewed but requires a response from the submitter and removed needs-review Label for PRs that need review from a maintainer. labels Mar 27, 2026

fix: review comments

bc78c5a

github-actions bot deployed to datahub-wheels (Preview) March 30, 2026 10:03 View deployment

github-actions bot deployed to datahub-project-web-react (Preview) March 30, 2026 10:06 View deployment

vercel bot deployed to Preview March 30, 2026 10:17 View deployment

Merge branch 'master' into feature/support-glue-job-lineage-for-upstr…

c70793f

…eam-jdbc-connectors

github-actions bot deployed to datahub-wheels (Preview) March 30, 2026 11:10 View deployment

vercel bot deployed to Preview March 30, 2026 11:25 View deployment

treff7es approved these changes Mar 30, 2026

maggiehays added pending-submitter-merge and removed pending-submitter-response Issue/request has been reviewed but requires a response from the submitter labels Mar 30, 2026

Merge branch 'master' into feature/support-glue-job-lineage-for-upstr…

b215298

…eam-jdbc-connectors

github-actions bot deployed to datahub-wheels (Preview) March 31, 2026 13:28 View deployment

vercel bot deployed to Preview March 31, 2026 13:43 View deployment

alokr-dhub merged commit b8bba2f into master Apr 1, 2026
72 checks passed

alokr-dhub deleted the feature/support-glue-job-lineage-for-upstream-jdbc-connectors branch April 1, 2026 08:28

alokr-dhub merged 24 commits into master from feature/support-glue-job-lineage-for-upstream-jdbc-connectors

6 participants
