Reference Node Name vs Source Name in Lineage #6147
Closed
johnpawleylr
started this conversation in
General
Replies: 1 comment 3 replies
-
|
@vchan seems like @johnpawleylr I need to understand the situation a bit more to decide whether this is something that should change. Can you provide a concrete, minimal example that demonstrates the issue with the current logic? Thanks. |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hey all,
I've ran into something which I think is a small bug in the
lineagecode. A query like the following:gives a reference node name of dataset when tracing
xorcreated_at. I believe it should bealiasedas that is how dataset is aliased within the final select query. This is important to something I am working on, as if thereference_node_nameisdatasetthen it is difficult to follow the chain of nodes in the lineage as the information aboutaliasedis lost.I've raised a change to fix this, the alteration is very minor
reference_node_name=scope_name,rather thanreference_node_name=reference_node_nameonlineage.py:184. But it is causing an existing test to fail,test_lineage.py:427. This is because "dataset" is being assigned as thereference_node_nameas well as thesource_name.So what i want to know, is what is the intention behind these two labels? Am I free to change the existing test? Or is
reference_node_namebeing empty in this case a hard requirement?Beta Was this translation helpful? Give feedback.
All reactions