Description
I'm new to the GX framework and to data management in general, but I'm trying to build a data-quality framework for my project.
Initially I used all the basic abstractions, chained in the correct sequence, to validate data in MSSQL.
In the end everything finishes with a Checkpoint run, which runs the ValidationDefinition and then the Action. I also use the really useful unexpected_index_column_names option (with an _id column) in result_format for Checkpoint.run(), so that it is included in the results and visible in my Data Docs HTML.
It works perfectly when I have one table (that is, the whole set of abstractions ending with a VD). I then added a second table and everything was still fine.
But when I added a third table that doesn't have an _id column, the run failed.
I then started looking for a way to define unexpected_index_column_names at the table level, and found that I can define it only in the run() function of a Checkpoint, VD, or batch.
This means that if I want to use a Checkpoint, I can include only VDs whose tables have the column named in unexpected_index_column_names; otherwise the checks for tables that lack the index column will error.
If I'm missing something, let me know. I would like to use the full cycle of abstractions with predefined actions, which requires a Checkpoint, but I don't want to give up unexpected_index_column_names because it looks really useful.
Describe the solution you'd like
I would like to be able to set unexpected_index_column_names on any of the table-level abstractions.
Describe alternatives you've considered
Right now I loop over all my VDs and call run() on each, passing a result_format with the unexpected_index_column_names that is correct for the table it uses. After that, I build the site separately with context.build_data_docs(site_names=site_name) (with no action; before this I used gx.checkpoint.actions.UpdateDataDocsAction).
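For reference, here is a minimal sketch of that workaround. The helper names (`build_result_format`, `run_all`) and the name-to-columns mapping are hypothetical, introduced only for illustration; the sketch assumes each VD object exposes a `name` attribute and a `run(result_format=...)` method, and that `"COMPLETE"` is the desired result format level. Tables without an index column simply get no `unexpected_index_column_names` key.

```python
from typing import Any, Dict, Iterable, Mapping, Sequence


def build_result_format(index_columns: Sequence[str]) -> Dict[str, Any]:
    """Build a result_format dict for one table (hypothetical helper).

    The unexpected_index_column_names key is only added when the table
    actually has index columns, so tables without _id are not affected.
    """
    result_format: Dict[str, Any] = {"result_format": "COMPLETE"}  # assumed level
    if index_columns:
        result_format["unexpected_index_column_names"] = list(index_columns)
    return result_format


def run_all(
    validation_definitions: Iterable[Any],
    index_columns_by_vd: Mapping[str, Sequence[str]],
) -> Dict[str, Any]:
    """Run every VD with the result_format that matches its table.

    index_columns_by_vd maps a VD name to its index columns; VDs missing
    from the mapping are run without unexpected_index_column_names.
    """
    results: Dict[str, Any] = {}
    for vd in validation_definitions:
        columns = index_columns_by_vd.get(vd.name, [])
        results[vd.name] = vd.run(result_format=build_result_format(columns))
    return results
```

After `run_all(...)` returns, the site is built separately with `context.build_data_docs(site_names=site_name)`, as described above. This keeps the per-table index columns in one mapping, but it still bypasses Checkpoint and its predefined actions, which is the limitation this request is about.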