-
Notifications
You must be signed in to change notification settings - Fork 34
Description
I am wondering whether you all had previously considered reading in assert/verify statements from a file.
In my explorations of TDD with data analysis, I came across the nifty tdda python package. It appears to have a fair amount of overlap with what is doable in assertr. My sense from their white papers is that tdda can algorithmically create constraints by summarizing columns in a data frame and writing those to a file or some other data structure. Those constraints can be modified to fine tune them and re-used in subsequent assertions when new data is considered.
I think it would be pretty powerful to have a yaml or json file that specifies the assertions for a file that can be loaded by assertr and applied to a data frame within a pipeline. A benefit of having a file-based approach to this would be that the file could also serve as a type of data dictionary that would be more readable than assertr code.
I suspect this might be a fair amount of effort to implement as was curious if it was something that's already on your roadmap or if you would be interested in contributions along these lines.