We discussed this at lunch the other day. We thought for a given project we might have a folder containing:
-
In recent discussions we've been thinking of having scripts for the processing from Raw to Model, and version-controlling those scripts. Depending on how long processing takes, we might also save the "Tulipa-friendly" versions of data sources, which users can manipulate/combine for scenarios. But this runs the risk of users permanently editing these "Tulipa-friendly" files without documenting what they did. Data provenance is a hard problem. This (or something similar) might be a solution (thanks @suvayu!):
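To make the provenance risk concrete, here is a minimal sketch of one possible safeguard, not an existing Tulipa feature: store a small manifest next to each saved "Tulipa-friendly" file that records hashes of the raw inputs, the processing script, and the output. If someone edits the saved file by hand, its hash no longer matches. The function names, file names, and manifest layout are all hypothetical.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def file_sha256(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's contents."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def build_manifest(raw_files, script: Path, output: Path) -> dict:
    """Record which inputs and which script version produced an output file."""
    return {
        "inputs": {p.name: file_sha256(p) for p in raw_files},
        "script": {script.name: file_sha256(script)},
        "output": {output.name: file_sha256(output)},
    }


def is_unmodified(manifest: dict, output: Path) -> bool:
    """True if the output file still matches the hash stored in the manifest."""
    return manifest["output"].get(output.name) == file_sha256(output)


def demo():
    """Create toy files, write a manifest, then simulate a manual edit."""
    with tempfile.TemporaryDirectory() as tmp:
        tmp = Path(tmp)
        raw = tmp / "raw.csv"                # hypothetical raw data source
        script = tmp / "process.py"          # hypothetical processing script
        out = tmp / "tulipa_friendly.csv"    # hypothetical saved output

        raw.write_text("a,b\n1,2\n")
        script.write_text("# processing script\n")
        out.write_text("a,b\n1,2\n")

        manifest = build_manifest([raw], script, out)
        (tmp / "manifest.json").write_text(json.dumps(manifest, indent=2))

        before = is_unmodified(manifest, out)
        out.write_text("a,b\n1,3\n")         # undocumented manual edit
        after = is_unmodified(manifest, out)
        return before, after


before_edit, after_edit = demo()
```

This only detects undocumented edits; it does not say what changed. Keeping the scripts under version control, as suggested above, is what lets the file be regenerated.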
-
@suvayu asked the question in the TIO repo, so thought I'd post it here for discussion.
The thought is that while we're doing analyses, we'll tend to save off intermediate datasets, duplicating a lot of data.
Of course Julia+DuckDB can do database manipulations without duplicating data, but then we have to decide the workflow and when we WANT it to save.
I see creating a scenario as a building-up process. So when do we save and when do we leave something as a "run this script again" process?
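As a rough illustration of the "save" versus "run this script again" choice: a view stores only the query and is recomputed on every read, while a table created from that query is a saved snapshot. The sketch below uses Python's built-in `sqlite3` as a stand-in so it runs without extra packages; DuckDB also supports `CREATE VIEW` and `CREATE TABLE ... AS SELECT`, so the same idea should carry over to Julia+DuckDB. The table names, columns, and the 1.5 scaling factor are made up for the example.

```python
import sqlite3

con = sqlite3.connect(":memory:")

# Hypothetical raw table standing in for an imported data source.
con.execute("CREATE TABLE raw_demand (region TEXT, hour INTEGER, mw REAL)")
con.executemany(
    "INSERT INTO raw_demand VALUES (?, ?, ?)",
    [("NL", 1, 100.0), ("NL", 2, 120.0), ("BE", 1, 80.0)],
)

# "Run this script again": a view stores the query, not the rows.
con.execute(
    "CREATE VIEW scenario_high AS "
    "SELECT region, hour, mw * 1.5 AS mw FROM raw_demand"
)

# "Save": materialise the result once we decide it is worth keeping.
con.execute("CREATE TABLE scenario_high_saved AS SELECT * FROM scenario_high")

# A later change to the raw data shows up in the view, not in the saved copy.
con.execute("UPDATE raw_demand SET mw = 200.0 WHERE region = 'NL' AND hour = 1")

query = "SELECT mw FROM {} WHERE region = 'NL' AND hour = 1"
view_rows = con.execute(query.format("scenario_high")).fetchall()
saved_rows = con.execute(query.format("scenario_high_saved")).fetchall()
```

The view always follows the current raw data, while the saved table freezes the scenario at the moment it was written, so each save point is also a place where data gets duplicated.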
Issues to think about:
@gnawin @datejada @nope82 @g-moralesespana