I think there may be an argument for splitting out the metadata jsonschema into a separate repo, probably modelled after the vega spec repo.
Would be good to get the team to brainstorm pros and cons. Here's some I can think of:
Pros
- Can version
etl_manager independently
- Probably simpler to version metadata schemas, and make sure all historical schemas are easily accessible. (You
- Encourages us to properly version our metadata schema, like vega do, using semver.
- Makes it clearer that our metadata schema isn't intrinsically tied to
etl_manager. etl_manager is basically a set of converters, jobrunners, and metadata auditing tools.
- Makes it easier for us to talk about our metadata schema. It's no longer hidden away in etl manager, but a spec in its own right
Cons
- Need to version two things, so e.g. will need to be careful about compatibility between etl manager and metadata schema versions