Should we extend the functionality of this repo and turn it into a package to assist users to perform fake/synthetic/constructed data simulations?
Many times this synthetic data is data generated from a model, but sometimes we also want synthetic data without having or prior to contructing a model, like linear regression with specific R2 or a dataset were some of the covariates are noise. Or maybe some blobs (like scikit-learn functionality)...