How are new datasets added? Clarify that the archives are hosted and versioned in https://github.com/srlearn/datasets