Open
Description
What would you like to discuss?
Maybe this is already documented and I'm missing it, but I don't currently understand where the actual data (resources) are meant to be stored/uploaded for individual data packages. Currently, the data packages that Sprout creates are tightly integrated with Git/GitHub, which is traditionally not friendly toward large binary data files like Parquet. Is the idea to use something like Git LFS, DVC (https://dvc.org/), or on-premises cloud storage for the data resources, while all the metadata and Python package-related files are tracked via Git?
I noticed that there was a mention of LFS in one of the sample projects: seedcase-project/example-rhesus-monkeys#24.