Skip to content

More need for a split dataset command #100

@adswa

Description

@adswa

Origin: Office hour

Amir came into the office hour, and presented a superdataset, into which he accidentally saved ~380k files with a total disk space usage of multiple TB. The superdataset became painfully slow in response. He inquired how to get the data into a subdataset instead of having it in the superdataset directly. We pointed him to https://knowledge-base.psychoinformatics.de/kbi/0013/index.html and advised to split his directory (era_5) into year-wise subdatasets.

This support event is mostly documenting the need for a command to split datasets similar to how https://knowledge-base.psychoinformatics.de/kbi/0013/index.html outlines, but with DataLad tooling for ease of use.

TODO (not necessarily to be performed in this order)

  • Inform OP/Add reference to this issue at origin
  • Clarifying Qs asked or not needed
  • Nature of the issue is understood
  • Inform OP about resolution

Metadata

Metadata

Assignees

No one assigned

    Labels

    support-trackerTrack a support event that occurred elsewhere

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions