Skip to content

Conversation

@lhoestq
Copy link
Member

@lhoestq lhoestq commented Dec 19, 2025

will be useful for the "Edit dataset" feature on the Hub, to know which file a row idx belongs to and its location within the file

additional details: I had to update datasets so I added minor improvements for pdf, hdf5 and nifti support

will need huggingface/datasets#7943

@lhoestq lhoestq changed the title Store original shard lengths Store original shard paths and lengths Jan 13, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants