Skip to content
Discussion options

You must be logged in to vote

The split parameter is enabled for local file datasets via #2281.

So you can add something like split: train[:10%] to your dataset's config and preprocess and load only 10% of the dataset that way. Alternatively you can set shard as discussed by @NanoCode012 here.

You can set a fractional epoch via #2282.

Replies: 2 comments 10 replies

Comment options

You must be logged in to vote
9 replies
@mashdragon
Comment options

@NanoCode012
Comment options

@mashdragon
Comment options

@mashdragon
Comment options

@NanoCode012
Comment options

Comment options

You must be logged in to vote
1 reply
@NanoCode012
Comment options

Answer selected by mashdragon
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants