How to shuffle & optimize Hugging Face datasets for LLM pre-training with StreamingDataset? #1229
