Does spaCy require shuffling data before training? #10208

kanayer · 2022-02-04T02:50:33Z

kanayer
Feb 4, 2022

Feb 7, 2022

Hi @kannaricci ,

Ideally you want to shuffle your data to ensure that the training batches are more representative of the dataset, and that it's not dependent on some order / index. If you set max_epochs>=0 during training, the training Corpus is shuffled automatically every epoch, so you don't need to worry about shuffling it by yourself :)

View full answer

ljvmiranda921 · 2022-02-07T07:23:13Z

ljvmiranda921
Feb 7, 2022

Hi @kannaricci ,

Ideally you want to shuffle your data to ensure that the training batches are more representative of the dataset, and that it's not dependent on some order / index. If you set max_epochs>=0 during training, the training Corpus is shuffled automatically every epoch, so you don't need to worry about shuffling it by yourself :)

1 reply

kanayer Feb 7, 2022
Author

Thank you very much for your answer!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Does spaCy require shuffling data before training? #10208

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

Does spaCy require shuffling data before training? #10208

Uh oh!

Uh oh!

kanayer Feb 4, 2022

Replies: 1 comment · 1 reply

Uh oh!

ljvmiranda921 Feb 7, 2022

Uh oh!

kanayer Feb 7, 2022 Author

kanayer
Feb 4, 2022

Replies: 1 comment 1 reply

ljvmiranda921
Feb 7, 2022

kanayer Feb 7, 2022
Author