
Parallelize Pretraining #3350

@alejandrojcastaneira

Description


Hello, great feature! I'm currently running some experiments on my specific use cases.
However, I've noticed that pretraining is considerably slow: one epoch took almost two days on a 1B-word corpus, at an average of 4,800 w/s.

So I checked the task's resource usage. I'm training on a single machine with dual 12-core Xeon CPUs (24 cores in total) and no GPU, and I noticed that it only uses one core at a time.

Would it be possible to add an option for the desired number of workers to this task? That way we could use the maximum number of cores and parallelize the pretraining, which could considerably speed up processing. A rough sketch of the kind of pattern I mean is below.
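To make the request concrete, here is a minimal sketch of the worker-pool pattern I have in mind, using only Python's standard library. This is not spaCy's actual pretraining API: `process_batch`, `iter_batches` and `texts.jsonl` are hypothetical placeholders for the per-batch pretraining work and the raw corpus file.

```python
# Minimal sketch of the requested behaviour (standard library only).
# NOT spaCy's actual API: process_batch stands in for the real per-batch
# pretraining step, and texts.jsonl is a hypothetical corpus file with
# one raw text per line.
from multiprocessing import Pool


def process_batch(texts):
    # Placeholder for the real work: tokenize, predict, update weights.
    return len(texts)


def iter_batches(path, batch_size=1000):
    # Stream the corpus in fixed-size batches so workers stay busy.
    batch = []
    with open(path, encoding="utf8") as f:
        for line in f:
            batch.append(line.strip())
            if len(batch) == batch_size:
                yield batch
                batch = []
    if batch:
        yield batch


if __name__ == "__main__":
    # The number of worker processes is what I'd like to be able to set
    # from the pretrain command (here hard-coded to my 24 cores).
    with Pool(processes=24) as pool:
        total = 0
        for n in pool.imap_unordered(process_batch, iter_batches("texts.jsonl")):
            total += n
        print(f"processed {total} texts")
```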

Best regards

Labels: enhancement, perf / speed, scaling, training, 🌙 nightly
