Skip to content

Added torchrun compatibility for distributet training across multiple GPUs in a single node (single instance)#4766

Merged
sage-maker merged 22 commits intoaws:masterfrom
brunopistone:master
Aug 9, 2024
Merged

Added torchrun compatibility for distributet training across multiple GPUs in a single node (single instance)#4766
sage-maker merged 22 commits intoaws:masterfrom
brunopistone:master

Commits

Commits on Jul 2, 2024

Commits on Jul 25, 2024

Commits on Jul 26, 2024

Commits on Aug 7, 2024

Commits on Aug 8, 2024