Added torchrun compatibility for distributet training across multiple GPUs in a single node (single instance)#4766
Merged
sage-maker merged 22 commits intoaws:masterfrom Aug 9, 2024
brunopistone:master
Merged
Added torchrun compatibility for distributet training across multiple GPUs in a single node (single instance)#4766sage-maker merged 22 commits intoaws:masterfrom brunopistone:master
sage-maker merged 22 commits intoaws:masterfrom
brunopistone:master
Commits
Commits on Jul 2, 2024
Commits on Jul 25, 2024
Commits on Jul 26, 2024
Commits on Aug 7, 2024
Commits on Aug 8, 2024
- authored
- committed
- committed
- committed
- committed
- authored
- committed
- committed
- committed
- authored
- authored
- authored
- authored
- committedEC2 Default User
- committed
- committed