Memory of gradient calculation distributed across GPUs? #19393

rkruegs123 · 2024-01-17T04:03:52Z

rkruegs123
Jan 17, 2024

I have a question about how the gradient calculation can be distributed across GPUs.

Consider a calculation in which there is pmap (or shmap/xmap) inside an outer scan. See #1369 for an example. Then, consider taking the gradient of this calculation. Does the gradient calculation get distributed across devices?

rkruegs123 · 2024-03-07T04:09:38Z

rkruegs123
Mar 7, 2024
Author

Just bumping this @mattjj

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Memory of gradient calculation distributed across GPUs? #19393

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Memory of gradient calculation distributed across GPUs? #19393

Uh oh!

rkruegs123 Jan 17, 2024

Replies: 1 comment

Uh oh!

rkruegs123 Mar 7, 2024 Author

rkruegs123
Jan 17, 2024

rkruegs123
Mar 7, 2024
Author