pmap with varying degree of parallelization #8617
Unanswered
mattiasmar
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello,
Is is possible to use pmap with unique number of devices used for parallelization for each layer? (E.g. for some matmuls use many devices for parallelization and for others use just a few or even just one TPU/GPU.)
If yes, there an example available demonstrating that? Does JAX automatically handle the gather and scatter operations?
Beta Was this translation helpful? Give feedback.
All reactions