Any way to submit multi-node CPU jobs #8621
-
It would be AMAZING if there is any way to submit jobs across nodes in JAX. We want to use a total of 80 CPU cores but our cluster has 40 cores per node. So we want to use 2 nodes. On trying this out we find that ALL the cores on only one of the nodes is being used while NONE in the other is being used. We did come across pjit but it seems it is only for GPU or TPU? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
https://mpi4jax.readthedocs.io (note: I'm the author) Disclaimer: this works great on university clusters that are already setup with MPI. |
Beta Was this translation helpful? Give feedback.
https://mpi4jax.readthedocs.io (note: I'm the author)
Disclaimer: this works great on university clusters that are already setup with MPI.
If the cluster is not setup for MPI this might be somewhat complicated.