Pipeline parallelism hello world #17246
Unanswered
dionhaefner asked this question in Q&A
Replies: 2 comments 5 replies
-
I think you have to rely on something like alpa for now.
1 reply
-
Thank you for your question. Indeed, there is no ergonomic recipe for this today; however, a reference implementation you might want to look at is in Praxis. I don't know enough about your use case; do you mind sharing more details?
4 replies
-
I'm experimenting with pipeline parallelism, where subsequent computations are executed on different devices.
I haven't been able to jit a simple function that takes parameters living on different devices. I assume this needs some explicit sharding information, but I got confused by the tutorials, which seem to be written for the more advanced case of sharding individual axes rather than placing entire arrays on different devices.
Example code:
Error:
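Neither the original snippet nor the error message survived the page load. As an illustration only (not the poster's code), a minimal sketch of the scenario: a single `jax.jit` over arguments committed to different devices fails, and one workaround is to split the pipeline into per-stage jitted functions with an explicit activation transfer between them. All names (`stage1`, `w1`, etc.) are hypothetical:

```python
import jax
import jax.numpy as jnp

devices = jax.devices()
d0, d1 = devices[0], devices[-1]  # falls back to the same device on a 1-device host

# Commit each stage's parameters to its own device.
w1 = jax.device_put(jnp.ones((4, 4)), d0)  # stage-1 weights on d0
w2 = jax.device_put(jnp.ones((4, 4)), d1)  # stage-2 weights on d1

# One jitted function per pipeline stage; each runs on the device
# its (committed) inputs live on.
stage1 = jax.jit(lambda x, w: x @ w)
stage2 = jax.jit(lambda h, w: jax.nn.relu(h) @ w)

x = jax.device_put(jnp.ones((2, 4)), d0)
h = stage1(x, w1)          # runs on d0
h = jax.device_put(h, d1)  # explicit activation transfer between stages
y = stage2(h, w2)          # runs on d1
```

With more than one device, passing `x` (on `d0`) and `w2` (on `d1`) to a single jitted function is what triggers the incompatible-devices complaint; the per-stage split above sidesteps it at the cost of a host-driven transfer between stages.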