Implementing scatter(and gather) via one-hot for multiple indices #21784

AakashKumarNain · 2024-06-10T19:17:02Z


I am having a hard time JIT-compiling a function that updates an array at multiple indices. Before describing the solutions I tried, let me explain the problem first.

I have an array arr1 of shape (T, C, D), where T represents the time dimension, C represents the number of channels, and D represents the depth dimension. At every step in the forward pass, we sample some positions (< T) representing the indices at which arr1 is to be updated from another array, say arr2 (same shape as arr1). For example,

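import jax
import jax.numpy as jnp
import numpy as np

@jax.jit
def update_positions(arr1, arr2, positions):
    # Functional update: copy the selected rows of arr2 into arr1
    arr1 = arr1.at[positions, :, :].set(arr2[positions, :, :])
    return arr1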

T = 1024
C = 32
D = 256

positions = jnp.arange(10)
arr1 = jnp.zeros((T, C, D))
arr2 = jnp.asarray(np.random.rand(T, C, D)) 

# Update the array based on positions
arr1 = update_positions(arr1, arr2, positions)

The Problem

Given that positions can vary in length from 0 to 1023, JAX recompiles the function every time the size of the positions array changes. In addition, if the positions array is large, the updates can be extremely slow.
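The recompilation described above can be observed with a small sketch. It relies on the fact that Python side effects inside a jitted function run only while JAX traces it, so a counter increments once per new input shape. The `trace_count` variable and the reduced array sizes are assumptions made for illustration; they are not part of the original post.

```python
import jax
import jax.numpy as jnp

trace_count = 0  # hypothetical counter, incremented only while JAX traces

@jax.jit
def update_positions(arr1, arr2, positions):
    global trace_count
    trace_count += 1  # Python side effect: runs at trace time, not on cached calls
    return arr1.at[positions, :, :].set(arr2[positions, :, :])

T, C, D = 64, 4, 8  # smaller than the sizes in the post, to keep the demo quick
arr1 = jnp.zeros((T, C, D))
arr2 = jnp.ones((T, C, D))

update_positions(arr1, arr2, jnp.arange(10))  # first call: traces and compiles
update_positions(arr1, arr2, jnp.arange(10))  # same shapes: reuses the cached version
update_positions(arr1, arr2, jnp.arange(12))  # new positions shape: traces again
print(trace_count)
```

Each distinct length of positions adds one more trace, which is why fixing the length (for example by padding) keeps the number of compilations bounded.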

Tried solution

Ideally, we want to keep the number of compilations to a bare minimum. One solution is to pad the positions array to the next power of 2 and make the updates using the padded array. The problem is that the padding operation itself triggers the same recompilation. So we either pre-cache the padded positions arrays, or pre-cache the original function using some dummy data.

Expectation

The above solution is more of a hack than a proper solution. Ideally, we should have a full-length array of zeros into which we scatter the one-hot encoded vectors of the given positions, and then use this fixed-size array to make the updates. Something like this:

positions = jnp.arange(10)
ohe_positions = jax.nn.one_hot(positions, T)
zeros_array = jnp.zeros_like(arr1)

# scatter the one-hot positions into the zeros array
# (scatter is a placeholder here, not an existing JAX function)
mask = scatter(zeros_array, ohe_positions)

# make updates
arr1 = arr1 + arr2 * mask

But I couldn't find an easy way to do this, and any help would be much appreciated.
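For reference, one way the mask idea might be sketched is shown here. It assumes positions has already been padded to a fixed length with an out-of-range index (here T), and it relies on `jax.nn.one_hot` encoding out-of-range indices as all-zero rows. The name `masked_update`, the padded length of 16, and the small array sizes are hypothetical. It also uses `jnp.where` to overwrite rather than `arr1 + arr2 * mask`, since the additive form matches a set only when arr1 is zero at the selected positions.

```python
import jax
import jax.numpy as jnp

@jax.jit
def masked_update(arr1, arr2, positions):
    T = arr1.shape[0]
    # One row per position; out-of-range entries become all-zero rows
    ohe = jax.nn.one_hot(positions, T, dtype=jnp.int32)  # shape (P, T)
    # Collapse to a single boolean mask over the time dimension
    mask = ohe.sum(axis=0) > 0                           # shape (T,)
    # Take arr2 where the mask is set, otherwise keep arr1
    return jnp.where(mask[:, None, None], arr2, arr1)

T, C, D = 64, 4, 8  # illustrative sizes, smaller than in the post
arr1 = jnp.zeros((T, C, D))
arr2 = jnp.ones((T, C, D))

positions = jnp.array([3, 7, 11])
# Pad to a fixed length of 16 with T, which is out of range for one_hot
padded = jnp.pad(positions, (0, 16 - positions.shape[0]), constant_values=T)
out = masked_update(arr1, arr2, padded)
```

The intermediate one-hot array has shape (P, T), so this trades extra memory and compute for a fixed-shape computation.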

Answered by jakevdp

jakevdp · 2024-06-10T20:16:47Z

Maintainer

I would do this by padding the positions with out-of-bounds indices, and then using mode='drop' to ignore them within the set() operation. Something like this:

@jax.jit
def update_positions(arr1, arr2, positions):
    return arr1.at[positions, :, :].set(arr2[positions, :, :], mode='drop')

size = 16
positions_padded = jnp.pad(positions, (0, size - len(positions)), constant_values=arr1.shape[0])

# Update the array based on positions
result1 = update_positions(arr1, arr2, positions)
result2 = update_positions(arr1, arr2, positions_padded)

np.testing.assert_array_equal(result1, result2)
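As a rough sketch of how this might be combined with the power-of-two padding mentioned in the question, the helper below pads on the host with NumPy so that many different lengths share one compiled version. `pad_to_bucket` is a hypothetical helper, not a JAX function, and the array sizes are reduced for illustration.

```python
import jax
import jax.numpy as jnp
import numpy as np

@jax.jit
def update_positions(arr1, arr2, positions):
    # Out-of-bounds indices are skipped by the scatter because of mode='drop'
    return arr1.at[positions, :, :].set(arr2[positions, :, :], mode='drop')

def pad_to_bucket(positions, fill_value):
    """Hypothetical helper: pad to the next power of two with fill_value."""
    positions = np.asarray(positions, dtype=np.int32)
    n = positions.shape[0]
    size = 1 if n == 0 else 1 << (n - 1).bit_length()
    return np.pad(positions, (0, size - n), constant_values=fill_value)

T, C, D = 64, 4, 8  # illustrative sizes, smaller than in the post
arr1 = jnp.zeros((T, C, D))
arr2 = jnp.ones((T, C, D))

results = {}
for n in (9, 12, 16):
    # All three lengths pad to 16, so they share one compiled version
    padded = pad_to_bucket(np.arange(n), fill_value=T)
    results[n] = update_positions(arr1, arr2, padded)
```

With power-of-two buckets, the number of distinct shapes seen by the jitted function grows only logarithmically with the maximum number of positions.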

1 reply

AakashKumarNain Jun 11, 2024
Author

Thanks @jakevdp. Padding outside of jit would be costly, but it's negligible compared to the other operations.
