There are multiple arrays generated by individual processes that I need to combine into a single array. So my question is: in NumPy there is a considerable difference in performance between the different ways of doing this, and I want to know whether that's also true in JAX.
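For context, the NumPy comparison this presumably refers to (a sketch only; the original post doesn't show the benchmark, and the helper names below are just for illustration) is np.concatenate versus preallocating the output and filling it with slice assignment:

import numpy as np

arrays = [np.arange(i, 2 * i) for i in range(10, 100)]

def concat(arrays):
    # One call: NumPy allocates the output and copies each input once.
    return np.concatenate(arrays)

def prealloc(arrays):
    # Preallocate the output, then copy each array into its slice.
    out = np.empty(sum(len(a) for a in arrays), dtype=arrays[0].dtype)
    start = 0
    for a in arrays:
        out[start:start + len(a)] = a
        start += len(a)
    return out

print(np.array_equal(concat(arrays), prealloc(arrays)))
# True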
Replies: 1 comment
Great question! The general advice for something like this is that, under JIT, what you do at the high level shouldn't matter: the XLA compiler should be able to find the optimal route to computing what your code expresses. That's the ideal, but in practice you can sometimes improve things by choosing a different high-level approach. In cases like this, micro-benchmarks can be revealing, and it looks like a simple lax.concatenate, besides being shorter and easier to read, is 40-50% faster than a loop over index updates:
import jax.numpy as jnp
from jax import jit, lax

arrays = [jnp.arange(i, 2 * i) for i in range(10, 100)]

@jit
def f1(*arrays):
    return lax.concatenate(arrays, 0)

@jit
def f2(*arrays):
    size = sum(len(arr) for arr in arrays)
    out = jnp.zeros(size, arrays[0].dtype)
    start = 0
    end = 0
    for arr in arrays:
        end += len(arr)
        out = out.at[start:end].set(arr)
        start = end
    return out

print(jnp.allclose(f1(*arrays), f2(*arrays)))
# True

%timeit f1(*arrays).block_until_ready()
# 10000 loops, best of 5: 152 µs per loop

%timeit f2(*arrays).block_until_ready()
# 1000 loops, best of 5: 275 µs per loop
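If you're curious where the difference comes from, one way to see it (not part of the original reply; a sketch using jax.make_jaxpr on un-jitted copies of the two functions, here named g1 and g2) is to look at what each version traces to: the concatenate version is a single primitive, while the loop unrolls into one index-update primitive per input array, all of which XLA then has to fuse or schedule:

import jax
import jax.numpy as jnp
from jax import lax

arrays = [jnp.arange(i, 2 * i) for i in range(10, 100)]

# Un-jitted copies of f1/f2 so make_jaxpr shows the primitives directly.
def g1(*arrays):
    return lax.concatenate(arrays, 0)

def g2(*arrays):
    out = jnp.zeros(sum(len(a) for a in arrays), arrays[0].dtype)
    start = 0
    for a in arrays:
        out = out.at[start:start + len(a)].set(a)
        start += len(a)
    return out

print(jax.make_jaxpr(g1)(*arrays))  # a single concatenate primitive
print(jax.make_jaxpr(g2)(*arrays))  # one update primitive (scatter/dynamic_update_slice) per array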
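As a side note (not from the original reply): the NumPy-style spelling jnp.concatenate works here as well, so NumPy code written that way should port over essentially unchanged; as far as I know it is implemented in terms of lax.concatenate. A minimal sketch:

import jax.numpy as jnp
from jax import jit, lax

arrays = [jnp.arange(i, 2 * i) for i in range(10, 100)]

@jit
def f3(*arrays):
    # NumPy-style API for the same operation as lax.concatenate(arrays, 0).
    return jnp.concatenate(arrays)

print(jnp.allclose(f3(*arrays), lax.concatenate(arrays, 0)))
# True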