Skip to content

b6513

Choose a tag to compare

@github-actions github-actions released this 18 Sep 19:18
38dbdf4
CUDA: Optimize PAD_REFLECT_1D (#15957)

* CUDA: Optimize PAD_REFLECT_1D
feat: add more test cases for PAD_REFLECT_1D

* use fast_div to improve performance

* Apply suggestion from JohannesGaessler

Co-authored-by: Johannes Gäßler <[email protected]>

* Apply suggestion from JohannesGaessler

Co-authored-by: Johannes Gäßler <[email protected]>

* optimize

* use a concise expression to further speedup the cuda kernel

---------

Co-authored-by: Johannes Gäßler <[email protected]>