feat: add xattention_prefill_reshape_cache kernel. by zhang-minchao · Pull Request #963 · jd-opensource/xllm

zhang-minchao · 2026-02-28T10:13:51Z

No description provided.

gemini-code-assist

Code Review

This pull request introduces a new CUDA kernel for prefill_reshape_and_cache, replacing a simpler CPU-based implementation. The new kernel is optimized for performance using vectorized memory access. The PR also includes comprehensive unit tests that cover various tensor layouts, and the necessary build system updates. My review identified one critical issue where input tensor validation is performed after slicing, potentially leading to out-of-bounds memory access. I have provided a code suggestion to correct this validation logic. Once this issue is addressed, the changes look good.

xllm/core/kernels/cuda/xattention/prefill_reshape_and_cache.cu

DragonFive

LGTM

xllm/core/kernels/cuda/xattention/prefill_reshape_and_cache.cu

zhang-minchao requested review from DongheJin, JimHsiung, RobbieLeung, XuZhang99, liutongxuan, walsonyang and yq33victor as code owners February 28, 2026 10:13

zhang-minchao requested a review from DragonFive February 28, 2026 10:14

gemini-code-assist bot reviewed Feb 28, 2026

View reviewed changes

xllm/core/kernels/cuda/xattention/prefill_reshape_and_cache.cu Outdated Show resolved Hide resolved

zhang-minchao force-pushed the feat/xattention_prefill_reshape_cache branch 2 times, most recently from 6b96a04 to ff93c78 Compare February 28, 2026 10:23

zhang-minchao requested a review from LMX-xin February 28, 2026 10:24

DragonFive previously approved these changes Feb 28, 2026

View reviewed changes

XuZhang99 reviewed Feb 28, 2026

View reviewed changes

xllm/core/kernels/cuda/xattention/prefill_reshape_and_cache.cu Outdated Show resolved Hide resolved

feat: add xattention_prefill_reshape_cache kernel.

482c8fc

zhang-minchao dismissed DragonFive’s stale review via 482c8fc February 28, 2026 13:22

zhang-minchao force-pushed the feat/xattention_prefill_reshape_cache branch from ff93c78 to 482c8fc Compare February 28, 2026 13:22

XuZhang99 approved these changes Feb 28, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add xattention_prefill_reshape_cache kernel.#963

feat: add xattention_prefill_reshape_cache kernel.#963
zhang-minchao wants to merge 1 commit intojd-opensource:mainfrom
zhang-minchao:feat/xattention_prefill_reshape_cache

zhang-minchao commented Feb 28, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

DragonFive left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

zhang-minchao commented Feb 28, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

DragonFive left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants