[pull] main from NVIDIA:main #33

Merged
pull[bot] merged 2 commits into LarryXFly:main from NVIDIA:main
Apr 15, 2025

pull[bot] commented Apr 15, 2025

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.1)

Can you help keep this open source service alive? 💖 Please sponsor : )

LinPoly and others added 2 commits April 16, 2025 00:11
…#3407)

Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
* fix: disable KV cache reuse if using attention sink

Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>

* fix: disable KV cache reuse if sink bubble

Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>

* add comment

Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>

---------

Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>
pull[bot] added the ⤵️ pull label Apr 15, 2025
pull[bot] merged commit fffb403 into LarryXFly:main Apr 15, 2025