[`CB`] Refactors the way we access paged #41370

ArthurZucker · 2025-10-06T13:18:41Z

What does this PR do?

The user chooses the attention function, and CB (continuous batching) decides by itself which "interface" wrapper to use. This should make things easier.
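To make the idea concrete, here is a minimal sketch of how a user-chosen attention function could be mapped to its paged wrapper. It is not the transformers implementation: `PAGED_PREFIX`, `REGISTRY`, and `resolve_paged_key` are hypothetical names, and the registry values are placeholder strings standing in for attention callables. Only the `paged|<name>` key convention is taken from this PR.

```python
# Illustrative sketch only: these names are hypothetical, not the transformers API.
PAGED_PREFIX = "paged|"

# Stand-in registry; in practice the values would be attention callables.
REGISTRY = {
    "sdpa": "sdpa_forward",
    "eager": "eager_forward",
    "paged|sdpa": "sdpa_paged_forward",
    "paged|eager": "eager_paged_forward",
}


def resolve_paged_key(attn_implementation: str, registry: dict = REGISTRY) -> str:
    """Return the registry key of the paged wrapper for a user-chosen function."""
    # Accept a value that is already in paged form, so the call is idempotent.
    if attn_implementation.startswith(PAGED_PREFIX):
        key = attn_implementation
    else:
        key = PAGED_PREFIX + attn_implementation

    # Fail loudly when no paged wrapper is registered for this function.
    if key not in registry:
        supported = sorted(k for k in registry if k.startswith(PAGED_PREFIX))
        raise ValueError(
            f"No paged wrapper for {attn_implementation!r}; supported: {supported}"
        )
    return key
```

With this shape, the user only ever names the base function (for example `"sdpa"`), and the batching layer derives the paged key on its own.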

HuggingFaceDocBuilderDev · 2025-10-06T13:28:02Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

remi-or

LGTM! Not sure we can run slow tests for this, but it's worth making sure the example still works with `paged|eager`, `paged|sdpa`, and `paged|flash_attention_2`. `kernels` is not available on AMD, so we need the classic FA package.
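One way to run that check without failing on machines that lack flash attention is to filter the list of implementations first. This is a rough sketch under two assumptions: the classic FA package is importable as `flash_attn`, and `runnable_implementations` is a hypothetical helper, not part of the test suite in this PR.

```python
import importlib.util

# Implementation names as spelled in the review comment above.
PAGED_IMPLEMENTATIONS = ["paged|eager", "paged|sdpa", "paged|flash_attention_2"]


def runnable_implementations(candidates=PAGED_IMPLEMENTATIONS):
    """Drop flash-attention variants when the optional package is missing."""
    # Assumption: the classic FA package exposes the top-level module `flash_attn`.
    has_flash_attn = importlib.util.find_spec("flash_attn") is not None
    return [
        name
        for name in candidates
        if "flash_attention" not in name or has_flash_attn
    ]
```

The returned list can then drive a loop or a parametrized test that runs the example once per implementation.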

remi-or · 2025-10-06T13:41:47Z

src/transformers/modeling_utils.py

-        "eager_paged": eager_paged_attention_forward,
+        "paged|flash_attention2": paged_attention_forward,
+        "paged|sdpa": sdpa_attention_paged_forward,
+        "paged|eager": eager_paged_attention_forward,


Should `paged|flex_attention` be an option as well? I see it listed below in the tests.

ArthurZucker

Not supported yet AFAIK.

remi-or

Ok, good to know 👍 ty

tests/generation/test_continuous_batching.py

src/transformers/generation/continuous_batching/continuous_api.py

tests/generation/test_paged_attention.py

ArthurZucker added 2 commits October 6, 2025 14:30

up

eedf916

refactor the way we handle paged attention

4e09043

ArthurZucker requested review from remi-or and McPatate October 6, 2025 13:27

affect serve as well

c44815b

remi-or approved these changes Oct 6, 2025

ArthurZucker added 3 commits October 6, 2025 16:43

update

7ed8e2b

fix

b3071c6

cup

d391395

ArthurZucker merged commit 0395ed5 into main Oct 6, 2025
26 checks passed

ArthurZucker deleted the fix-kernels-cb branch October 6, 2025 15:55
