Add AITER attention backend #12549
Conversation
Force-pushed from 7482105 to 89903c3
Thanks for this PR! Pardon my unwisdom, but for AMD devices, does this string not change? 👀
Very cool PR!
| attention family | main feature |
|---|---|
| FlashAttention | minimizes memory reads/writes through tiling and recomputation |
| AI Tensor Engine for ROCm | FlashAttention implementation optimized for AMD ROCm accelerators |
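For context on how the table's two rows surface to users, here is a minimal sketch of switching between them at runtime. It assumes the `attention_backend` context manager exported by recent diffusers releases and that this PR registers its backend under the key `"aiter"` (the actual registry key may differ):

```python
import torch
from diffusers import FluxPipeline, attention_backend

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
).to("cuda")  # PyTorch exposes ROCm devices through the "cuda" device string

prompt = "A photo of a lighthouse at dusk"

# FlashAttention: minimizes memory reads/writes via tiling and recomputation.
with attention_backend("flash"):
    image = pipe(prompt).images[0]

# AITER: FlashAttention implementation optimized for ROCm accelerators.
# The "aiter" key is an assumption about this PR's registry name.
with attention_backend("aiter"):
    image = pipe(prompt).images[0]
```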
That's a great project and would also make for a good follow-up, though perhaps best handled via a separate issue/PR? If I understand correctly, the kernel would first need to make it into `kernels` before it can be integrated into diffusers.
100% not related.
Existing PyTorch code that uses …

Anecdotally, over the last months running …
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
@bot /style
Style bot fixed some files and pushed the changes.
Let's go! Thanks a lot for adding this! |
What does this PR do?
AITER is AMD’s centralized repository for high-performance AI operators, such as attention kernels, for AMD ROCm-enabled accelerators. This PR adds support for FlashAttention through AITER by introducing a new attention backend.
We are interested in following up on this PR by eventually also enabling AITER backend support for context parallelism across multiple devices as the feature matures.

Test code for Flux inference is below; it requires installation of `aiter>=0.15.0` and a supported ROCm-enabled accelerator.
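The PR's exact snippet isn't reproduced here, so the following is a hedged sketch of such a test, assuming the backend is registered under the name `"aiter"` and selected via `set_attention_backend` (both names may differ from the merged code):

```python
# Minimal sketch of the Flux inference test described above (not the PR's
# exact script). Assumes aiter>=0.15.0 on a ROCm-enabled accelerator and
# that the new backend is registered as "aiter" (the key may differ).
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev", torch_dtype=torch.bfloat16
)
pipe.to("cuda")

# Route the transformer's attention through the AITER FlashAttention backend.
pipe.transformer.set_attention_backend("aiter")

image = pipe(
    "A cat holding a sign that says hello world",
    num_inference_steps=28,
    guidance_scale=3.5,
).images[0]
image.save("flux_aiter.png")
```

Reverting to the default dispatch should just be a matter of calling `set_attention_backend("native")`, again assuming the standard dispatcher keys.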
Before submitting
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
cc: @sayakpaul @DN6 for review and any comments