Conversation

@copybara-service

This change introduces a pure JAX implementation of flash attention to MaxText, designed as a drop-in replacement for the existing Pallas kernel. In this CL we set the stage by integrating it with MaxText in FSDP mode. Further optimizations to close the gap with Pallas are planned, using techniques such as iteration skipping, must_fuse, and memory space coloring.
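
The kernel source is not reproduced in this thread, but the core idea of a pure JAX flash attention is a scan over key/value blocks with an online softmax. The sketch below is a minimal single-head illustration under assumed shapes and no masking; the actual jax_flash_attention.py will differ (multi-head, masking, sharding, and the planned optimizations above).

```python
import jax
import jax.numpy as jnp

def flash_attention(q, k, v, block_size=128):
    """Block-wise attention with an online softmax, scanned over KV blocks."""
    seq_len, head_dim = q.shape
    scale = head_dim ** -0.5
    # Illustrative assumption: seq_len is divisible by block_size.
    k_blocks = k.reshape(-1, block_size, head_dim)
    v_blocks = v.reshape(-1, block_size, head_dim)

    def body(carry, kv_block):
        acc, row_max, row_sum = carry
        k_blk, v_blk = kv_block
        s = (q @ k_blk.T) * scale                      # [seq_len, block_size]
        new_max = jnp.maximum(row_max, s.max(axis=-1))
        p = jnp.exp(s - new_max[:, None])
        correction = jnp.exp(row_max - new_max)        # rescale old partials
        acc = acc * correction[:, None] + p @ v_blk
        row_sum = row_sum * correction + p.sum(axis=-1)
        return (acc, new_max, row_sum), None

    init = (
        jnp.zeros((seq_len, head_dim), q.dtype),       # unnormalized output
        jnp.full((seq_len,), -jnp.inf, q.dtype),       # running row max
        jnp.zeros((seq_len,), q.dtype),                # running softmax denom
    )
    (acc, _, row_sum), _ = jax.lax.scan(body, init, (k_blocks, v_blocks))
    return acc / row_sum[:, None]
```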

The new implementation is located in maxtext/src/maxtext/kernels/jax_flash_attention.py and can be enabled with the use_jax_splash config flag.
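
Only the use_jax_splash flag name comes from this change; how MaxText wires it into the attention layer is not shown in this thread. A hypothetical dispatch (config class and function names are illustrative) might look like:

```python
import jax
import jax.numpy as jnp
from dataclasses import dataclass

@dataclass
class AttentionConfig:
    # use_jax_splash is the real flag name; this config class is hypothetical.
    use_jax_splash: bool = False

def dense_attention(q, k, v):
    # Stand-in for the existing (Pallas-backed) attention path.
    logits = (q @ k.T) * (q.shape[-1] ** -0.5)
    return jax.nn.softmax(logits, axis=-1) @ v

def select_attention(config: AttentionConfig):
    if config.use_jax_splash:
        return flash_attention  # pure JAX kernel, as sketched above
    return dense_attention
```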

To validate the implementation and compare it against the Tokamax kernel and the baseline dot-product attention, this change also introduces:

- A new test suite in google_mla_attention_test.py for correctness and performance comparison, particularly for FSDP cases (a minimal correctness check is sketched after this list).
- Refactored common MLA test utilities into attention_test_util.py.
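
The test files themselves are not reproduced here; a minimal sketch of the kind of correctness check such a suite might contain, comparing the pure JAX kernel against the dense baseline (reusing the hypothetical helpers from the sketches above), is:

```python
import jax
import jax.numpy as jnp

def test_jax_splash_matches_reference():
    key = jax.random.PRNGKey(0)
    qk, kk, vk = jax.random.split(key, 3)
    q = jax.random.normal(qk, (512, 64), jnp.float32)
    k = jax.random.normal(kk, (512, 64), jnp.float32)
    v = jax.random.normal(vk, (512, 64), jnp.float32)
    expected = dense_attention(q, k, v)               # dense baseline above
    actual = flash_attention(q, k, v, block_size=128) # blocked kernel above
    # Loose fp32 tolerance: blocked accumulation reorders the reduction.
    assert jnp.allclose(actual, expected, atol=1e-4, rtol=1e-4)
```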


PiperOrigin-RevId: 834764107