forked from pytorch/pytorch
Open
Add torch.backends.cuda.math_sdp.fp32_precision #2844
anatoliylitv wants to merge 11 commits into rocm7.1_internal_testing from anatoliylitv/rocm_7_1_internal_testing_Add-torch.backends.cuda.math_sdp.fp32_precision
Conversation
Jenkins build for f1fbfb6a324c9faefb86149ad2a8aeeae0c88088 commit finished as FAILURE
Jenkins build for b27caaba15c43257f8170853c3835876718ca5cb commit finished as FAILURE
Jenkins build for b5744a1c7d6873109d3f1b7f17e4165a53d60ba6 commit finished as FAILURE
Overview
This PR adds a new float32 precision API, torch.backends.cuda.math_sdp.fp32_precision, to configure the fp32 precision behavior of SDPBackend.MATH.
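For illustration, a minimal sketch of how the new setting might be used, assuming fp32_precision is exposed as a plain settable string attribute and accepts the same values ("ieee", "tf32") as the existing fp32 precision controls:

```python
import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel

# Proposed knob from this PR: force the math backend to do its fp32 matmuls
# with IEEE accumulation instead of TF32 (accepted values are an assumption).
torch.backends.cuda.math_sdp.fp32_precision = "ieee"

q = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.float32)
k = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.float32)
v = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.float32)

# Restrict SDPA to the math backend so the new setting is what governs precision.
with sdpa_kernel(SDPBackend.MATH):
    out = F.scaled_dot_product_attention(q, k, v)
```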
Rationale
The test/test_transformers.py test suite derives its numerical tolerances by comparing output tensors computed at the test's own precision ("reference") and at a higher precision ("golden"), both produced by SDPBackend.MATH. However, the golden output is currently computed with TF32 rather than FP32, which makes it less accurate than the FA/ME backends when they use IEEE rather than TF32 for their accumulation.
This loss of precision causes false negatives in SDPA tests such as
TestSDPACudaOnlyCUDA.test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16,
at least on the ROCm platform. The false negative disappears after forcing
higher_precision_dtype = torch.float64
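Conceptually, the golden/reference comparison works along these lines (an illustrative sketch, not the actual test/test_transformers.py code; the helper name and signature here are assumptions):

```python
import torch
import torch.nn.functional as F

def golden_and_reference(q, k, v, higher_precision_dtype=torch.float32):
    # "Golden": the same math-backend computation at a higher precision.
    # If fp32 math silently falls back to TF32, this golden tensor is less
    # accurate than intended, and tolerances derived from it misjudge the
    # more accurate IEEE-accumulating FA/ME outputs, producing false negatives.
    golden = F.scaled_dot_product_attention(
        q.to(higher_precision_dtype),
        k.to(higher_precision_dtype),
        v.to(higher_precision_dtype),
    )
    # "Reference": the same computation at the test's working precision.
    reference = F.scaled_dot_product_attention(q, k, v)
    return golden, reference

# Tolerances are derived from the deviation between reference and golden,
# so the golden tensor must genuinely be computed at the higher precision.
```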
Major Changes
To restore the precision of the golden output, a new API, torch.backends.cuda.math_sdp.fp32_precision, is introduced, which allows configuration of the "matmul" precision used by SDPBackend.MATH, and a new decorator, @math_sdp_precision("ieee"), is added to all tests that use check_out_and_grad. Finally, an assert is added to the innermost function _check_equal as a sanity check to ensure math_sdp has the right precision configured for torch.float32 golden tensors.
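A minimal sketch of what such a decorator and sanity check could look like (the decorator and function names come from the description above; the bodies are assumptions, not the PR's actual implementation):

```python
import functools
import torch

def math_sdp_precision(precision):
    """Run a test with torch.backends.cuda.math_sdp.fp32_precision set to
    `precision`, restoring the previous value afterwards (illustrative sketch)."""
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            prev = torch.backends.cuda.math_sdp.fp32_precision
            torch.backends.cuda.math_sdp.fp32_precision = precision
            try:
                return fn(*args, **kwargs)
            finally:
                torch.backends.cuda.math_sdp.fp32_precision = prev
        return wrapper
    return decorator

# Sanity check of the kind added to _check_equal: an fp32 golden tensor is only
# meaningful if the math backend is actually configured for IEEE accumulation.
def _assert_golden_precision(golden):
    if golden.dtype == torch.float32:
        assert torch.backends.cuda.math_sdp.fp32_precision == "ieee"
```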
Known Issues
The backward pass honors the precision configuration in effect when backward() is called, regardless of the configuration that was in effect when the graph was created.
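A hedged illustration of that behavior (assuming "ieee" and "tf32" are the accepted values of the new setting):

```python
import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel

q = torch.randn(2, 8, 64, 32, device="cuda", requires_grad=True)
k = torch.randn(2, 8, 64, 32, device="cuda", requires_grad=True)
v = torch.randn(2, 8, 64, 32, device="cuda", requires_grad=True)

# Forward pass recorded while the math backend is set to IEEE...
torch.backends.cuda.math_sdp.fp32_precision = "ieee"
with sdpa_kernel(SDPBackend.MATH):
    out = F.scaled_dot_product_attention(q, k, v)

# ...but if the setting changes before backward(), the backward matmuls follow
# the *current* setting ("tf32" here), not the one used to build the graph.
torch.backends.cuda.math_sdp.fp32_precision = "tf32"
out.sum().backward()
```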