Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/1982
Pull Request resolved: #4963
This change covers the case where the bottom-right mask is not used.
Performance should be slightly better in that case.
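For readers unfamiliar with the term, the sketch below contrasts a bottom-right aligned causal mask with a top-left aligned one. It is only an illustration: the helper names `allowed_top_left` and `allowed_bottom_right` are hypothetical and do not appear in the kernel, and the summary does not say whether "not using bottom right mask" means a top-left causal mask or some other configuration, so treating top-left as the alternative is an assumption.

```cpp
// Illustrative sketch only: these helpers are hypothetical and are not part of
// FBGEMM or CUTLASS. They show the usual definitions of the two causal-mask
// alignments, assuming 0-based query index q and key index k.
#include <cstdint>

// Top-left aligned: query row q may attend to key column k when k <= q.
constexpr bool allowed_top_left(std::int64_t q, std::int64_t k) {
  return k <= q;
}

// Bottom-right aligned: the diagonal is shifted by (seqlen_k - seqlen_q) so the
// last query row lines up with the last key column.
constexpr bool allowed_bottom_right(std::int64_t q, std::int64_t k,
                                    std::int64_t seqlen_q,
                                    std::int64_t seqlen_k) {
  return k <= q + (seqlen_k - seqlen_q);
}

// With seqlen_q = 2 and seqlen_k = 4 the two conventions disagree at (q=0, k=2).
static_assert(!allowed_top_left(0, 2), "top-left masks out k=2 for q=0");
static_assert(allowed_bottom_right(0, 2, 2, 4), "bottom-right keeps k=2 for q=0");

// With equal sequence lengths the offset is zero and both conventions agree.
static_assert(allowed_top_left(1, 1) == allowed_bottom_right(1, 1, 3, 3),
              "alignments coincide when seqlen_q == seqlen_k");
```

Under these definitions the bottom-right form needs the extra `seqlen_k - seqlen_q` offset, which may be one reason a path without it can be slightly cheaper, though the commit itself does not state where the gain comes from.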
# notes
The backward pass is not stable in general: it can occasionally hit an illegal memory access (IMA), and its numerics are not yet as good as we want them to be.
Reviewed By: q10
Differential Revision: D83076701
fbshipit-source-id: a1016b15a86d10f21d962166eae036d959befe18
fbgemm_gpu/experimental/gen_ai/src/attention/cuda/cutlass_blackwell_fmha/kernel/sm100_fmha_bwd_kernel_tma_warpspecialized.hpp