dev-util/Tensile: fix compilation of sci-libs/rocBLAS on gfx906
Clang-20 disallowed op_sel in some VOP3P dot instructions.
See: llvm/llvm-project#100485
As ROCm maintains a fork of Clang, these changes did not reach official ROCm releases.
However, Gentoo uses upstream Clang 20, which includes these incompatible changes.
Luckily, in Tensile these op_sel modifiers do nothing. In general, they allow shuffling vector elements before multiplication, but with the values 0,0/1,1 shuffling is disabled, so op_sel can be removed.
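As an illustration only (this is not the actual Tensile patch), the removal can be sketched as a small text filter over a generated assembly line. The sketch assumes that "0,0/1,1" refers to the modifiers `op_sel:[0,0]` and `op_sel_hi:[1,1]`; the helper name and the register operands in the sample instruction are hypothetical.

```python
import re

# Hypothetical helper, not the real Tensile change: drop the op_sel / op_sel_hi
# modifiers only when they carry the no-shuffle values assumed above.
_NO_SHUFFLE_MODIFIERS = re.compile(r"\s+op_sel(?:_hi:\[1,1\]|:\[0,0\])")


def strip_default_op_sel(instruction: str) -> str:
    """Return the instruction text without no-shuffle op_sel modifiers."""
    return _NO_SHUFFLE_MODIFIERS.sub("", instruction)


# Illustrative instruction text; the operands are made up for this example.
before = "v_dot2_f32_f16 v0, v1, v2, v0 op_sel:[0,0] op_sel_hi:[1,1]"
after = strip_default_op_sel(before)
```

Modifiers with any other values are left untouched, since those would actually change which vector elements are multiplied.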
Closes: https://bugs.gentoo.org/949817
Signed-off-by: Sv. Lockal <[email protected]>