You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
BUG: Workaround for Intel Compiler mask conversion bug
Encountered a specific bug with Intel Compiler when `_cvtmask64_u64` comparisons against `-1`
trigger erroneous optimizations. This bug affects equality (`==`) and inequality (`!=`)
comparisons crucial for operations like `np.logical_or`.
The issue originates from the compiler’s optimizer, which mistakenly duplicates the
last vector comparison instruction (targeting `zmm`) onto `ymm`.
It then incorrectly performs a bitwise XOR between the masks from the duplicate and original
instructions, leading to wrong results.
This parch implements a workaround to bypass this behavior.
0 commit comments