[X86][AVX10] Remove EVEX512 and AVX10-256 implementations #157034

phoebewang · 2025-09-05T07:26:24Z

The 256-bit maximum vector register size control was removed from AVX10 whitepaper, ref: https://cdrdv2.intel.com/v1/dl/getContent/784343

We have warned these options in LLVM21 through #132542. This patch removes underlying implementations in LLVM22.

The 256-bit maximum vector register size control was removed from AVX10 whitepaper, ref: https://cdrdv2.intel.com/v1/dl/getContent/784343 We have warned options in LLVM21 through llvm#28154. We are removing underlying implementations in LLVM22.

llvmbot · 2025-09-05T07:26:58Z

@llvm/pr-subscribers-flang-driver
@llvm/pr-subscribers-clang-codegen
@llvm/pr-subscribers-clang-driver
@llvm/pr-subscribers-llvm-transforms

@llvm/pr-subscribers-backend-x86

Author: Phoebe Wang (phoebewang)

Changes

The 256-bit maximum vector register size control was removed from AVX10 whitepaper, ref: https://cdrdv2.intel.com/v1/dl/getContent/784343

We have warned these options in LLVM21 through #28154. This patch removes underlying implementations in LLVM22.

Patch is 455.29 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/157034.diff

156 Files Affected:

(modified) clang/docs/ReleaseNotes.rst (+4)
(modified) clang/docs/UsersManual.rst (+6-51)
(modified) clang/include/clang/Basic/BuiltinsX86.td (+382-382)
(modified) clang/include/clang/Basic/BuiltinsX86_64.td (+15-15)
(modified) clang/include/clang/Driver/Options.td (+4-14)
(modified) clang/lib/Basic/Targets/X86.cpp (+14-83)
(modified) clang/lib/Basic/Targets/X86.h (-3)
(modified) clang/lib/CodeGen/Targets/X86.cpp (+2-20)
(modified) clang/lib/Driver/ToolChains/Arch/X86.cpp (-41)
(modified) clang/lib/Headers/amxavx512intrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2_512bf16intrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2_512convertintrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2_512niintrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2_512satcvtdsintrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2bf16intrin.h (+2-2)
(modified) clang/lib/Headers/avx10_2convertintrin.h (+2-2)
(modified) clang/lib/Headers/avx10_2copyintrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2niintrin.h (+2-2)
(modified) clang/lib/Headers/avx10_2satcvtdsintrin.h (+2-2)
(modified) clang/lib/Headers/avx2intrin.h (-9)
(modified) clang/lib/Headers/avx512bf16intrin.h (+3-4)
(modified) clang/lib/Headers/avx512bitalgintrin.h (+1-2)
(modified) clang/lib/Headers/avx512bwintrin.h (+3-4)
(modified) clang/lib/Headers/avx512cdintrin.h (+2-2)
(modified) clang/lib/Headers/avx512dqintrin.h (+4-3)
(modified) clang/lib/Headers/avx512fintrin.h (+6-5)
(modified) clang/lib/Headers/avx512fp16intrin.h (+4-6)
(modified) clang/lib/Headers/avx512ifmaintrin.h (+2-2)
(modified) clang/lib/Headers/avx512ifmavlintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vbmi2intrin.h (+3-2)
(modified) clang/lib/Headers/avx512vbmiintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vbmivlintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlbf16intrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlbitalgintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlbwintrin.h (+2-4)
(modified) clang/lib/Headers/avx512vlcdintrin.h (+2-4)
(modified) clang/lib/Headers/avx512vldqintrin.h (+2-4)
(modified) clang/lib/Headers/avx512vlfp16intrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlintrin.h (+2-4)
(modified) clang/lib/Headers/avx512vlvbmi2intrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlvnniintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlvp2intersectintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vnniintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vp2intersectintrin.h (+1-2)
(modified) clang/lib/Headers/avx512vpopcntdqintrin.h (+2-3)
(modified) clang/lib/Headers/avx512vpopcntdqvlintrin.h (+4-4)
(modified) clang/lib/Headers/avxintrin.h (-9)
(modified) clang/lib/Headers/emmintrin.h (-6)
(modified) clang/lib/Headers/fmaintrin.h (+2-2)
(modified) clang/lib/Headers/gfniintrin.h (+2-27)
(modified) clang/lib/Headers/mmintrin.h (+3-9)
(modified) clang/lib/Headers/movrs_avx10_2_512intrin.h (+2-2)
(modified) clang/lib/Headers/movrs_avx10_2intrin.h (+4-4)
(modified) clang/lib/Headers/pmmintrin.h (-6)
(modified) clang/lib/Headers/sm4evexintrin.h (+2-2)
(modified) clang/lib/Headers/smmintrin.h (-6)
(modified) clang/lib/Headers/tmmintrin.h (-6)
(modified) clang/lib/Headers/vaesintrin.h (+1-2)
(modified) clang/lib/Headers/xmmintrin.h (-9)
(modified) clang/test/CodeGen/X86/amx_avx512_api.c (+1-1)
(modified) clang/test/CodeGen/X86/amxavx512-builtins.c (+1-1)
(modified) clang/test/CodeGen/X86/avx10_2_512bf16-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512convert-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512minmax-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512minmax-error.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512ni-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512satcvt-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins-errors.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins-x64-error.c (+1-1)
(modified) clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins-x64.c (+1-1)
(modified) clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2bf16-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2convert-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2minmax-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2ni-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2satcvt-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2satcvtds-builtins-x64.c (+1-1)
(modified) clang/test/CodeGen/X86/avx10_2satcvtds-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx512-error.c (+4-14)
(modified) clang/test/CodeGen/X86/avx512copy-builtins.c (+1-1)
(modified) clang/test/CodeGen/X86/avx512vlbw-builtins.c (+4-4)
(modified) clang/test/CodeGen/X86/avxvnniint16-builtins.c (+4-4)
(modified) clang/test/CodeGen/X86/avxvnniint8-builtins.c (+4-4)
(modified) clang/test/CodeGen/X86/movrs-avx10.2-512-builtins-error-32.c (+1-1)
(modified) clang/test/CodeGen/X86/movrs-avx10.2-512-builtins.c (+1-1)
(modified) clang/test/CodeGen/X86/movrs-avx10.2-builtins-error-32.c (+1-1)
(modified) clang/test/CodeGen/X86/movrs-avx10.2-builtins.c (+1-1)
(modified) clang/test/CodeGen/X86/sm4-evex-builtins.c (+2-2)
(modified) clang/test/CodeGen/attr-cpuspecific.c (+1-1)
(modified) clang/test/CodeGen/attr-target-x86.c (+7-12)
(modified) clang/test/CodeGen/regcall2.c (+2-2)
(modified) clang/test/CodeGen/target-avx-abi-diag.c (+1-53)
(modified) clang/test/CodeGen/target-builtin-noerror.c (+2-4)
(modified) clang/test/Driver/x86-target-features.c (+8-34)
(modified) clang/test/Preprocessor/predefined-arch-macros-x86.c (-1)
(modified) clang/test/Preprocessor/predefined-arch-macros.c (-48)
(modified) clang/test/Preprocessor/x86_target_features.c (+14-59)
(modified) llvm/include/llvm/TargetParser/X86TargetParser.def (+2-2)
(modified) llvm/lib/IR/Verifier.cpp (-10)
(modified) llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp (-3)
(modified) llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp (-12)
(modified) llvm/lib/Target/X86/X86.td (+12-14)
(modified) llvm/lib/Target/X86/X86ISelLowering.cpp (+14-20)
(modified) llvm/lib/Target/X86/X86ISelLoweringCall.cpp (+2-2)
(modified) llvm/lib/Target/X86/X86InstrAMX.td (+7-7)
(modified) llvm/lib/Target/X86/X86InstrAVX10.td (+52-79)
(modified) llvm/lib/Target/X86/X86InstrAVX512.td (+21-21)
(modified) llvm/lib/Target/X86/X86InstrPredicates.td (-3)
(modified) llvm/lib/Target/X86/X86Subtarget.cpp (-18)
(modified) llvm/lib/Target/X86/X86Subtarget.h (+2-4)
(modified) llvm/lib/Target/X86/X86TargetTransformInfo.cpp (+2-2)
(modified) llvm/lib/TargetParser/Host.cpp (+2-8)
(modified) llvm/lib/TargetParser/X86TargetParser.cpp (+18-22)
(modified) llvm/test/CodeGen/X86/amx-across-func-tilemovrow.ll (+3-3)
(modified) llvm/test/CodeGen/X86/amx-avx512-intrinsics.ll (+1-1)
(modified) llvm/test/CodeGen/X86/amx-tile-avx512-internals.ll (+1-1)
(modified) llvm/test/CodeGen/X86/avx10.2-fma-commute.ll (+1-1)
(modified) llvm/test/CodeGen/X86/avx10_2-cmp.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512bf16-arith.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512bf16-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512convert-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512fptosi_satcvtds.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512minmax-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512ni-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512satcvt-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512satcvtds-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2bf16-arith.ll (+6-19)
(modified) llvm/test/CodeGen/X86/avx10_2bf16-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2convert-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2fptosi_satcvtds.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2minmax-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2ni-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2satcvt-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2satcvtds-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2satcvtds-x64-intrinsics.ll (+1-1)
(modified) llvm/test/CodeGen/X86/avx512bwvl-arith.ll (+14-28)
(modified) llvm/test/CodeGen/X86/avx512copy-intrinsics.ll (+1-1)
(modified) llvm/test/CodeGen/X86/avxvnniint16-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avxvnniint8-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/comi-flags.ll (+1-1)
(removed) llvm/test/CodeGen/X86/evex512-mem.ll (-29)
(modified) llvm/test/CodeGen/X86/fminimum-fmaximum.ll (+1-1)
(modified) llvm/test/CodeGen/X86/fminimumnum-fmaximumnum.ll (+1-1)
(modified) llvm/test/CodeGen/X86/movrs-avx10.2-512-intrinsics.ll (+1-1)
(modified) llvm/test/CodeGen/X86/movrs-avx10.2-intrinsics.ll (+1-1)
(removed) llvm/test/CodeGen/X86/pr90844.ll (-36)
(modified) llvm/test/CodeGen/X86/sm4-evex-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/vector-fshl-128.ll (+1-2)
(modified) llvm/test/CodeGen/X86/vector-fshl-256.ll (+89-140)
(modified) llvm/test/CodeGen/X86/vector-fshr-128.ll (+1-2)
(modified) llvm/test/CodeGen/X86/vector-fshr-256.ll (+68-104)
(modified) llvm/test/Instrumentation/MemorySanitizer/X86/avx10_2_512ni-intrinsics.ll (+1-1)
(modified) llvm/test/Instrumentation/MemorySanitizer/X86/avx10_2ni-intrinsics.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/X86/cost-conditional-branches.ll (+175-17)
(modified) llvm/test/Transforms/SLPVectorizer/X86/revec-getExtractWithExtendCost.ll (+1-1)
(modified) llvm/test/Transforms/SLPVectorizer/X86/revec-load-compress.ll (+1-1)

diff --git a/clang/docs/ReleaseNotes.rst b/clang/docs/ReleaseNotes.rst
index dd53b4d46f3cc..7b2a0bdfefa2b 100644
--- a/clang/docs/ReleaseNotes.rst
+++ b/clang/docs/ReleaseNotes.rst
@@ -373,6 +373,10 @@ X86 Support
 - NOTE: Please avoid use of the __builtin_ia32_* intrinsics - these are not
   guaranteed to exist in future releases, or match behaviour with previous
   releases of clang or other compilers.
+- Remove `m[no-]avx10.x-[256,512]` and `m[no-]evex512` options from Clang
+  driver.
+- Remove `[no-]evex512` feature request from intrinsics and builtins.
+- Change features `avx10.x-[256,512]` to `avx10.x`.
 
 Arm and AArch64 Support
 ^^^^^^^^^^^^^^^^^^^^^^^
diff --git a/clang/docs/UsersManual.rst b/clang/docs/UsersManual.rst
index 0e85c8109fd5e..a8bbf146431ea 100644
--- a/clang/docs/UsersManual.rst
+++ b/clang/docs/UsersManual.rst
@@ -4581,59 +4581,14 @@ implicitly included in later levels.
 - ``-march=x86-64-v3``: (close to Haswell) AVX, AVX2, BMI1, BMI2, F16C, FMA, LZCNT, MOVBE, XSAVE
 - ``-march=x86-64-v4``: AVX512F, AVX512BW, AVX512CD, AVX512DQ, AVX512VL
 
-`Intel AVX10 ISA <https://cdrdv2.intel.com/v1/dl/getContent/784267>`_ is
+`Intel AVX10 ISA <https://cdrdv2.intel.com/v1/dl/getContent/784343>`_ is
 a major new vector ISA incorporating the modern vectorization aspects of
 Intel AVX-512. This ISA will be supported on all future Intel processors.
-Users are supposed to use the new options ``-mavx10.N`` and ``-mavx10.N-512``
-on these processors and should not use traditional AVX512 options anymore.
-
-The ``N`` in ``-mavx10.N`` represents a continuous integer number starting
-from ``1``. ``-mavx10.N`` is an alias of ``-mavx10.N-256``, which means to
-enable all instructions within AVX10 version N at a maximum vector length of
-256 bits. ``-mavx10.N-512`` enables all instructions at a maximum vector
-length of 512 bits, which is a superset of instructions ``-mavx10.N`` enabled.
-
-Current binaries built with AVX512 features can run on Intel AVX10/512 capable
-processors without re-compile, but cannot run on AVX10/256 capable processors.
-Users need to re-compile their code with ``-mavx10.N``, and maybe update some
-code that calling to 512-bit X86 specific intrinsics and passing or returning
-512-bit vector types in function call, if they want to run on AVX10/256 capable
-processors. Binaries built with ``-mavx10.N`` can run on both AVX10/256 and
-AVX10/512 capable processors.
-
-Users can add a ``-mno-evex512`` in the command line with AVX512 options if
-they want to run the binary on both legacy AVX512 and new AVX10/256 capable
-processors. The option has the same constraints as ``-mavx10.N``, i.e.,
-cannot call to 512-bit X86 specific intrinsics and pass or return 512-bit vector
-types in function call.
-
-Users should avoid using AVX512 features in function target attributes when
-developing code for AVX10. If they have to do so, they need to add an explicit
-``evex512`` or ``no-evex512`` together with AVX512 features for 512-bit or
-non-512-bit functions respectively to avoid unexpected code generation. Both
-command line option and target attribute of EVEX512 feature can only be used
-with AVX512. They don't affect vector size of AVX10.
-
-User should not mix the use AVX10 and AVX512 options together at any time,
-because the option combinations are conflicting sometimes. For example, a
-combination of ``-mavx512f -mavx10.1-256`` doesn't show a clear intention to
-compiler, since instructions in AVX512F and AVX10.1/256 intersect but do not
-overlap. In this case, compiler will emit warning for it, but the behavior
-is determined. It will generate the same code as option ``-mavx10.1-512``.
-A similar case is ``-mavx512f -mavx10.2-256``, which equals to
-``-mavx10.1-512 -mavx10.2-256``, because ``avx10.2-256`` implies ``avx10.1-256``
-and ``-mavx512f -mavx10.1-256`` equals to ``-mavx10.1-512``.
-
-There are some new macros introduced with AVX10 support. ``-mavx10.1-256`` will
-enable ``__AVX10_1__`` and ``__EVEX256__``, while ``-mavx10.1-512`` enables
-``__AVX10_1__``, ``__EVEX256__``, ``__EVEX512__``  and ``__AVX10_1_512__``.
-Besides, both ``-mavx10.1-256`` and ``-mavx10.1-512`` will enable all AVX512
-feature specific macros. A AVX512 feature will enable both ``__EVEX256__``,
-``__EVEX512__`` and its own macro. So ``__EVEX512__`` can be used to guard code
-that can run on both legacy AVX512 and AVX10/512 capable processors but cannot
-run on AVX10/256, while a AVX512 macro like ``__AVX512F__`` cannot tell the
-difference among the three options. Users need to check additional macros
-``__AVX10_1__`` and ``__EVEX512__`` if they want to make distinction.
+Users are supposed to use the new options ``-mavx10.N`` on these processors
+and should not use traditional AVX512 options anymore. The ``N`` in
+``-mavx10.N`` represents a continuous integer number starting
+from ``1``. Current binaries built with AVX512 features can run on Intel AVX10
+capable processors without re-compile.
 
 ARM
 ^^^
diff --git a/clang/include/clang/Basic/BuiltinsX86.td b/clang/include/clang/Basic/BuiltinsX86.td
index acd8f70c4a5f2..cdd5d54a68d79 100644
--- a/clang/include/clang/Basic/BuiltinsX86.td
+++ b/clang/include/clang/Basic/BuiltinsX86.td
@@ -387,7 +387,7 @@ let Features = "vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] i
   def aesenc256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>)">;
 }
 
-let Features = "avx512f,evex512,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def aesenc512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>)">;
 }
 
@@ -395,7 +395,7 @@ let Features = "vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] i
   def aesenclast256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>)">;
 }
 
-let Features = "avx512f,evex512,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def aesenclast512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>)">;
 }
 
@@ -403,7 +403,7 @@ let Features = "vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] i
   def aesdec256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>)">;
 }
 
-let Features = "avx512f,evex512,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def aesdec512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>)">;
 }
 
@@ -411,7 +411,7 @@ let Features = "vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] i
   def aesdeclast256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>)">;
 }
 
-let Features = "avx512f,evex512,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def aesdeclast512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>)">;
 }
 
@@ -423,7 +423,7 @@ let Features = "avx,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<256
   def vgf2p8affineinvqb_v32qi : X86Builtin<"_Vector<32, char>(_Vector<32, char>, _Vector<32, char>, _Constant char)">;
 }
 
-let Features = "avx512f,evex512,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vgf2p8affineinvqb_v64qi : X86Builtin<"_Vector<64, char>(_Vector<64, char>, _Vector<64, char>, _Constant char)">;
 }
 
@@ -435,7 +435,7 @@ let Features = "avx,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<256
   def vgf2p8affineqb_v32qi : X86Builtin<"_Vector<32, char>(_Vector<32, char>, _Vector<32, char>, _Constant char)">;
 }
 
-let Features = "avx512f,evex512,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vgf2p8affineqb_v64qi : X86Builtin<"_Vector<64, char>(_Vector<64, char>, _Vector<64, char>, _Constant char)">;
 }
 
@@ -447,7 +447,7 @@ let Features = "avx,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<256
   def vgf2p8mulb_v32qi : X86Builtin<"_Vector<32, char>(_Vector<32, char>, _Vector<32, char>)">;
 }
 
-let Features = "avx512f,evex512,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vgf2p8mulb_v64qi : X86Builtin<"_Vector<64, char>(_Vector<64, char>, _Vector<64, char>)">;
 }
 
@@ -459,7 +459,7 @@ let Features = "vpclmulqdq", Attributes = [NoThrow, Const, RequiredVectorWidth<2
   def pclmulqdq256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>, _Constant char)">;
 }
 
-let Features = "avx512f,evex512,vpclmulqdq", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,vpclmulqdq", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def pclmulqdq512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>, _Constant char)">;
 }
 
@@ -830,7 +830,7 @@ let Features = "fma|fma4", Attributes = [NoThrow, Const, RequiredVectorWidth<256
   def vfmaddsubpd256 : X86Builtin<"_Vector<4, double>(_Vector<4, double>, _Vector<4, double>, _Vector<4, double>)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vfmaddpd512_mask : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Vector<8, double>, _Vector<8, double>, unsigned char, _Constant int)">;
   def vfmaddpd512_maskz : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Vector<8, double>, _Vector<8, double>, unsigned char, _Constant int)">;
   def vfmaddpd512_mask3 : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Vector<8, double>, _Vector<8, double>, unsigned char, _Constant int)">;
@@ -936,7 +936,7 @@ let Features = "pku", Attributes = [NoThrow] in {
   def wrpkru : X86Builtin<"void(unsigned int)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def sqrtpd512 : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Constant int)">;
   def sqrtps512 : X86Builtin<"_Vector<16, float>(_Vector<16, float>, _Constant int)">;
 }
@@ -946,7 +946,7 @@ let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<128>
   def rsqrt14ss_mask : X86Builtin<"_Vector<4, float>(_Vector<4, float>, _Vector<4, float>, _Vector<4, float>, unsigned char)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def rsqrt14pd512_mask : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Vector<8, double>, unsigned char)">;
   def rsqrt14ps512_mask : X86Builtin<"_Vector<16, float>(_Vector<16, float>, _Vector<16, float>, unsigned short)">;
 }
@@ -956,7 +956,7 @@ let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<128>
   def rcp14ss_mask : X86Builtin<"_Vector<4, float>(_Vector<4, float>, _Vector<4, float>, _Vector<4, float>, unsigned char)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def rcp14pd512_mask : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Vector<8, double>, unsigned char)">;
   def rcp14ps512_mask : X86Builtin<"_Vector<16, float>(_Vector<16, float>, _Vector<16, float>, unsigned short)">;
   def cvttps2dq512_mask : X86Builtin<"_Vector<16, int>(_Vector<16, float>, _Vector<16, int>, unsigned short, _Constant int)">;
@@ -974,7 +974,7 @@ let Features = "avx512vl", Attributes = [NoThrow, Const, RequiredVectorWidth<128
   def cmpps128_mask : X86Builtin<"unsigned char(_Vector<4, float>, _Vector<4, float>, _Constant int, unsigned char)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def cmppd512_mask : X86Builtin<"unsigned char(_Vector<8, double>, _Vector<8, double>, _Constant int, unsigned char, _Constant int)">;
 }
 
@@ -986,7 +986,7 @@ let Features = "avx512vl", Attributes = [NoThrow, Const, RequiredVectorWidth<128
   def cmppd128_mask : X86Builtin<"unsigned char(_Vector<2, double>, _Vector<2, double>, _Constant int, unsigned char)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def rndscaleps_mask : X86Builtin<"_Vector<16, float>(_Vector<16, float>, _Constant int, _Vector<16, float>, unsigned short, _Constant int)">;
   def rndscalepd_mask : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Constant int, _Vector<8, double>, unsigned char, _Constant int)">;
   def cvtps2dq512_mask : X86Builtin<"_Vector<16, int>(_Vector<16, float>, _Vector<16, int>, unsigned short, _Constant int)">;
@@ -1004,12 +1004,12 @@ let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWi
   def vcvtph2ps512_mask : X86Builtin<"_Vector<16, float>(_Vector<16, short>, _Vector<16, float>, unsigned short, _Constant int)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, Constexpr, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, Constexpr, RequiredVectorWidth<512>] in {
   def pmuldq512 : X86Builtin<"_Vector<8, long long int>(_Vector<16, int>, _Vector<16, int>)">;
   def pmuludq512 : X86Builtin<"_Vector<8, long long int>(_Vector<16, int>, _Vector<16, int>)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, RequiredVectorWidth<512>] in {
   def loaddqusi512_mask : X86Builtin<"_Vector<16, int>(int const *, _Vector<16, int>, unsigned short)">;
   def loaddqudi512_mask : X86Builtin<"_Vector<8, long long int>(long long int const *, _Vector<8, long long int>, unsigned char)">;
   def loadups512_mask : X86Builtin<"_Vector<16, float>(float const *, _Vector<16, float>, unsigned short)">;
@@ -1024,7 +1024,7 @@ let Features = "avx512f,evex512", Attributes = [NoThrow, RequiredVectorWidth<512
   def storeaps512_mask : X86Builtin<"void(_Vector<16, float *>, _Vector<16, float>, unsigned short)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def alignq512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>, _Constant int)">;
   def alignd512 : X86Builtin<"_Vector<16, int>(_Vector<16, int>, _Vector<16, int>, _Constant int)">;
 }
@@ -1045,7 +1045,7 @@ let Features = "avx512vl", Attributes = [NoThrow, Const, RequiredVectorWidth<256
   def alignq256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>, _Constant int)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def extractf64x4_mask : X86Builtin<"_Vector<4, double>(_Vector<8, double>, _Constant int, _Vector<4, double>, unsigned char)">;
   def extractf32x4_mask : X86Builtin<"_Vector<4, float>(_Vector<16, float>, _Constant int, _Vector<4, float>, unsigned char)">;
 }
@@ -1058,7 +1058,7 @@ let Features = "avx512vl,avx512vnni|avxvnni", Attributes = [NoThrow, Const, Requ
   def vpdpbusd256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avx512vnni,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512vnni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vpdpbusd512 : X86Builtin<"_Vector<16, int>(_Vector<16, int>, _Vector<16, int>, _Vector<16, int>)">;
 }
 
@@ -1070,7 +1070,7 @@ let Features = "avx512vl,avx512vnni|avxvnni", Attributes = [NoThrow, Const, Requ
   def vpdpbusds256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avx512vnni,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512vnni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vpdpbusds512 : X86Builtin<"_Vector<16, int>(_Vector<16, int>, _Vector<16, int>, _Vector<16, int>)">;
 }
 
@@ -1082,7 +1082,7 @@ let Features = "avx512vl,avx512vnni|avxvnni", Attributes = [NoThrow, Const, Requ
   def vpdpwssd256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avx512vnni,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512vnni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vpdpwssd512 : X86Builtin<"_Vector<16, int>(_Vector<16, int>, _Vector<16, int>, _Vector<16, int>)">;
 }
 
@@ -1094,55 +1094,55 @@ let Features = "avx512vl,avx512vnni|avxvnni", Attributes = [NoThrow, Const, Requ
   def vpdpwssds256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avx512vnni,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512vnni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vpdpwssds512 : X86Builtin<"_Vector<16, int>(_Vector<16, int>, _Vector<16, int>, _Vector<16, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
   def vpdpbssd128 : X86Builtin<"_Vector<4, int>(_Vector<4, int>, _Vector<4, int>, _Vector<4, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
   def vpdpbssd256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
   def vpdpbssds128 : X86Builtin<"_Vector<4, int>(_Vector<4, int>, _Vector<4, int>, _Vector<4, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
   def vpdpbssds256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
   def vpdpbsud128 : X86Builtin<"_Vector<4, int>(_Vector<4, int>, _Vector<4, int>, _Vector<4, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
   def vpdpbsud256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
   def vpdpbsuds128 : X86Builtin<"_Vector<4, int>(_Vector<4, int>, _Vector<4, int>, _Vector<4, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [...
[truncated]

llvmbot · 2025-09-05T07:26:58Z

@llvm/pr-subscribers-clang

Author: Phoebe Wang (phoebewang)

Changes

The 256-bit maximum vector register size control was removed from AVX10 whitepaper, ref: https://cdrdv2.intel.com/v1/dl/getContent/784343

We have warned these options in LLVM21 through #28154. This patch removes underlying implementations in LLVM22.

Patch is 455.29 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/157034.diff

156 Files Affected:

(modified) clang/docs/ReleaseNotes.rst (+4)
(modified) clang/docs/UsersManual.rst (+6-51)
(modified) clang/include/clang/Basic/BuiltinsX86.td (+382-382)
(modified) clang/include/clang/Basic/BuiltinsX86_64.td (+15-15)
(modified) clang/include/clang/Driver/Options.td (+4-14)
(modified) clang/lib/Basic/Targets/X86.cpp (+14-83)
(modified) clang/lib/Basic/Targets/X86.h (-3)
(modified) clang/lib/CodeGen/Targets/X86.cpp (+2-20)
(modified) clang/lib/Driver/ToolChains/Arch/X86.cpp (-41)
(modified) clang/lib/Headers/amxavx512intrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2_512bf16intrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2_512convertintrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2_512niintrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2_512satcvtdsintrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2bf16intrin.h (+2-2)
(modified) clang/lib/Headers/avx10_2convertintrin.h (+2-2)
(modified) clang/lib/Headers/avx10_2copyintrin.h (+1-1)
(modified) clang/lib/Headers/avx10_2niintrin.h (+2-2)
(modified) clang/lib/Headers/avx10_2satcvtdsintrin.h (+2-2)
(modified) clang/lib/Headers/avx2intrin.h (-9)
(modified) clang/lib/Headers/avx512bf16intrin.h (+3-4)
(modified) clang/lib/Headers/avx512bitalgintrin.h (+1-2)
(modified) clang/lib/Headers/avx512bwintrin.h (+3-4)
(modified) clang/lib/Headers/avx512cdintrin.h (+2-2)
(modified) clang/lib/Headers/avx512dqintrin.h (+4-3)
(modified) clang/lib/Headers/avx512fintrin.h (+6-5)
(modified) clang/lib/Headers/avx512fp16intrin.h (+4-6)
(modified) clang/lib/Headers/avx512ifmaintrin.h (+2-2)
(modified) clang/lib/Headers/avx512ifmavlintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vbmi2intrin.h (+3-2)
(modified) clang/lib/Headers/avx512vbmiintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vbmivlintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlbf16intrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlbitalgintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlbwintrin.h (+2-4)
(modified) clang/lib/Headers/avx512vlcdintrin.h (+2-4)
(modified) clang/lib/Headers/avx512vldqintrin.h (+2-4)
(modified) clang/lib/Headers/avx512vlfp16intrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlintrin.h (+2-4)
(modified) clang/lib/Headers/avx512vlvbmi2intrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlvnniintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vlvp2intersectintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vnniintrin.h (+2-2)
(modified) clang/lib/Headers/avx512vp2intersectintrin.h (+1-2)
(modified) clang/lib/Headers/avx512vpopcntdqintrin.h (+2-3)
(modified) clang/lib/Headers/avx512vpopcntdqvlintrin.h (+4-4)
(modified) clang/lib/Headers/avxintrin.h (-9)
(modified) clang/lib/Headers/emmintrin.h (-6)
(modified) clang/lib/Headers/fmaintrin.h (+2-2)
(modified) clang/lib/Headers/gfniintrin.h (+2-27)
(modified) clang/lib/Headers/mmintrin.h (+3-9)
(modified) clang/lib/Headers/movrs_avx10_2_512intrin.h (+2-2)
(modified) clang/lib/Headers/movrs_avx10_2intrin.h (+4-4)
(modified) clang/lib/Headers/pmmintrin.h (-6)
(modified) clang/lib/Headers/sm4evexintrin.h (+2-2)
(modified) clang/lib/Headers/smmintrin.h (-6)
(modified) clang/lib/Headers/tmmintrin.h (-6)
(modified) clang/lib/Headers/vaesintrin.h (+1-2)
(modified) clang/lib/Headers/xmmintrin.h (-9)
(modified) clang/test/CodeGen/X86/amx_avx512_api.c (+1-1)
(modified) clang/test/CodeGen/X86/amxavx512-builtins.c (+1-1)
(modified) clang/test/CodeGen/X86/avx10_2_512bf16-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512convert-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512minmax-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512minmax-error.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512ni-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512satcvt-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins-errors.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins-x64-error.c (+1-1)
(modified) clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins-x64.c (+1-1)
(modified) clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2bf16-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2convert-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2minmax-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2ni-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2satcvt-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx10_2satcvtds-builtins-x64.c (+1-1)
(modified) clang/test/CodeGen/X86/avx10_2satcvtds-builtins.c (+2-2)
(modified) clang/test/CodeGen/X86/avx512-error.c (+4-14)
(modified) clang/test/CodeGen/X86/avx512copy-builtins.c (+1-1)
(modified) clang/test/CodeGen/X86/avx512vlbw-builtins.c (+4-4)
(modified) clang/test/CodeGen/X86/avxvnniint16-builtins.c (+4-4)
(modified) clang/test/CodeGen/X86/avxvnniint8-builtins.c (+4-4)
(modified) clang/test/CodeGen/X86/movrs-avx10.2-512-builtins-error-32.c (+1-1)
(modified) clang/test/CodeGen/X86/movrs-avx10.2-512-builtins.c (+1-1)
(modified) clang/test/CodeGen/X86/movrs-avx10.2-builtins-error-32.c (+1-1)
(modified) clang/test/CodeGen/X86/movrs-avx10.2-builtins.c (+1-1)
(modified) clang/test/CodeGen/X86/sm4-evex-builtins.c (+2-2)
(modified) clang/test/CodeGen/attr-cpuspecific.c (+1-1)
(modified) clang/test/CodeGen/attr-target-x86.c (+7-12)
(modified) clang/test/CodeGen/regcall2.c (+2-2)
(modified) clang/test/CodeGen/target-avx-abi-diag.c (+1-53)
(modified) clang/test/CodeGen/target-builtin-noerror.c (+2-4)
(modified) clang/test/Driver/x86-target-features.c (+8-34)
(modified) clang/test/Preprocessor/predefined-arch-macros-x86.c (-1)
(modified) clang/test/Preprocessor/predefined-arch-macros.c (-48)
(modified) clang/test/Preprocessor/x86_target_features.c (+14-59)
(modified) llvm/include/llvm/TargetParser/X86TargetParser.def (+2-2)
(modified) llvm/lib/IR/Verifier.cpp (-10)
(modified) llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp (-3)
(modified) llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp (-12)
(modified) llvm/lib/Target/X86/X86.td (+12-14)
(modified) llvm/lib/Target/X86/X86ISelLowering.cpp (+14-20)
(modified) llvm/lib/Target/X86/X86ISelLoweringCall.cpp (+2-2)
(modified) llvm/lib/Target/X86/X86InstrAMX.td (+7-7)
(modified) llvm/lib/Target/X86/X86InstrAVX10.td (+52-79)
(modified) llvm/lib/Target/X86/X86InstrAVX512.td (+21-21)
(modified) llvm/lib/Target/X86/X86InstrPredicates.td (-3)
(modified) llvm/lib/Target/X86/X86Subtarget.cpp (-18)
(modified) llvm/lib/Target/X86/X86Subtarget.h (+2-4)
(modified) llvm/lib/Target/X86/X86TargetTransformInfo.cpp (+2-2)
(modified) llvm/lib/TargetParser/Host.cpp (+2-8)
(modified) llvm/lib/TargetParser/X86TargetParser.cpp (+18-22)
(modified) llvm/test/CodeGen/X86/amx-across-func-tilemovrow.ll (+3-3)
(modified) llvm/test/CodeGen/X86/amx-avx512-intrinsics.ll (+1-1)
(modified) llvm/test/CodeGen/X86/amx-tile-avx512-internals.ll (+1-1)
(modified) llvm/test/CodeGen/X86/avx10.2-fma-commute.ll (+1-1)
(modified) llvm/test/CodeGen/X86/avx10_2-cmp.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512bf16-arith.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512bf16-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512convert-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512fptosi_satcvtds.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512minmax-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512ni-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512satcvt-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2_512satcvtds-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2bf16-arith.ll (+6-19)
(modified) llvm/test/CodeGen/X86/avx10_2bf16-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2convert-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2fptosi_satcvtds.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2minmax-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2ni-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2satcvt-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2satcvtds-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avx10_2satcvtds-x64-intrinsics.ll (+1-1)
(modified) llvm/test/CodeGen/X86/avx512bwvl-arith.ll (+14-28)
(modified) llvm/test/CodeGen/X86/avx512copy-intrinsics.ll (+1-1)
(modified) llvm/test/CodeGen/X86/avxvnniint16-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/avxvnniint8-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/comi-flags.ll (+1-1)
(removed) llvm/test/CodeGen/X86/evex512-mem.ll (-29)
(modified) llvm/test/CodeGen/X86/fminimum-fmaximum.ll (+1-1)
(modified) llvm/test/CodeGen/X86/fminimumnum-fmaximumnum.ll (+1-1)
(modified) llvm/test/CodeGen/X86/movrs-avx10.2-512-intrinsics.ll (+1-1)
(modified) llvm/test/CodeGen/X86/movrs-avx10.2-intrinsics.ll (+1-1)
(removed) llvm/test/CodeGen/X86/pr90844.ll (-36)
(modified) llvm/test/CodeGen/X86/sm4-evex-intrinsics.ll (+2-2)
(modified) llvm/test/CodeGen/X86/vector-fshl-128.ll (+1-2)
(modified) llvm/test/CodeGen/X86/vector-fshl-256.ll (+89-140)
(modified) llvm/test/CodeGen/X86/vector-fshr-128.ll (+1-2)
(modified) llvm/test/CodeGen/X86/vector-fshr-256.ll (+68-104)
(modified) llvm/test/Instrumentation/MemorySanitizer/X86/avx10_2_512ni-intrinsics.ll (+1-1)
(modified) llvm/test/Instrumentation/MemorySanitizer/X86/avx10_2ni-intrinsics.ll (+1-1)
(modified) llvm/test/Transforms/LoopVectorize/X86/cost-conditional-branches.ll (+175-17)
(modified) llvm/test/Transforms/SLPVectorizer/X86/revec-getExtractWithExtendCost.ll (+1-1)
(modified) llvm/test/Transforms/SLPVectorizer/X86/revec-load-compress.ll (+1-1)

diff --git a/clang/docs/ReleaseNotes.rst b/clang/docs/ReleaseNotes.rst
index dd53b4d46f3cc..7b2a0bdfefa2b 100644
--- a/clang/docs/ReleaseNotes.rst
+++ b/clang/docs/ReleaseNotes.rst
@@ -373,6 +373,10 @@ X86 Support
 - NOTE: Please avoid use of the __builtin_ia32_* intrinsics - these are not
   guaranteed to exist in future releases, or match behaviour with previous
   releases of clang or other compilers.
+- Remove `m[no-]avx10.x-[256,512]` and `m[no-]evex512` options from Clang
+  driver.
+- Remove `[no-]evex512` feature request from intrinsics and builtins.
+- Change features `avx10.x-[256,512]` to `avx10.x`.
 
 Arm and AArch64 Support
 ^^^^^^^^^^^^^^^^^^^^^^^
diff --git a/clang/docs/UsersManual.rst b/clang/docs/UsersManual.rst
index 0e85c8109fd5e..a8bbf146431ea 100644
--- a/clang/docs/UsersManual.rst
+++ b/clang/docs/UsersManual.rst
@@ -4581,59 +4581,14 @@ implicitly included in later levels.
 - ``-march=x86-64-v3``: (close to Haswell) AVX, AVX2, BMI1, BMI2, F16C, FMA, LZCNT, MOVBE, XSAVE
 - ``-march=x86-64-v4``: AVX512F, AVX512BW, AVX512CD, AVX512DQ, AVX512VL
 
-`Intel AVX10 ISA <https://cdrdv2.intel.com/v1/dl/getContent/784267>`_ is
+`Intel AVX10 ISA <https://cdrdv2.intel.com/v1/dl/getContent/784343>`_ is
 a major new vector ISA incorporating the modern vectorization aspects of
 Intel AVX-512. This ISA will be supported on all future Intel processors.
-Users are supposed to use the new options ``-mavx10.N`` and ``-mavx10.N-512``
-on these processors and should not use traditional AVX512 options anymore.
-
-The ``N`` in ``-mavx10.N`` represents a continuous integer number starting
-from ``1``. ``-mavx10.N`` is an alias of ``-mavx10.N-256``, which means to
-enable all instructions within AVX10 version N at a maximum vector length of
-256 bits. ``-mavx10.N-512`` enables all instructions at a maximum vector
-length of 512 bits, which is a superset of instructions ``-mavx10.N`` enabled.
-
-Current binaries built with AVX512 features can run on Intel AVX10/512 capable
-processors without re-compile, but cannot run on AVX10/256 capable processors.
-Users need to re-compile their code with ``-mavx10.N``, and maybe update some
-code that calling to 512-bit X86 specific intrinsics and passing or returning
-512-bit vector types in function call, if they want to run on AVX10/256 capable
-processors. Binaries built with ``-mavx10.N`` can run on both AVX10/256 and
-AVX10/512 capable processors.
-
-Users can add a ``-mno-evex512`` in the command line with AVX512 options if
-they want to run the binary on both legacy AVX512 and new AVX10/256 capable
-processors. The option has the same constraints as ``-mavx10.N``, i.e.,
-cannot call to 512-bit X86 specific intrinsics and pass or return 512-bit vector
-types in function call.
-
-Users should avoid using AVX512 features in function target attributes when
-developing code for AVX10. If they have to do so, they need to add an explicit
-``evex512`` or ``no-evex512`` together with AVX512 features for 512-bit or
-non-512-bit functions respectively to avoid unexpected code generation. Both
-command line option and target attribute of EVEX512 feature can only be used
-with AVX512. They don't affect vector size of AVX10.
-
-User should not mix the use AVX10 and AVX512 options together at any time,
-because the option combinations are conflicting sometimes. For example, a
-combination of ``-mavx512f -mavx10.1-256`` doesn't show a clear intention to
-compiler, since instructions in AVX512F and AVX10.1/256 intersect but do not
-overlap. In this case, compiler will emit warning for it, but the behavior
-is determined. It will generate the same code as option ``-mavx10.1-512``.
-A similar case is ``-mavx512f -mavx10.2-256``, which equals to
-``-mavx10.1-512 -mavx10.2-256``, because ``avx10.2-256`` implies ``avx10.1-256``
-and ``-mavx512f -mavx10.1-256`` equals to ``-mavx10.1-512``.
-
-There are some new macros introduced with AVX10 support. ``-mavx10.1-256`` will
-enable ``__AVX10_1__`` and ``__EVEX256__``, while ``-mavx10.1-512`` enables
-``__AVX10_1__``, ``__EVEX256__``, ``__EVEX512__``  and ``__AVX10_1_512__``.
-Besides, both ``-mavx10.1-256`` and ``-mavx10.1-512`` will enable all AVX512
-feature specific macros. A AVX512 feature will enable both ``__EVEX256__``,
-``__EVEX512__`` and its own macro. So ``__EVEX512__`` can be used to guard code
-that can run on both legacy AVX512 and AVX10/512 capable processors but cannot
-run on AVX10/256, while a AVX512 macro like ``__AVX512F__`` cannot tell the
-difference among the three options. Users need to check additional macros
-``__AVX10_1__`` and ``__EVEX512__`` if they want to make distinction.
+Users are supposed to use the new options ``-mavx10.N`` on these processors
+and should not use traditional AVX512 options anymore. The ``N`` in
+``-mavx10.N`` represents a continuous integer number starting
+from ``1``. Current binaries built with AVX512 features can run on Intel AVX10
+capable processors without re-compile.
 
 ARM
 ^^^
diff --git a/clang/include/clang/Basic/BuiltinsX86.td b/clang/include/clang/Basic/BuiltinsX86.td
index acd8f70c4a5f2..cdd5d54a68d79 100644
--- a/clang/include/clang/Basic/BuiltinsX86.td
+++ b/clang/include/clang/Basic/BuiltinsX86.td
@@ -387,7 +387,7 @@ let Features = "vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] i
   def aesenc256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>)">;
 }
 
-let Features = "avx512f,evex512,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def aesenc512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>)">;
 }
 
@@ -395,7 +395,7 @@ let Features = "vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] i
   def aesenclast256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>)">;
 }
 
-let Features = "avx512f,evex512,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def aesenclast512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>)">;
 }
 
@@ -403,7 +403,7 @@ let Features = "vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] i
   def aesdec256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>)">;
 }
 
-let Features = "avx512f,evex512,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def aesdec512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>)">;
 }
 
@@ -411,7 +411,7 @@ let Features = "vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] i
   def aesdeclast256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>)">;
 }
 
-let Features = "avx512f,evex512,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,vaes", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def aesdeclast512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>)">;
 }
 
@@ -423,7 +423,7 @@ let Features = "avx,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<256
   def vgf2p8affineinvqb_v32qi : X86Builtin<"_Vector<32, char>(_Vector<32, char>, _Vector<32, char>, _Constant char)">;
 }
 
-let Features = "avx512f,evex512,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vgf2p8affineinvqb_v64qi : X86Builtin<"_Vector<64, char>(_Vector<64, char>, _Vector<64, char>, _Constant char)">;
 }
 
@@ -435,7 +435,7 @@ let Features = "avx,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<256
   def vgf2p8affineqb_v32qi : X86Builtin<"_Vector<32, char>(_Vector<32, char>, _Vector<32, char>, _Constant char)">;
 }
 
-let Features = "avx512f,evex512,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vgf2p8affineqb_v64qi : X86Builtin<"_Vector<64, char>(_Vector<64, char>, _Vector<64, char>, _Constant char)">;
 }
 
@@ -447,7 +447,7 @@ let Features = "avx,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<256
   def vgf2p8mulb_v32qi : X86Builtin<"_Vector<32, char>(_Vector<32, char>, _Vector<32, char>)">;
 }
 
-let Features = "avx512f,evex512,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,gfni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vgf2p8mulb_v64qi : X86Builtin<"_Vector<64, char>(_Vector<64, char>, _Vector<64, char>)">;
 }
 
@@ -459,7 +459,7 @@ let Features = "vpclmulqdq", Attributes = [NoThrow, Const, RequiredVectorWidth<2
   def pclmulqdq256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>, _Constant char)">;
 }
 
-let Features = "avx512f,evex512,vpclmulqdq", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f,vpclmulqdq", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def pclmulqdq512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>, _Constant char)">;
 }
 
@@ -830,7 +830,7 @@ let Features = "fma|fma4", Attributes = [NoThrow, Const, RequiredVectorWidth<256
   def vfmaddsubpd256 : X86Builtin<"_Vector<4, double>(_Vector<4, double>, _Vector<4, double>, _Vector<4, double>)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vfmaddpd512_mask : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Vector<8, double>, _Vector<8, double>, unsigned char, _Constant int)">;
   def vfmaddpd512_maskz : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Vector<8, double>, _Vector<8, double>, unsigned char, _Constant int)">;
   def vfmaddpd512_mask3 : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Vector<8, double>, _Vector<8, double>, unsigned char, _Constant int)">;
@@ -936,7 +936,7 @@ let Features = "pku", Attributes = [NoThrow] in {
   def wrpkru : X86Builtin<"void(unsigned int)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def sqrtpd512 : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Constant int)">;
   def sqrtps512 : X86Builtin<"_Vector<16, float>(_Vector<16, float>, _Constant int)">;
 }
@@ -946,7 +946,7 @@ let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<128>
   def rsqrt14ss_mask : X86Builtin<"_Vector<4, float>(_Vector<4, float>, _Vector<4, float>, _Vector<4, float>, unsigned char)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def rsqrt14pd512_mask : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Vector<8, double>, unsigned char)">;
   def rsqrt14ps512_mask : X86Builtin<"_Vector<16, float>(_Vector<16, float>, _Vector<16, float>, unsigned short)">;
 }
@@ -956,7 +956,7 @@ let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<128>
   def rcp14ss_mask : X86Builtin<"_Vector<4, float>(_Vector<4, float>, _Vector<4, float>, _Vector<4, float>, unsigned char)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def rcp14pd512_mask : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Vector<8, double>, unsigned char)">;
   def rcp14ps512_mask : X86Builtin<"_Vector<16, float>(_Vector<16, float>, _Vector<16, float>, unsigned short)">;
   def cvttps2dq512_mask : X86Builtin<"_Vector<16, int>(_Vector<16, float>, _Vector<16, int>, unsigned short, _Constant int)">;
@@ -974,7 +974,7 @@ let Features = "avx512vl", Attributes = [NoThrow, Const, RequiredVectorWidth<128
   def cmpps128_mask : X86Builtin<"unsigned char(_Vector<4, float>, _Vector<4, float>, _Constant int, unsigned char)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def cmppd512_mask : X86Builtin<"unsigned char(_Vector<8, double>, _Vector<8, double>, _Constant int, unsigned char, _Constant int)">;
 }
 
@@ -986,7 +986,7 @@ let Features = "avx512vl", Attributes = [NoThrow, Const, RequiredVectorWidth<128
   def cmppd128_mask : X86Builtin<"unsigned char(_Vector<2, double>, _Vector<2, double>, _Constant int, unsigned char)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def rndscaleps_mask : X86Builtin<"_Vector<16, float>(_Vector<16, float>, _Constant int, _Vector<16, float>, unsigned short, _Constant int)">;
   def rndscalepd_mask : X86Builtin<"_Vector<8, double>(_Vector<8, double>, _Constant int, _Vector<8, double>, unsigned char, _Constant int)">;
   def cvtps2dq512_mask : X86Builtin<"_Vector<16, int>(_Vector<16, float>, _Vector<16, int>, unsigned short, _Constant int)">;
@@ -1004,12 +1004,12 @@ let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWi
   def vcvtph2ps512_mask : X86Builtin<"_Vector<16, float>(_Vector<16, short>, _Vector<16, float>, unsigned short, _Constant int)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, Constexpr, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, Constexpr, RequiredVectorWidth<512>] in {
   def pmuldq512 : X86Builtin<"_Vector<8, long long int>(_Vector<16, int>, _Vector<16, int>)">;
   def pmuludq512 : X86Builtin<"_Vector<8, long long int>(_Vector<16, int>, _Vector<16, int>)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, RequiredVectorWidth<512>] in {
   def loaddqusi512_mask : X86Builtin<"_Vector<16, int>(int const *, _Vector<16, int>, unsigned short)">;
   def loaddqudi512_mask : X86Builtin<"_Vector<8, long long int>(long long int const *, _Vector<8, long long int>, unsigned char)">;
   def loadups512_mask : X86Builtin<"_Vector<16, float>(float const *, _Vector<16, float>, unsigned short)">;
@@ -1024,7 +1024,7 @@ let Features = "avx512f,evex512", Attributes = [NoThrow, RequiredVectorWidth<512
   def storeaps512_mask : X86Builtin<"void(_Vector<16, float *>, _Vector<16, float>, unsigned short)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def alignq512 : X86Builtin<"_Vector<8, long long int>(_Vector<8, long long int>, _Vector<8, long long int>, _Constant int)">;
   def alignd512 : X86Builtin<"_Vector<16, int>(_Vector<16, int>, _Vector<16, int>, _Constant int)">;
 }
@@ -1045,7 +1045,7 @@ let Features = "avx512vl", Attributes = [NoThrow, Const, RequiredVectorWidth<256
   def alignq256 : X86Builtin<"_Vector<4, long long int>(_Vector<4, long long int>, _Vector<4, long long int>, _Constant int)">;
 }
 
-let Features = "avx512f,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512f", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def extractf64x4_mask : X86Builtin<"_Vector<4, double>(_Vector<8, double>, _Constant int, _Vector<4, double>, unsigned char)">;
   def extractf32x4_mask : X86Builtin<"_Vector<4, float>(_Vector<16, float>, _Constant int, _Vector<4, float>, unsigned char)">;
 }
@@ -1058,7 +1058,7 @@ let Features = "avx512vl,avx512vnni|avxvnni", Attributes = [NoThrow, Const, Requ
   def vpdpbusd256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avx512vnni,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512vnni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vpdpbusd512 : X86Builtin<"_Vector<16, int>(_Vector<16, int>, _Vector<16, int>, _Vector<16, int>)">;
 }
 
@@ -1070,7 +1070,7 @@ let Features = "avx512vl,avx512vnni|avxvnni", Attributes = [NoThrow, Const, Requ
   def vpdpbusds256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avx512vnni,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512vnni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vpdpbusds512 : X86Builtin<"_Vector<16, int>(_Vector<16, int>, _Vector<16, int>, _Vector<16, int>)">;
 }
 
@@ -1082,7 +1082,7 @@ let Features = "avx512vl,avx512vnni|avxvnni", Attributes = [NoThrow, Const, Requ
   def vpdpwssd256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avx512vnni,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512vnni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vpdpwssd512 : X86Builtin<"_Vector<16, int>(_Vector<16, int>, _Vector<16, int>, _Vector<16, int>)">;
 }
 
@@ -1094,55 +1094,55 @@ let Features = "avx512vl,avx512vnni|avxvnni", Attributes = [NoThrow, Const, Requ
   def vpdpwssds256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avx512vnni,evex512", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
+let Features = "avx512vnni", Attributes = [NoThrow, Const, RequiredVectorWidth<512>] in {
   def vpdpwssds512 : X86Builtin<"_Vector<16, int>(_Vector<16, int>, _Vector<16, int>, _Vector<16, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
   def vpdpbssd128 : X86Builtin<"_Vector<4, int>(_Vector<4, int>, _Vector<4, int>, _Vector<4, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
   def vpdpbssd256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
   def vpdpbssds128 : X86Builtin<"_Vector<4, int>(_Vector<4, int>, _Vector<4, int>, _Vector<4, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
   def vpdpbssds256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
   def vpdpbsud128 : X86Builtin<"_Vector<4, int>(_Vector<4, int>, _Vector<4, int>, _Vector<4, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<256>] in {
   def vpdpbsud256 : X86Builtin<"_Vector<8, int>(_Vector<8, int>, _Vector<8, int>, _Vector<8, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
+let Features = "avxvnniint8|avx10.2", Attributes = [NoThrow, Const, RequiredVectorWidth<128>] in {
   def vpdpbsuds128 : X86Builtin<"_Vector<4, int>(_Vector<4, int>, _Vector<4, int>, _Vector<4, int>)">;
 }
 
-let Features = "avxvnniint8|avx10.2-256", Attributes = [...
[truncated]

github-actions · 2025-09-05T07:32:49Z

⚠️ C/C++ code formatter, clang-format found issues in your code. ⚠️

You can test this locally with the following command:

git-clang-format --diff origin/main HEAD --extensions cpp,h,c -- clang/lib/Basic/Targets/X86.cpp clang/lib/Basic/Targets/X86.h clang/lib/CodeGen/Targets/X86.cpp clang/lib/Driver/ToolChains/Arch/X86.cpp clang/lib/Headers/amxavx512intrin.h clang/lib/Headers/avx10_2_512bf16intrin.h clang/lib/Headers/avx10_2_512convertintrin.h clang/lib/Headers/avx10_2_512niintrin.h clang/lib/Headers/avx10_2_512satcvtdsintrin.h clang/lib/Headers/avx10_2bf16intrin.h clang/lib/Headers/avx10_2convertintrin.h clang/lib/Headers/avx10_2copyintrin.h clang/lib/Headers/avx10_2niintrin.h clang/lib/Headers/avx10_2satcvtdsintrin.h clang/lib/Headers/avx2intrin.h clang/lib/Headers/avx512bf16intrin.h clang/lib/Headers/avx512bitalgintrin.h clang/lib/Headers/avx512bwintrin.h clang/lib/Headers/avx512cdintrin.h clang/lib/Headers/avx512dqintrin.h clang/lib/Headers/avx512fintrin.h clang/lib/Headers/avx512fp16intrin.h clang/lib/Headers/avx512ifmaintrin.h clang/lib/Headers/avx512ifmavlintrin.h clang/lib/Headers/avx512vbmi2intrin.h clang/lib/Headers/avx512vbmiintrin.h clang/lib/Headers/avx512vbmivlintrin.h clang/lib/Headers/avx512vlbf16intrin.h clang/lib/Headers/avx512vlbitalgintrin.h clang/lib/Headers/avx512vlbwintrin.h clang/lib/Headers/avx512vlcdintrin.h clang/lib/Headers/avx512vldqintrin.h clang/lib/Headers/avx512vlfp16intrin.h clang/lib/Headers/avx512vlintrin.h clang/lib/Headers/avx512vlvbmi2intrin.h clang/lib/Headers/avx512vlvnniintrin.h clang/lib/Headers/avx512vlvp2intersectintrin.h clang/lib/Headers/avx512vnniintrin.h clang/lib/Headers/avx512vp2intersectintrin.h clang/lib/Headers/avx512vpopcntdqintrin.h clang/lib/Headers/avx512vpopcntdqvlintrin.h clang/lib/Headers/avxintrin.h clang/lib/Headers/emmintrin.h clang/lib/Headers/fmaintrin.h clang/lib/Headers/gfniintrin.h clang/lib/Headers/mmintrin.h clang/lib/Headers/movrs_avx10_2_512intrin.h clang/lib/Headers/movrs_avx10_2intrin.h clang/lib/Headers/pmmintrin.h clang/lib/Headers/sm4evexintrin.h clang/lib/Headers/smmintrin.h clang/lib/Headers/tmmintrin.h clang/lib/Headers/vaesintrin.h clang/lib/Headers/xmmintrin.h clang/test/CodeGen/X86/amx_avx512_api.c clang/test/CodeGen/X86/amxavx512-builtins.c clang/test/CodeGen/X86/avx10_2_512bf16-builtins.c clang/test/CodeGen/X86/avx10_2_512convert-builtins.c clang/test/CodeGen/X86/avx10_2_512minmax-builtins.c clang/test/CodeGen/X86/avx10_2_512minmax-error.c clang/test/CodeGen/X86/avx10_2_512ni-builtins.c clang/test/CodeGen/X86/avx10_2_512satcvt-builtins.c clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins-errors.c clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins-x64-error.c clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins-x64.c clang/test/CodeGen/X86/avx10_2_512satcvtds-builtins.c clang/test/CodeGen/X86/avx10_2bf16-builtins.c clang/test/CodeGen/X86/avx10_2convert-builtins.c clang/test/CodeGen/X86/avx10_2minmax-builtins.c clang/test/CodeGen/X86/avx10_2ni-builtins.c clang/test/CodeGen/X86/avx10_2satcvt-builtins.c clang/test/CodeGen/X86/avx10_2satcvtds-builtins-x64.c clang/test/CodeGen/X86/avx10_2satcvtds-builtins.c clang/test/CodeGen/X86/avx512-error.c clang/test/CodeGen/X86/avx512copy-builtins.c clang/test/CodeGen/X86/avx512vlbw-builtins.c clang/test/CodeGen/X86/avxvnniint16-builtins.c clang/test/CodeGen/X86/avxvnniint8-builtins.c clang/test/CodeGen/X86/movrs-avx10.2-512-builtins-error-32.c clang/test/CodeGen/X86/movrs-avx10.2-512-builtins.c clang/test/CodeGen/X86/movrs-avx10.2-builtins-error-32.c clang/test/CodeGen/X86/movrs-avx10.2-builtins.c clang/test/CodeGen/X86/sm4-evex-builtins.c clang/test/CodeGen/attr-cpuspecific.c clang/test/CodeGen/attr-target-x86.c clang/test/CodeGen/regcall2.c clang/test/CodeGen/target-avx-abi-diag.c clang/test/CodeGen/target-builtin-noerror.c clang/test/Driver/x86-target-features.c clang/test/Preprocessor/predefined-arch-macros-x86.c clang/test/Preprocessor/predefined-arch-macros.c clang/test/Preprocessor/x86_target_features.c llvm/lib/IR/Verifier.cpp llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp llvm/lib/Target/X86/MCTargetDesc/X86MCTargetDesc.cpp llvm/lib/Target/X86/X86ISelLowering.cpp llvm/lib/Target/X86/X86ISelLoweringCall.cpp llvm/lib/Target/X86/X86Subtarget.cpp llvm/lib/Target/X86/X86Subtarget.h llvm/lib/Target/X86/X86TargetTransformInfo.cpp llvm/lib/TargetParser/Host.cpp llvm/lib/TargetParser/X86TargetParser.cpp

⚠️
The reproduction instructions above might return results for more than one PR
in a stack if you are using a stacked PR workflow. You can limit the results by
changing origin/main to the base branch/commit you want to compare against.
⚠️

View the diff from clang-format here.

diff --git a/llvm/lib/TargetParser/Host.cpp b/llvm/lib/TargetParser/Host.cpp
index 638481606..b83b80421 100644
--- a/llvm/lib/TargetParser/Host.cpp
+++ b/llvm/lib/TargetParser/Host.cpp
@@ -2061,7 +2061,7 @@ StringMap<bool> sys::getHostCPUFeatures() {
   Features["invpcid"]    = HasLeaf7 && ((EBX >> 10) & 1);
   Features["rtm"]        = HasLeaf7 && ((EBX >> 11) & 1);
   // AVX512 is only supported if the OS supports the context save for it.
-  Features["avx512f"]    = HasLeaf7 && ((EBX >> 16) & 1) && HasAVX512Save;
+  Features["avx512f"] = HasLeaf7 && ((EBX >> 16) & 1) && HasAVX512Save;
   Features["avx512dq"]   = HasLeaf7 && ((EBX >> 17) & 1) && HasAVX512Save;
   Features["rdseed"]     = HasLeaf7 && ((EBX >> 18) & 1);
   Features["adx"]        = HasLeaf7 && ((EBX >> 19) & 1);

RKSimon

Do we have any test coverage of the error messages if people use the old args and does it explain how they can fix it? Not sure how important it is given its still early days for avx10 but I imagine someone has already started using it......

phoebewang · 2025-09-05T10:00:51Z

Do we have any test coverage of the error messages if people use the old args and does it explain how they can fix it? Not sure how important it is given its still early days for avx10 but I imagine someone has already started using it......

Yes, we have reported warning in LLVM21 and have tests for it: https://github.com/llvm/llvm-project/pull/132542/files#diff-5f03b616b46b38c8f00da84c616bd5096df4ee66ecb9a7c20e980e4d0008f894R414

With this change, users will get an error like error: unknown argument: '-mavx10.1-512'

RKSimon

LGTM

jan-wassenberg · 2025-09-17T06:15:50Z

For the future, I wonder if we can avoid breaking changes like this and instead make the deprecated flags and targets no-ops. It is not just a matter of passing -mavx10.1-512 flags or not.

For dynamic dispatch, we have had to specify "evex512" attributes in code, and that has now transitioned from required to build-breaking. Because we are not only dealing with discrete clang releases, such behavior changes are quite painful.

It would help to add some way to detect from code whether the breaking change has occurred, for example a builtin #define.

phoebewang · 2025-09-17T06:57:15Z

For the future, I wonder if we can avoid breaking changes like this and instead make the deprecated flags and targets no-ops. It is not just a matter of passing -mavx10.1-512 flags or not.

For dynamic dispatch, we have had to specify "evex512" attributes in code, and that has now transitioned from required to build-breaking. Because we are not only dealing with discrete clang releases, such behavior changes are quite painful.

It would help to add some way to detect from code whether the breaking change has occurred, for example a builtin #define.

SW has to match with HW spec, so it's unavoidable sometimes when HW changed. We have tried to reduce the disturbance. So "evex512" turns to a no-op. "avx10.1-512" and "avx10.2-512" are aliases of "avx10.1" and "avx10.2". However, "no-evex512", "avx10.1-256" and "avx10.2-256" have no substitutable solutions because we don't have HW support.

jan-wassenberg · 2025-09-17T07:42:40Z

Thanks, appreciate the effort to reduce disturbance. Still, whenever there is a breaking change, it would be helpful to provide a way for SW to detect this, separately from the version number unless that actually atomically changes as part of the breaking change. Here, I believe the version both before and after is 22.0.0git.

Also expand list_targets to use foreach_target so that it prints the best available target attributes. PiperOrigin-RevId: 810355944

Also expand list_targets to use foreach_target so that it prints the best available target attributes. PiperOrigin-RevId: 810449933

aeubanks · 2025-09-24T20:52:51Z

hi, we're running into a crash with the following IR that I bisected to this patch

$ cat /tmp/a.ll
target datalayout = "e-m:e-p:32:32-p270:32:32-p271:32:32-p272:64:64-i128:128-f64:32:64-f80:32-n8:16:32-S128"
target triple = "i686-unknown-linux-android29"

declare void @g()

define <4 x i32> @f(<2 x double> %0) #0 {
  call void @g()
  %a = call <4 x i32> @llvm.x86.avx10.mask.vcvttpd2udqs.128(<2 x double> %0, <4 x i32> zeroinitializer, i8 -1)
  ret <4 x i32> %a
}

; Function Attrs: nocallback nofree nosync nounwind willreturn memory(none)
declare <4 x i32> @llvm.x86.avx10.mask.vcvttpd2udqs.128(<2 x double>, <4 x i32>, i8) #1

attributes #0 = { "target-features"="+avx10.2" }
attributes #1 = { nocallback nofree nosync nounwind willreturn memory(none) }
$ llc /tmp/a.ll -o /dev/null
llc: ../../llvm/lib/CodeGen/TargetInstrInfo.cpp:771: MachineInstr *llvm::TargetInstrInfo::foldMemoryOperand(MachineInstr &, ArrayRef<unsigned int>, int, LiveIntervals *, VirtRegMap *) const: Assertion `(!(Flags & MachineMemOperand::MOLoad) || NewMI->mayLoad()) && "Folded a use to a non-load!"' failed.

(well actually we get the following before this patch)

'+avx10.2' is not a recognized feature for this target (ignoring feature)
SplitVectorResult #0: t24: v4i32 = llvm.x86.avx10.mask.vcvttpd2udqs.128 TargetConstant:i32<13945>, t12, t22, Constant:i8<-1>

LLVM ERROR: Do not know how to split the result of this operator!

is this intended?

phoebewang · 2025-09-25T05:41:10Z

hi, we're running into a crash with the following IR that I bisected to this patch

$ cat /tmp/a.ll
target datalayout = "e-m:e-p:32:32-p270:32:32-p271:32:32-p272:64:64-i128:128-f64:32:64-f80:32-n8:16:32-S128"
target triple = "i686-unknown-linux-android29"

declare void @g()

define <4 x i32> @f(<2 x double> %0) #0 {
  call void @g()
  %a = call <4 x i32> @llvm.x86.avx10.mask.vcvttpd2udqs.128(<2 x double> %0, <4 x i32> zeroinitializer, i8 -1)
  ret <4 x i32> %a
}

; Function Attrs: nocallback nofree nosync nounwind willreturn memory(none)
declare <4 x i32> @llvm.x86.avx10.mask.vcvttpd2udqs.128(<2 x double>, <4 x i32>, i8) #1

attributes #0 = { "target-features"="+avx10.2" }
attributes #1 = { nocallback nofree nosync nounwind willreturn memory(none) }
$ llc /tmp/a.ll -o /dev/null
llc: ../../llvm/lib/CodeGen/TargetInstrInfo.cpp:771: MachineInstr *llvm::TargetInstrInfo::foldMemoryOperand(MachineInstr &, ArrayRef<unsigned int>, int, LiveIntervals *, VirtRegMap *) const: Assertion `(!(Flags & MachineMemOperand::MOLoad) || NewMI->mayLoad()) && "Folded a use to a non-load!"' failed.

(well actually we get the following before this patch)

'+avx10.2' is not a recognized feature for this target (ignoring feature)
SplitVectorResult #0: t24: v4i32 = llvm.x86.avx10.mask.vcvttpd2udqs.128 TargetConstant:i32<13945>, t12, t22, Constant:i8<-1>

LLVM ERROR: Do not know how to split the result of this operator!

is this intended?

I see the problem. It's expected to see the 2nd error before this patch. You can change "+avx10.2" to "+avx10.2-512" to see if the crash still happens.

alexfh · 2025-09-25T11:32:39Z

I see the problem. It's expected to see the 2nd error before this patch.

Yes, a proper error rather than an assertion failure would be a better failure mode: https://gcc.godbolt.org/z/Tbns9qYx7

phoebewang · 2025-09-25T12:29:41Z

I see the problem. It's expected to see the 2nd error before this patch.

Yes, a proper error rather than an assertion failure would be a better failure mode: https://gcc.godbolt.org/z/Tbns9qYx7

The error is due to missing required features. It's already checked in the FE. The problem here is to run new IR with old llc, which is not guaranteed.

https://chromium.googlesource.com/external/github.com/google/highway.git/+log/54fc0d7eb598..84379d1c73de $ git log 54fc0d7eb..84379d1c7 --date=short --no-merges --format='%ad %ae %s' 2025-09-25 janwas update README external link to its main branch 2025-09-25 49699333+dependabot[bot] Bump actions/cache from 4.2.4 to 4.3.0 2025-09-24 janwas Previously suppressed warning fixes: 2025-09-23 janwas Disable AVX10.2 as further fallout from llvm/llvm-project/pull/157034 2025-09-23 janwas Pool: reduce false sharing, minor cleanup 2025-09-22 janwas Update list of Highway users 2025-09-22 bgraur Adds HWY_HAVE_EVEX512 define to allow specifying from outside if EVEX512 is supported by the compiler or not. 2025-09-19 janwas also mention perf-load 2025-09-19 janwas Add mention of prior work on normalized processor time/blame shifting 2025-09-19 janwas Further pool simplification 2025-09-19 janwas split math_test due to RVV build timeout 2025-09-19 github Docs: update supported architectures and targets list 2025-09-19 github meson: remove unused LSX/LASX build options Created with: roll-dep src/third_party/highway/src Change-Id: I53226444e42621d95ba5d337e04d9f0448c1f7b8 Bug: 446982743 Reviewed-on: https://chromium-review.googlesource.com/c/chromium/src/+/6982987 Reviewed-by: Rick Byers <[email protected]> Commit-Queue: Arthur Eubanks <[email protected]> Cr-Commit-Position: refs/heads/main@{#1520796}

aeubanks · 2025-09-30T18:26:36Z

the following crashes with that assert at head

$ cat a.cpp
typedef long __m128i __attribute__((__vector_size__(16)));
__m128i _mm_undefined_si128();
__attribute__((
    __vector_size__(2 * sizeof(double)))) double _mm_cvtts_pd_epu32___A;
__m128i DemoteInRangeTo___trans_tmp_5;
__attribute__((target("avx512vpopcntdq,gfni,avx10.2"))) void DemoteInRangeTo() {
  DemoteInRangeTo___trans_tmp_5 = __builtin_ia32_vcvttpd2udqs128_mask(
      _mm_cvtts_pd_epu32___A, _mm_undefined_si128(), -1);
}
$ clang -cc1 -triple i686-unknown-linux-android29 -Os -emit-obj a.cpp
clang: ../../llvm/lib/CodeGen/TargetInstrInfo.cpp:773: MachineInstr *llvm::TargetInstrInfo::foldMemoryOperand(MachineInstr &, ArrayRef<unsigned int>, int, LiveIntervals *, VirtRegMap *) const: Assertion `(!(Flags & MachineMemOperand::MOLoad) || NewMI->mayLoad()) && "Folded a use to a non-load!"' failed.

and no frontend errors/warnings

phoebewang · 2025-10-01T14:45:49Z

the following crashes with that assert at head

$ cat a.cpp
typedef long __m128i __attribute__((__vector_size__(16)));
__m128i _mm_undefined_si128();
__attribute__((
    __vector_size__(2 * sizeof(double)))) double _mm_cvtts_pd_epu32___A;
__m128i DemoteInRangeTo___trans_tmp_5;
__attribute__((target("avx512vpopcntdq,gfni,avx10.2"))) void DemoteInRangeTo() {
  DemoteInRangeTo___trans_tmp_5 = __builtin_ia32_vcvttpd2udqs128_mask(
      _mm_cvtts_pd_epu32___A, _mm_undefined_si128(), -1);
}
$ clang -cc1 -triple i686-unknown-linux-android29 -Os -emit-obj a.cpp
clang: ../../llvm/lib/CodeGen/TargetInstrInfo.cpp:773: MachineInstr *llvm::TargetInstrInfo::foldMemoryOperand(MachineInstr &, ArrayRef<unsigned int>, int, LiveIntervals *, VirtRegMap *) const: Assertion `(!(Flags & MachineMemOperand::MOLoad) || NewMI->mayLoad()) && "Folded a use to a non-load!"' failed.

and no frontend errors/warnings

It crashes before this change: https://godbolt.org/z/dE8d9r5cW

phoebewang · 2025-10-01T15:09:14Z

I see the problem. It's a bug in the AVX10.2 implementation. I'll fix it then.

Fixes crashes reported in llvm#157034.

…162036) Fixes crashes reported in #157034.

…lvm#162036) Fixes crashes reported in llvm#157034.

phoebewang requested review from FreddyLeaf, KanRobert, RKSimon and e-kud September 5, 2025 07:26

Merge remote-tracking branch 'origin/main' into AVX10

026ddc6

RKSimon reviewed Sep 5, 2025

View reviewed changes

RKSimon approved these changes Sep 5, 2025

View reviewed changes

Fix flang failure

d956a79

llvmbot added flang:driver flang Flang issues not falling into any other category labels Sep 5, 2025

phoebewang enabled auto-merge (squash) September 5, 2025 13:30

phoebewang merged commit 94b164c into llvm:main Sep 5, 2025
11 of 12 checks passed

phoebewang deleted the AVX10 branch September 5, 2025 14:14

Andarwinux mentioned this pull request Sep 7, 2025

error: always_inline function '_mm_cvtts_ps_epi32' requires target feature 'avx10.2', but would be inlined into function 'ConvertInRangeTo' that is compiled without support for 'avx10.2' google/highway#2697

Closed

copybara-service bot mentioned this pull request Sep 23, 2025

Disable AVX10.2 as further fallout from llvm/llvm-project/pull/157034 google/highway#2726

Merged

phoebewang added a commit to phoebewang/llvm-project that referenced this pull request Oct 6, 2025

[X86][AVX512] Add missing mayLoad attribute to AVX512 instructions

520fe90

Fixes crashes reported in llvm#157034.

phoebewang mentioned this pull request Oct 6, 2025

[X86][AVX512] Add missing mayLoad attribute to AVX512 instructions #162036

Merged

phoebewang added a commit that referenced this pull request Oct 6, 2025

[X86][AVX512] Add missing mayLoad attribute to AVX512 instructions (#…

7185dd6

…162036) Fixes crashes reported in #157034.

aokblast pushed a commit to aokblast/llvm-project that referenced this pull request Oct 6, 2025

[X86][AVX512] Add missing mayLoad attribute to AVX512 instructions (l…

c149efa

…lvm#162036) Fixes crashes reported in llvm#157034.

[X86][AVX10] Remove EVEX512 and AVX10-256 implementations #157034

[X86][AVX10] Remove EVEX512 and AVX10-256 implementations #157034

Uh oh!

Conversation

phoebewang commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Sep 5, 2025

Uh oh!

github-actions bot commented Sep 5, 2025

Uh oh!

RKSimon left a comment

Choose a reason for hiding this comment

Uh oh!

phoebewang commented Sep 5, 2025

Uh oh!

RKSimon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jan-wassenberg commented Sep 17, 2025

Uh oh!

phoebewang commented Sep 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jan-wassenberg commented Sep 17, 2025

Uh oh!

aeubanks commented Sep 24, 2025

Uh oh!

phoebewang commented Sep 25, 2025

Uh oh!

alexfh commented Sep 25, 2025

Uh oh!

phoebewang commented Sep 25, 2025

Uh oh!

aeubanks commented Sep 30, 2025

Uh oh!

phoebewang commented Oct 1, 2025

Uh oh!

phoebewang commented Oct 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

phoebewang commented Sep 5, 2025 •

edited

Loading

llvmbot commented Sep 5, 2025 •

edited

Loading

phoebewang commented Sep 17, 2025 •

edited

Loading