Update LLVM version to 189d4711915f by konstantinschwarz · Pull Request #561 · Xilinx/llvm-aie

konstantinschwarz · 2025-07-21T19:25:13Z

No description provided.

…neType (#91422) Extend `DIBasicType` and `DISubroutineType` with additional field `annotations`, e.g. as below: ``` !5 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed, annotations: !6) !6 = !{!7} !7 = !{!"btf:type_tag", !"tag1"} ``` The field would be used by BPF backend to generate DWARF attributes corresponding to `btf_type_tag` type attributes, e.g.: ``` 0x00000029: DW_TAG_base_type DW_AT_name ("int") DW_AT_encoding (DW_ATE_signed) DW_AT_byte_size (0x04) 0x0000002d: DW_TAG_LLVM_annotation DW_AT_name ("btf:type_tag") DW_AT_const_value ("tag1") ``` Such DWARF entries would be used to generate BTF definitions by tools like [pahole](https://github.com/acmel/dwarves). Note: similar fields with similar purposes are already present in DIDerivedType and DICompositeType. Currently "btf_type_tag" attributes are represented in debug information as 'annotations' fields in DIDerivedType with DW_TAG_pointer_type tag. The annotation on a pointer corresponds to pointee having the attributes in the final BTF. The discussion in [thread](https://lore.kernel.org/bpf/87r0w9jjoq.fsf@oracle.com/) came to conclusion, that such annotations should apply to the annotated type itself. Hence the necessity to extend `DIBasicType` & `DISubroutineType` types with 'annotations' field to represent cases like below: ``` int __attribute__((btf_type_tag("foo"))) bar; ``` This was previously tracked as differential revision: https://reviews.llvm.org/D143966

@nikic

The DIBuilder C API was changed to deal with DbgRecord functions: llvm/llvm-project#84915 llvm/llvm-project#85657 llvm/llvm-project#92417 (comment) As discussed by @nikic and @OCHyams: llvm/llvm-project#92417 (comment) > For the intrinsic-inserting functions, do you think we should: > > a) mark as deprecated but otherwise leave them alone for now. > b) mark as deprecated and change their behaviour so that they > do nothing and return nullptr. > c) outright delete them (it sounds like you're voting this one, > but just wanted to double check). This patch implements option (c).

@zygoloid

…odifiable" (#95833) Reverts llvm/llvm-project#94159 @zygoloid had some concerns about the implementation of this, which make sense to me, so I’d like to revert this for now so we can have more time to think about how to best approach this (if at all...)

This change keeps existing behavior, namely that if we hit a Z3 timeout we will accept the report as "satisfiable". This prepares for the commit "Harden safeguards for Z3 query times". https://discourse.llvm.org/t/analyzer-rfc-taming-z3-query-times/79520 Reviewers: NagyDonat, haoNoQ, Xazax-hun, mikhailramalho, Szelethus Reviewed By: NagyDonat Pull Request: llvm/llvm-project#95128

This patch is a functional change. https://discourse.llvm.org/t/analyzer-rfc-taming-z3-query-times/79520 As a result of this patch, individual Z3 queries in refutation will be bound by 300ms. Every report equivalence class will be processed in at most 1 second. The heuristic should have only really marginal observable impact - except for the cases when we had big report eqclasses with long-running (15s) Z3 queries, where previously CSA effectively halted. After this patch, CSA will tackle such extreme cases as well. Reviewers: NagyDonat, haoNoQ, Xazax-hun, Szelethus, mikhailramalho Reviewed By: NagyDonat Pull Request: llvm/llvm-project#95129

For better dominance check inside the function.

The Legacy PM was deprecated for the optimization pipeline in LLVM 14 [1] (the support was removed altogether in the following release). This patch removes the original Hello example that was introduced to illustrate the Legacy PM. The Hello example no longer works and hence is deleted. The corresponding documentation is also removed. Note, Hello example for the new PM is located in * llvm/lib/Transforms/Utils/HelloWorld.cpp and documented in * WritingAnLLVMNewPMPass.rst. [1] https://releases.llvm.org/14.0.0/docs/ReleaseNotes.html#changes-to-the-llvm-ir

… for SVE (#95236) This patch adds support for legal SVE fromal arguments in IRTranslator, and support for COPY with SVE. SVE arguments are allowed only if the hidden option `-aarch64-enable-gisel-sve` is enabled. Illegal types and predicates like `nxv8i1` are not supported yet.

…(#93529) Extends bubble-up pack through reshape pattern to handle pack propagation through expand shape ops. --------- Co-authored-by: Prashant Kumar <pk5561@gmail.com>

/llvm-project/llvm/tools/llvm-c-test/llvm-c-test.h:39:24: error: a function declaration without a prototype is deprecated in all versions of C [-Werror,-Wstrict-prototypes] int llvm_test_dibuilder(); ^ void 1 error generated.

These produce garbage libcalls, so the result is not useful but this at least shows we don't assert.

…x (#95591) The global/flat/buffer atomic fmin/fmax situation is a mess. These instructions have been renamed 3 times. We currently have separate pseudos defined for the same opcodes with the different names (e.g. GLOBAL_ATOMIC_MIN_F64 from gfx90a and GLOBAL_ATOMIC_FMIN_X2 from gfx10). Use the _FMIN versions as the canonical name for the f32 versions. Use the _MIN_F64 style as the canonical name for the f64 case. This is because gfx90a has the most sensible names, but does not have the f32 versions.t sho Wire through the pseudo to use for the instruction properties vs. the assembly name like in other cases. This will simplify handling of direct atomicrmw selection. This will simplify directly selecting these from atomicrmw.

…#95730) See the comment: llvm/llvm-project#92083 (comment) After the patch, llvm/llvm-project#92083, the lower 32 bits of DeclID can be the same commonly. This may produce many collisions. It will be best to change the default hash implementation for uint64_t. But sent this one as a quick workaround. Feel free to update this if you prefer other hash functions.

These were required a long time ago due to `static_assert` not actually being available in C++03. Now `static_assert` is simply mapped to `_Static_assert` in C++03, making the additional parens unnecessary.

Differential Revision: https://reviews.llvm.org/D121213

This changes the `is_swappable` implementation to use variable templates first and basing the class templates on that. This avoids instantiating them when the `_v` versions are used, which are generally less resource intensive.

This is a follow up to #92037, which moved the architecture information. Generate the AArch64TargetParser CPUInfo from tablegen Processor defs using a new tablegen emitter. Some basic error checking is added in the emitter to ensure that duplicate features are not added to the Processor defs. The generic CPU becomes an entry in tablegen. Some CPU features which were present in the CPUInfo but absent from the tablegen defs have been added to tablegen. FeatureCrypto is replaced with FeatureSHA2 and FeatureAES. This changes a few of the tests.

…VLI. NFC SlotIndexes is already marked as a transitive requirement of LiveIntervals, so we don't need to specify it again in RISCVInsertVSETVLI.

For x86_64 callable functions, ARM64EC requires an entry thunk generated by the compiler. The linker interprets .hybmp sections to associate function chunks with their entry points and writes an offset to thunks preceding function section contents. Additionally, ICF needs to be aware of entry thunks to not consider chunks to be equal when they have different entry thunks, and GC needs to mark entry thunks together with function chunks. I used a new SectionChunkEC class instead of storing entry thunks in SectionChunk, following the guideline to keep SectionChunk as compact as possible. This way, there is no memory usage increase on non-EC targets.

This patch removes the TPIDR2 lazy-save object and buffer if no lazy save is required. --------- Co-authored-by: Samuel Tebbs <samuel.tebbs@arm.com>

These are incorrectly matching with signed offsets.

@vzakhari

The runtime API for copy-in copy-out currently only has an entry only for the copy-out. This entry has a "skipInit" boolean that is never set to false by lowering and it does not deal with the deallocation of the temporary. The generated code was a mix of inline code and runtime calls This is not a big deal, but this is unneeded compiler and generated code complexity. With assumed-rank, it is also more cumbersome to establish a temporary descriptor. Instead, this patch: - Adds a CopyInAssignment API that deals with establishing the temporary descriptor and does the copy. - Removes unused arg to CopyOutAssign, and pushes destruction/deallocation responsibility inside it. Note that this runtime API are still not responsible for deciding the need of copying-in and out. This is kept as a separate runtime call to IsContiguous, which is easier to inline/replace by inline code with the hope of removing the copy-in/out calls after user function inlining. @vzakhari has already shown that always inlining all the copy part increase Fortran compilation time due to loop optimization attempts for loops that are known to have little optimization profitability (the variable being copied from and to is not contiguous).

Pulled from #82290

Fixes some atomicrmw fadd and intrinsic cases

Because of the --debug-only flag.

Share the implementation with the current interpreter.

…055) This fixes llvm/llvm-project#93309, and seems to match how GNU ld handles this case.

There are only three actual uses of the section kind in MCSection: isText(), XCOFF, and WebAssembly. Store isText() in the MCSection, and store other info in the actual section variants where required. ELF and COFF flags also encode all relevant information, so for these two section variants, remove the SectionKind parameter entirely. This allows to remove the string switch (which is unnecessary and inaccurate) from createELFSectionImpl. This was introduced in [D133456](https://reviews.llvm.org/D133456), but apparently, it was never hit for non-writable sections anyway and the resulting kind was never used.

Reverts the behavior introduced by 770393b while keeping the refactored code. Fixes a miscompile on AArch64, at the cost of a small regression on AMDGPU. #96146 opened to investigate the issue

…r TableGen.

eddyz87 and others added 30 commits June 18, 2024 10:23

[Loads] Pass DominatorTree if available (#95752)

a6eddf9

For better dominance check inside the function.

[mlir][linalg] Add pattern to bubble-up pack through expand shape op …

a945f55

…(#93529) Extends bubble-up pack through reshape pattern to handle pack propagation through expand shape ops. --------- Co-authored-by: Prashant Kumar <pk5561@gmail.com>

[tools] Fix -Wstrict-prototypes in llvm-c-test.h (NFC)

9cd3622

/llvm-project/llvm/tools/llvm-c-test/llvm-c-test.h:39:24: error: a function declaration without a prototype is deprecated in all versions of C [-Werror,-Wstrict-prototypes] int llvm_test_dibuilder(); ^ void 1 error generated.

AMDGPU: Add some tests for i128 and fp128 atomic expansion

cf5ce8c

These produce garbage libcalls, so the result is not useful but this at least shows we don't assert.

[libc++][NFC] Remove unnecessary parens in static_asserts (#95605)

6b4b29f

These were required a long time ago due to `static_assert` not actually being available in C++03. Now `static_assert` is simply mapped to `_Static_assert` in C++03, making the additional parens unnecessary.

[libc++] Enable modernize-use-equals-delete (#93293)

bbe4a80

Differential Revision: https://reviews.llvm.org/D121213

[libc++] Refactor<__type_traits/is_swappable.h> (#86822)

cb41740

This changes the `is_swappable` implementation to use variable templates first and basing the class templates on that. This avoids instantiating them when the `_v` versions are used, which are generally less resource intensive.

[RISCV] Remove redundant dependency on SlotIndexes in RISCVInsertVSET…

aabf6df

…VLI. NFC SlotIndexes is already marked as a transitive requirement of LiveIntervals, so we don't need to specify it again in RISCVInsertVSETVLI.

RegPressure: Fix incorrect comment. NFC (#95896)

7ea203b

[AArch64][SME] Remove unused ZA lazy-save (#81648)

ca7dc89

This patch removes the TPIDR2 lazy-save object and buffer if no lazy save is required. --------- Co-authored-by: Samuel Tebbs <samuel.tebbs@arm.com>

AMDGPU: Add some addressing mode tests for flat atomic intrinsics

1d97f8f

These are incorrectly matching with signed offsets.

[X86] Add AVX1 v8i32 comparison test coverage

612fdf3

Pulled from #82290

[clang][NFC] Take const pointers in Sema::DiagnoseAutoDeductionFailure

20d3cab

AMDGPU: Flat instructions do not have signed offsets gfx7-gfx11 (#95852)

eda9ff8

Fixes some atomicrmw fadd and intrinsic cases

[gn] port 8b0d38b

66aa260

[bolt] stale-matching-min-matched-block.test requires asserts

69753aa

Because of the --debug-only flag.

[clang][Interp] Implement complex division (#94892)

4d7d45e

Share the implementation with the current interpreter.

[gn build] Port 89c26f6

9e63632

mstorsjo and others added 8 commits June 20, 2024 11:20

[LLD] [MinGW] Interpret an empty -entry option as no entry point (#96…

acf675b

…055) This fixes llvm/llvm-project#93309, and seems to match how GNU ld handles this case.

[MachineLICM] Work-around Incomplete RegUnits (#95926)

f0897ea

Reverts the behavior introduced by 770393b while keeping the refactored code. Fixes a miscompile on AArch64, at the cost of a small regression on AMDGPU. #96146 opened to investigate the issue

Merge commit '9d0754ada5db' into aie-public

4dcb419

fixup! [AIE] Add architecture-specific files

fc4b011

Merge commit '1737814e577f' into aie-public

689ed99

Merge commit '189d4711915f' into aie-public

828ac00

fixup! [TableGen] Support modelInaccessibleMemThroughRegs property fo…

e8e81fe

…r TableGen.

konstantinschwarz requested review from F-Stuckmann, SagarMaheshwari99, abhinay-anubola, abnikant, andcarminati, katerynamuts, khallouh, martien-de-jong, niwinanto and stephenneuendorffer as code owners July 21, 2025 19:25

Alexandre Isoard added 2 commits July 21, 2025 16:49

Merge commit 'f0897ea4bb2c' into aie-public

21ff17c

fixup! [CodeGen] Streamline use of PipelinerLoopInfo

d9dfa23

isoard-amd requested a review from philippjh as a code owner July 21, 2025 23:21

Alexandre Isoard added 2 commits July 21, 2025 17:34

fixup! Add two AsmBackend virtuals to get the right behaviour

10e5b08

fixup! [AIE] Add architecture-specific files

eba40e6

isoard-amd force-pushed the kschwarz.llvm.bump branch from 88e1b12 to e63eb3a Compare July 21, 2025 23:34

Alexandre Isoard added 3 commits July 21, 2025 17:43

fixup! [AIEX] Support simple TCO

5be1a60

fixup! [AIEX] MBB Placement tweaks

a75baf8

fixup! [AIE] Add architecture-specific files

970f692

isoard-amd force-pushed the kschwarz.llvm.bump branch from e63eb3a to 970f692 Compare July 21, 2025 23:46

konstantinschwarz merged commit 1dd2103 into aie-public Jul 22, 2025
16 of 17 checks passed

konstantinschwarz deleted the kschwarz.llvm.bump branch July 22, 2025 02:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update LLVM version to 189d4711915f#561

Update LLVM version to 189d4711915f#561
konstantinschwarz merged 1119 commits intoaie-publicfrom
kschwarz.llvm.bump

konstantinschwarz commented Jul 21, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

Conversation

konstantinschwarz commented Jul 21, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants