Skip to content

Conversation

@dhernandez0
Copy link
Contributor

@dhernandez0 dhernandez0 commented Nov 10, 2025

Motivation

Upstream merge pointing to this commit: ROCm/llvm-project@73f26e2

Technical Details

Changes in upstream merge:

External LIT Tests

  • check-llvm
  • check-mlir

Jenkins Internal CI

  • Weekly (parameterSweeps + Tuning)
    • Link:
  • Nightly CI
    • Link:
  • PR CI

MIGraphX CI

Performance

  • Compare tuning runtime on CI with one of the weekly runs to see if tuning time has regressed or not.

Navi2X

  • parameterSweeps (manual running)
  • Nightly E2E tests (w/ fixed data)
  • Nightly E2E tests (w/ random data)
  • PR E2E tests

List of files that are different

Diff files for review

Open these files in notepad++ or any DiffViewer with "diff" language
diff -rup upstream/llvm-project/llvm rocMLIR/external/llvm-project/llvm &> llvm_diffs.txt
llvm_diffs.txt

diff -rup upstream/llvm-project/mlir rocMLIR/external/llvm-project/mlir &> mlir_diffs.txt
mlir_diffs.txt

Submission Checklist

…ae60

73f26e2fae60 merge main into amd-staging (#544)
c44e5784fa53 merge main into amd-staging (#545)
0fadc2184ded merge main into amd-staging
0d786b9a207a [clang-tidy][NFC] Enable `performance-unnecessary-value-param` in the codebase (#163686)
4d88bb6c6303 [Hexagon] Implement isMaskAndCmp0FoldingBeneficial (#166891)
cf1f871023e4 [Flang] Add parser support for prefetch directive (#139702)
7f18465adc24 merge main into amd-staging
b67796f3fca1 [llvm][RISCV] Support Zvfbfa codegen for fneg, fabs and copysign (#166944)
f734cebc396b [LoongArch] Initial implementation for `enableMemCmpExpansion` hook (#166526)
0c4c05e09a69 merge main into amd-staging (#543)
b3d62645158c Revert "[Github] Update PR labeller to v6.0.1 (#167246)"
6c02bcb24465 [Github] Make Windows container use zstd (#167022)
10da6ab53621 [Github] Update PR labeller to v6.0.1 (#167246)
fa0f2d2caf9b [mlir][tosa] Fix crash in `tosa.concat` verifier (#165966)
5ba0c7c12674 [LoongArch][NFC] Pre-commit tests for memcmp expansion (#166718)
836919bb3449 [clang-format] Fix a crash in AlignArrayOfStructures (#167099)
f1e4266bbcd6 merge main into amd-staging
bf5042591ab3 merge main into amd-staging (#542)
d406c15fc8fc [VPlan] Use VPInstructionWithType for casts in VPlan0. (NFC)
700b77b5e5ca [InstCombine] Don't sink if it would require dropping deref assumptions. (#166945)
63fe60f830ec [gn build] Port f2c50f930540
f2c50f930540 [BOLT] Support restartable sequences in tcmalloc (#167195)
4023beb09e48 Fix typo in LangRef.md regarding regions (#167242)
c10b25a450fd [lldb] Disable TestLocationsAfterRebuild for remote targets (#167239)
24525f1ed645 merge main into amd-staging
c8f168c2c933 [SandboxIR] Remove tight-coupling with LLVM's SwitchInst::CaseHandle (#167093)
36e9a0b3a1b5 [AArch64][GlobalISel] Correct instructions for 64bit fneg constant vectors. (#166537)
d4b41b9de091 [mlir] Consolidate two implementations of meet (NFC) (#167208)
6de4f0681321 [clang] Remove redundant typename (NFC) (#167207)
d93982347bd6 [Target] Fix misleading indentation (NFC) (#167206)
3d823700aa4c [mlir] Use llvm::transform (NFC) (#167205)
6ef32188b5a1 [SPIRV] Add support for `bfloat16` atomics via the `SPV_INTEL_16bit_atomics` extension (#166257)
c3b31ba19c2d [clang-tidy] Fix `readability-container-data-pointer` check (#165636)
2095ea5b407f Remove unused <set> and <map> inclusion (#167175)
04b05998b167 Remove unused <array> and <list> inclusion (#167116)
d8f3202b34af merge main into amd-staging (#540)
0367711f3f3a [gn build] Port 8d950d27d686
eef52257626f [gn build] Port 00eacc29f009
d858aad5d1e0 [NFCI][lldb][test][Recognizer] Fix mismatched C/C++ frontend subtitutions (#167220)
6616f0750e61 [gn] port c940bfd7e621 (BPF SDNodeInfo)
cc3a505b65d8 [SelectionDAG] Fix assertion failure on inline asm register type mismatch (#166615)
b15e220985fc [tools][llc] Fix save-stats.ll require aarch64 target (#167218)
96a52893ca40 [tools][llc] Add `--save-stats` option (#163967)
32f9cce5dc7e merge main into amd-staging
93d445cba39f [PostRASink] Add target hook shouldPostRASink (#167182)
8cc49fb99a47 Revert "Reapply "[compiler-rt] Default to Lit's Internal Shell""
1b3eaacb9da5 [llvm] Remove unused local variables (NFC) (#167185)
e736c5c03e6a merge main into amd-staging (#539)
5361e10276d9 [LifetimeSafety] Use StringMap::contains (NFC) (#167186)
3ce5df408baf [Vectorize] Remove a redundant declaration (NFC) (#167188)
64ad5d976dbd [Offload] Remove unused KernelArgsTy instantiation (#167197)
da0afffe3766 merge main into amd-staging
4b6a59723002 Reapply "[compiler-rt] Default to Lit's Internal Shell"
3a8894d7aff1 [libc++][numeric] Marked saturation artithmetic functions as `[[nodiscard]]` (#166898)
32405811bb1a [gn] port ebb61a5bea (llvm-cas)
8628ff3ec0bf [ObjectYAML] Remove extraneous .c_str() (NFC) (#167189)
188f109490bd [mlir] Remove duplicate includes (NFC) (#167187)
3bb903e3c0af [libc++] Treat P0513R0 as a defect report against C++11 (#166690)
91821ba86607 [MLIR] Fix FileCheck annotations and add 32-bit coverage (#167149)
176dd5bde109 merge main into amd-staging (#538)
0246f331d443 [BinaryFromat] Simplify subscript operators (NFC) (#167170)
7b1a74cd7910 [BOLT] Use DenseMap::contains (NFC) (#167169)
0fad7d7494df [llvm] Use static_assert on getEmptyKey and getTombstoneKey (NFC) (#167167)
e61a51d0d620 [llvm] Use llvm::find_if and llvm::is_contained (NFC) (#167166)
3b219cf42ac4 [LV] Add register pressure test for #164124.
0902a6b8ded4 Add missing #include (fix for  #166997)
15e3b49cb822 [llvm] Allow Rust personality name to contain arbitrary prefix (#166095)
02976f5ffa20 [TableGen] Use "using" instead of "typedef" (NFC) (#167168)
26bb121dfe0b Remove unused <vector> inclusion (#166997)
3673cc7a4222 [llvm-rc] Don't interpret integer literals as octal numbers in rc.exe mode (#166915)
afc83688cfad [libc++abi][libunwind] Enable AT&T syntax explicitly (#166818)
98a4dbdc3b70 merge main into amd-staging
c500c6ad94dc merge main into amd-staging (#537)
825943e06f84 [libc++] Run macOS buildbot under xcrun (#167072)
070f3310ccfd [Driver] Remove extraneous c_str() (NFC) (#167156)
6b42c913fa80 [WebAssembly] Clean up CustomMappingTraits<BBNumberMap> (NFC) (#167155)
1e1874735326 [ObjectYAML] Remove redundant .str().c_str() (NFC) (#167154)
ae1622a48a03 [NVPTX] Remove redundant declarations (NFC) (#167153)
c06f864558c6 [ExecutionEngine] Use StringRef::starts_with (NFC) (#167152)
18b8e3bfb108 [ADT] Add a missing std::move to StringSwitch::EndsWithLower (#167151)
cbfc053a8b83 merge main into amd-staging
e1da51831d15 [Offload] Remove handling for device memory pool (#163629) (#533)
ace77c25a492 [clang-tidy][NFC] Fix misc-const-correctness warnings (9/N) (#167124)
5896a25ffe53 [clang-tidy][NFC] Fix misc-const-correctness warnings (13/N) (#167130)
385dbc12c459 [clang-tidy][NFC] Fix misc-const-correctness warnings (12/N) (#167129)
545c3022d281 [clang-tidy][NFC] Fix misc-const-correctness warnings (8/N) (#167123)
01bea27b7327 [clang-tidy][NFC] Fix misc-const-correctness warnings (10/N) (#167127)
ce7f9f9ccde9 [llvm] Proofread *.rst (#167108)
2844d867340a [mlir] Remove unused local variables (NFC) (#167107)
0028ef667a87 [llvm] Remove unused local variables (NFC) (#167106)
ee0652b4da98 [flang] Remove unused local variables (NFC) (#167105)
d838ca21cade [clang-doc] Remove an unused local variable (NFC) (#167104)
6313830d0714 Fix missing include from #166664
c6ffc93130cb [clang-tidy][NFC] Fix misc-const-correctness warnings (14/N) (#167131)
6deb50dc7c7f [clang-tidy][NFC] Fix misc-const-correctness warnings (11/N) (#167128)
fa98c8dfde45 Fix bazel build for #166719
8468f7931281 merge main into amd-staging (#536)
b9ea93cd5c37 [InstCombine] Fold operation into select, when one operand is zext of select's condition (#166816)
21c1b7845b05 fix: C++ empty record with align lead to va_list out of sync (#72197)
a0e222f7c7bc [SimplifyCFG] Simplify uncond br with icmp & select (#165580)
ffb5831fceac [libc] add various macros relate to *ADDR* (#164830)
51856987cde3 merge main into amd-staging
3226a5f982a3 [Headers][X86] avx512cd - move constexpr to the end of the function attribute lists. NFC. (#166968)
8f2b167de4a1 [clang-tidy][NFC] Remove trailing whitespaces in documentation (#167103)
c8ab3b7e5093 [clang-tidy][NFC] Run clang-format-22 over clang-tidy (#167122)
2af0cfb879e3 merge main into amd-staging (#534)
577b5194bf7a [clang-tidy] Fix `bugprone-exception-escape` not diagnosing throws in argument lists (#165955)
8d950d27d686 [clang-tidy][NFC] Add missing "Check" suffix to filenames in clang-tidy checks (#166889)
af456dfa111b [BOLT] Refactor tracking internals of BinaryFunction. NFCI (#167074)
77342761a906 benchmarks: Fix sample_symbol_list.txt generation again (#167078)
0875755f5275 [CI] Drop  use of install-ninja action (#167068)
6c12623d621e [clang][bytecode] Fix a std::optional<bool> mishap (#167091)
b4b57adb89b7 [AMDGPU][MachineVerifier] test failures in SIFoldOperands (#166600)
de4aa9cdeab0 AMDGPU: Minor SDWA pass cleanups (#166629)
2a3ef056fdd9 Fix issues and re-enable tests disabled in PR #165983 (from PR #165360) (#166147)
6fac21e4049c [clang][bytecode] Avoid copies with elidable CXXConstructExprs (#166931)
260479f69c09 merge main into amd-staging
62e52d44937f merge main into amd-staging (#532)
82b921628b97 Fix UB introduced by be0aa7b6c72bdb162f1f3fe251e469927118963e.
7af2b56dd539 [BOLT] Refactor undefined symbols handling. NFCI (#167075)
ad7488a94155 [X86][clang-cl] Add CL option /vlen (#166375)
05a25fcacbae merge main into amd-staging (#530)
77a194cb09e2 [NFC][CodeGen] Replace loop with "if !empty()" (#166515)
37e7748443f4 merge main into amd-staging
baf41d214349 [GitHub][CI] Fix tooling container names (#167052)
a4b68b8ed516 [compiler-rt][HWAsan] Partially revert to fix test
a9301d872294 [clang-tidy][NFC] Fix misc-const-correctness warnings (7/N) (#167058)
893387ab30e4 [Flang][OpenMP] Unify MapInfoFinalization's BoxChar handling with other Box types (#165954)
6c84dd7b1c03 [lld][WebAssembly] Default to --stack-first (#166998)
0ecbd4e26b92 [clang-tidy][NFC] Temporary disable `llvm-header-filter` check (#167021)
d5b62fabd17e [lldb][ClangModulesDeclVendor] Print Clang module loading errors to expression log instead of console (#166964)
8552a8fbffd0 [DependencyScanning] Fix warnings
6aaa9cbcedcd [clang-tidy][NFC] Fix misc-const-correctness warnings (6/N) (#167056)
e3a6792f767b [clang-tidy][NFC] Use llvm::DenseMap (#167050)
2cb72a2fa6ec merge main into amd-staging
accec8b92d34 [lld][macho] Move unwind logic from equalsVariable to equalsConstant (#165325)
9decb102d9a1 [prepare-code-coverage-artifact.py] Change run line to use python3
70b86ab2177f [CAS] Fix msan error in ObjectStore::importObject() (#167051)
cbce49725a16 Manual update of LLVM_MAIN_REVISION to 558610 (#528)
4ecb3c78359c [CodeGen][NFC] Remove one meaningless `equal_to` specialization (#166115)
1b861cb9ccfa [clang-tidy][NFC] Fix misc-const-correctness warnings (5/N) (#167047)
0c18999e7d25 [CAS] Restrict the CAS size when running lit testing (#167024)
c3740802b6e3 [clang-tidy][NFC] Fix misc-const-correctness warnings (4/N) (#167042)
74e34eff3feb [lldb][ClangModulesDeclVendor] Don't stop loading Clang modules if an individual import failed (#166940)
bbc4a450c6f3 [clang-tidy][NFC] Fix misc-const-correctness warnings (3/N) (#167040)
bc2784376141 Reapply "[compiler-rt][HWAsan] Remove CHECK lines from test" (#167038)
913849ecc036 [clang-tidy][NFC] Fix misc-const-correctness warnings (2/N) (#167035)
11efce4605e6 [clang-tidy][NFC] Fix misc-const-correctness warnings (1/N) (#167030)
7f55f264ec04 [lldb][ClangModulesDeclVendor] Revamp error handling when loading Clang modules (#166917)
397701f3a0a0 [SystemZ] Mark RELOC_NONE as not having size for SystemZ (#167027)
6d516c6e28a1 [llvm][RISCV] Do not assume V extension on seeing vector type. (#166994)
53b64b0208b9 [DebugInfo] Adjusting DWARF version to fix a test failure in aix-ppc64 pipeline (#167019)
afb84ca157e1 [clang-doc] revert to a nested directory structure (#166978)
01caad049cff [fix test] Move typedefs-with-same-name.test into ARM folder (#167005)
c0e4bced616c Revert "[CI] Make premerge_advisor_explain write comments"
64ff52a5e954 [OpenMP][Clang] Add parser/semantic support for dyn_groupprivate clause (#152651)
3511388013c7 [lldb][docs] Use section_iter() to iterate over sections (#167012)
1b2eee55a7fd [CIR] Fix failing lit test (statement-exprs.c) (#167000)
a7382f12c197 [clang-doc] remove indentation for preformatted text (#166672)
f00d353c5d35 [flang][cuda][NFC] Simplify thread fence lowering (#167009)
682c8e22e61f [Hexagon] Add missing build attributes (#166992)
8a6d5f68e48f [VPlan] Update more VPRecipeBuilder members to take VPInst directly (NFC)
be0aa7b6c72b [clang][DependencyScanning] Implementation of `CompilerInstanceWithContext` to Improve By-Name Queries (#164345)
1f8d5d46df48 [CI] Make premerge_advisor_explain write comments
ec21e58a30b2 Revert "[CIR] Recognize constant aggregate initialization of auto vars (#166850)"
8e8e46aeb608 [compiler-rt][sanitizers] Mark three tests as unsupported on Android
9a9e3e3b9475 [clang-tidy][NFC] Update header guards to match LLVM style (#166669)
00eacc29f009 [clang-tidy] Rename and move 'cert-oop58-cpp' to 'bugprone-copy-constructor-mutates-argument' (#164566)
17ad8480f8e6 [VPlan] Convert redundant isSingleScalar check into assert (NFC).
973b173503e8 [AMDGPU] Stop optimising readfirstlane in pass AMDGPUUniformIntrinsicCombine (#166955)
873b8d502a8a [flang][cuda][NFC] Use NVVM operation for thread syncs (#166999)
c21cd52fab90 Fix a crash when a stop hook deletes itself in its callback. (#160416)
ec620bf61588 [clang][CIR] Port test from d9c7c76
5fc1b74af520 [CIR] Recognize constant aggregate initialization of auto vars (#166850)
7568a99c9f68 [dsymutil] Fix parallel linker's self-recursive typedef DIE by including referred-to types into synthetic name (#166767)
92e240483e33 [MLIR][XeVM] Update XeVM prefetch ops. (#166445)
8506d8afed77 [Comgr] Add AMD_COMGR_ACTION_COMPILE_SOURCE_TO_SPIRV action (#521)
90735affd3e6 merge main into amd-staging (#523)
2dd77050d4c3 [libc] add cpu feature flags for SVE/SVE2/MOPS (#166884)
c6969e578a31 [Github][Bazel] Add Workflow to Run Bazel Build (#165071)
b4d7d3f745aa [mlir][NVVM] Add nvvm.membar operation (#166698)
67198d1997cf [libc]  Fix wrapper headers for `at_quick_exit` on GLIBC for C++11 (#166960)
50ba89a22e45 [VectorCombine] support mismatching extract/insert indices for foldInsExtFNeg (#126408)
b78f6fca38f3 [AMDGPU][NFC] Pre-commit shlN_add test results with sdag (#166636)
ebb61a5beaa2 [CAS] Add llvm-cas tools to inspect on-disk LLVMCAS (#166481)
917d815d4ee4 AMDGPU: Preliminary documentation for named barriers (#165502)
4637bf0c76bd [NFC][AMDGPU][GISel] Precommit GlobalISel specific tests for call instruction (#165898)
9e341b36ed69 [DA] Properly pass outermost loop to monotonicity checker (#166928)
cce1055e4803 [lldb] Correctly detach (rather than kill) when connecting with gdb-remote (#166869)
37403685298b [mlir][arith] Fix `arith.select` lowering after #166513 (#166692)
9cca883dd031 Revert "[NFCI][lldb][test] Avoid unnecessary GNU extension for assembly call" (#166970)
03d8184d65f1 [flang][NFC] Strip trailing whitespace from tests (1 of N)
626cbf70f1eb [X86] isGuaranteedNotToBeUndefOrPoison - add simple target shuffles with known test coverage (#161553)
a3b5b4bd79cf [clang] Proofread *.rst (#166897)
d606bb1667ac [OpenMP] Print loaded envar when debug flag is enabled (#520)
f55b393ea038 [clang][CIR] Fix build. NFC
9857791c4421 [bazel] Add mlir/utils/generate-test-checks.py to bazel overlay (#160693)
f090dd15a10e [libc++][test] Fix-up tests for `is_clock(_v)` (#166888)
1baf7dbed208 [flang][runtime] Allow some list-directed child output to advance (#166847)
3d0ae1e78a7f [flang] Improve warning text (#166407)
b3b4ea18ac94 [flang] Explicit interface externals are constant expressions (#166181)
093f947202a3 [CAS] Fix wrong usage of `llvm::sort()` in UnifiedOnDiskCache (#166963)
a7c0e78fa1e5 [Flang][OpenMP] Unify MapInfoFinalization's BoxChar handling with other Box types (#165954)
80a533283923 [mlir] Remove redundant declarations (NFC) (#166896)
de4d95324656 [Demangle] Remove redundant declarations (NFC) (#166895)
563ea299329a [clang-tools-extra] Remove redundant declarations (NFC) (#166894)
bddab8359e9a [BOLT] Remove redundant declarations (NFC) (#166893)
70f4b596cf45 Add `llvm.vector.partial.reduce.fadd` intrinsic (#159776)
411ea8e9dd69 [PowerPC] Lowering support for EVL type VP_LOAD/VP_STORE (#165910)
67d0f181f469 [AMDGPU] Delete redundant s_or_b32 (#165261)
7377ac037dc5 [AArch64][llvm] Add support for Neon vmmlaq_{f16,f32}_mf8_fpm intrinsics (#165431)
435da1c6aee6 merge main into amd-staging
bcb1b773f641 [clang-format] Add option to separate comment alignment between ... (#165033)
21aa788ae0b1 [AArch64][CostModel] Replace undef with poison in sve-arith-fp.ll (NFC) (#166930)
9a8781b86f06 [AArch64][llvm] Add support for new vcvt* intrinsics (#163572)
c29e12733733 merge main into amd-staging (#522)
ac047f2bd204 [InstCombnine] Add test for sinking with dereferneceable assumes.
050339b94a2e [Clang] Fix comment typo in BuiltinTargetFeatures.h
d05e6628e707 [AMDGPU] Handle empty-except-for-DI regions in PreRARematerialize (#516)
037fd3056294 Revert "[NVGPU] Fix nvdsl examples" (#166943)
d9c7c762695e Revert "Ignore trailing NullStmts in StmtExprs for GCC compatibility." (#166036)
d78e0ded5215 [MLIR][Transform][Python] Sync derived classes and their wrappers (#166871)
3ee2f07e17f0 [VPlan] Support multiple F(Max|Min)Num reductions. (#161735)
281e3844f649 [mlir] Use LDBG to replace LLVM_DEBUG in IntegerRelation.cpp (NFC) (#166772)
311d115ed809 [LLDB] Run MSVC STL string(-view) tests with PDB (#166833)
7ac6a95a11c2 [MLIR][Pygments] Refine the pygments MLIR lexer (#166406)
cdc3cb20543b [LoongArch] Add `isSafeToMove` hook to prevent unsafe instruction motion (#163725)
3719c438dc4a [X86] Add some initial add i64 test coverage for #142308 (#166929)
3c81587f6a55 [OpenMP] Add definitions for DECLARE_INDUCTION and related clauses (#166235)
831cb18d3bef merge main into amd-staging
c2fe1d94eedb [X86][Clang] VectorExprEvaluator::VisitCallExpr / InterpretBuiltin - add AVX512 KTEST/KORTEST intrinsics to be used in constexpr (#166103)
d07a4fe12a85 [bazel][mlir] Fix transform_xegpu_ext.py test for bazel (#166924)
bcb29ee44421 add to evar table OMPX_XteamBlockSize and OMPX_XTeamReductionOccupanc… (#505)
299df7ed256f [NVGPU] Fix nvdsl examples (#156830)
0ade2604f2e1 [DAG] visitBITCAST - fold (bitcast (freeze (load x))) -> (freeze (load (bitcast*)x)) (#164618)
9b114c5d9aa5 [libc++] Optimize fstream::read (#165223)
33e21510c1f3 Fix bazel build issue I introduced in #166918 (#166921)
69dbf376b100 Fix bazel build issue from #165615 (#166918)
a96ad136f6f6 [clang][bytecode] Dummy variables can have pointers to them (#166908)
9a8203df3f7a [mlir][spirv] Add validation for loop and struct tests (#166803)
7219b1e8a9f7 [mlir][spirv] Enable validation of selection and phi tests (#166794)
3a8f6979cef2 [libc][math] Enable math acos for baremetal Arm and AArch64 (#166749)
3c62eadffd69 [flang][OpenACC] lower acc loops with early exits (#164992)
3149c7c5184c Reapply "[lldb-dap] Use protocol types for exceptioninfo" (#166161) (#166836)
06b35296eb4f [flang] remove bogus header include from #164988 (#166906)
3e90ecaa2fe2 [ValueTracking] Refine known bits for linear interpolation patterns (#166378)
a04ceb02ad31 [libclang/python] Return actual C types from libclang functions instead of Python bool (#166446)
4508f44af752 [libunwind] Add CMake option to enable execute-only code generation on AArch64 (#140554)
9d18e92ee78c [compiler-rt] Add CMake option to enable execute-only code generation on AArch64 (#140555)
927092b262e5 [Runtimes][CMake] Add CMake option to enable execute-only code generation on AArch64 (#143698)
bac427a0f634 [VPlan] Remove no-longer-needed EVL VPlan debug output tests. NFC (#166158)
a257a063c6fd [mlir][linalg-transform] dyn_cast DestinationStyleOpInterface and early return (#166299)
1a34007f5f86 [VPlan] Inline WidenSelect::isInvariantCond (NFC) (#166742)
54c9ddddd1da [libcxxabi][ItaniumDemangle] Separate GtIsGt counter into more states (#166578)
fc5e0c071bfb [libc++] Simplify most of the segmented iterator optimizations (#164797)
c8adbd7a8b46 [orc-rt] Add endian_read/write operations. (#166892)
3aa7a2426357 [NFCI][lldb][test] Avoid unnecessary GNU extension for assembly call (#166769)
6145b9d1925a [RISCV] Support outlining of CFI instructions in the machine outliner (#166149)
856ef9605b23 [RISCV] Optimize (and (icmp x, 0, neq), (icmp y, 0, neq)) utilizing zicond extension
a7bf45a914f6 [bazel] Add missing deps for AlignmentAttrInterface.h (#166899)
77b9301ad904 AArch64: support extended spills in SEH on WoS (#166849)
40c89e5be4c1 [X86][NewPM] Add New Pass Manager wiring for x86-avoid-trailing-call (#166723)
28fdda6ae1ba [RISCV] Use SLLI.UW in double-SHL_ADD multiplications (#166728)
ee6b59dadd75 merge main into amd-staging (#519)
bf1b86698b41 [libc] Add localtime_r to baremetal entrypoints (#166677)
faa3ea59f596 remove LIT tests for now
41825fbf0e86 [Option] Fix simple subcommand with positional arguments (#166859)
a4c5deddc3a9 [Comgr] Add AMD_COMGR_ACTION_COMPILE_SOURCE_TO_SPIRV action
c9b416999781 [LifetimeSafety] Optimize fact storage with IDs and vector-based lookup (#165963)
5314d99d3052 Use `llvm::unique_function` in the async APIs (#166727)
f8e9b89ae07a [CodeGen] Fix a warning
630f43a305b9 [RISCV] Move remat.ll test from riscv32 to riscv64. NFC
ca0866bf6dce [LLDB][Editline] empty current line before `el_wgets` (#165830)
0e46b4189031 [Github] Use zstd to compress CI container images (#166879)
fca5d45d32de [SelectionDAG] Make SelectionDAG::dump(true) usable from debugger (#166722)
f29955a594ae [Clang][Sema] Allow counted_by on void* as GNU extension (#164737)
c88e207bf16f [LifetimeSafety] Fix typo which breaks the test
8fca65c65e3a [LifetimeSafety] Optimize loan propagation by separating persistent and block-local origins (#165789)
f2857c2aded1 [clang-tidy] Fix `readability-use-concise-preprocessor-directives` check (#166000)
97fe5f795a7b [mlir][linalg] Remove redundant implicit terminator trait(NFC) (#166298)
c838eddf8cd2 merge main into amd-staging
7463efe62e93 Revert "[Offload] Remove handling for device memory pool (#163629)"
c85be7935a13 merge main into amd-staging
ef4de122faf1 merge main into amd-staging (#517)
7e9db961f834 [bazel] Fix compilation for AlignmentAttrInterface and BPF. (#166872)
83d60778c8dd [Github] Revert labeller update in new PRs workflow
16ca2eb77b2c [NFC][CUDA][HIP] Print the triple when there's no mcpu (#166565)
995b0f1883b2 [SPIRV] Handle `inttoptr` constant expressions in global initialisers (#166494)
32ebf635c2be [LLDB] Fix debuginfo ELF files overwriting Unified Section List (#166635)
2fd3bf36806b [Github] Update GHA Dependencies (major) (#161108)
e30dc12640a2 [NFC][Github] Move Container Composite Workflows to .github/actions (#166864)
8b422006af02 [mlir][ods] Enable basic string interpolation in constraint summary. (#153603)
8d9e2e7e6ada merge main into amd-staging
004cfea449b2 [AMDGPU] Handle empty-except-for-DI regions in PreRARematerialize
fa83723bbe55 [debugserver] Remove unnecessary sleep in MachProcess::AttachForDebug (#166674)
4b4bfe9ceb48 [flang][acc] Add missing FIRAnalysis dependency (#166853)
2c0e4e775c0f RuntimeLibcalls: Remove LLVM_ABI from private member variable declarations (#166776)
3ad5765e2341 [LV] Check all users of partial reductions in chain have same scale. (#162822)
c940bfd7e621 [BPF] TableGen-erate SDNode descriptions (#166499)
94a52cbdae28 [CI] Remove Disabled Warning Set on Windows (#166838)
bb6d2bea6495 [Clang] fix confusing diagnostics for lambdas with init-captures inside braced initializers (#166180)
6adf99338832 [libc] Disable overflow test in strfromtest on riscv32 (#166719)
aa1b1dc3914d [lldb] Add function to tell if a section is a GOT section (#165936)
6c040e1a31e5 [device-libs][comgr] - Add gfx1250 and gfx1251 support (#513)
e0822202a8ce [GitHub][CI] Factor out duplicate container building code into composite actions (#166663)
fce58897ce82 [lldb] Enable locate module callback for all module loading (#160199)
41c78ccbffbd merge main into amd-staging (#514)
c12cb2892c80 [flang][acc] Add infrastructure and tests for ACCImplicitData (#166797)
67eb691fc5d9 [gn build] Port adc79324618f
adc79324618f [libcxx] Implement C++20 std::chrono::is_clock, std::chrono::is_clock_v (#160607)
a97a3d9d430f CODEOWNERS: Update usernames for AMD-owned projects (#511)
e33098555132 [sanitizer-common] [Darwin] Fix overlapping dyld segment addresses (#166005)
f410c9771262 [CI] Remove Comment about Windows Compile Jobs Limit
e5ba3c6cad99 [MLIR][Python] Update Nanobind Warnings List for clang-cl on Windows
ad723f940980 [LLVM] Check if LLVM_ON_UNIX is Defined
8d0df57340be [Hexagon] Improve QFP Optimizer (#166647)
316236b1c05a [Clang] fix false-positive lambda shadow diagnostics in explicit object member functions (#165919)
4cd17eeaeb13 [lldb/Interpreter] Implement ScriptedFrameProvider{,Python}Interface (#166662)
165563cf20f2 [gn build] Port 71cb0bb8932e
ef93c42bc046 [Comgr] Fix memory leak in test. (#425)
c9f1d909ad7f [Comgr] Fix memory leak in comgr for TranslatedSpirvT (#440)
71550ffb8164 [GitHub][CI] Move PATH setting into base image for tooling containers (#166826)
d2f75f2fe326 [clang] SFINAE context refactor (#164703)
f1bf0b02ae25 [lldb] Make it so not finding and SDK doesn't look like an error (#166676)
71cb0bb8932e [lldb/Target] Add SyntheticFrameProvider class (#166664)
4a7d3dfca0ef [mlir] Introduce AlignmentAttrOpInterface to expose MaybeAlign (#161440)
b82c7e7819b3 [C2y] Claim conformance to WG14 N3525 (#166824)
96806a7ec35f [SLP]Gather copyable node, if its parent is copyable, but this node is still used outside of the block only
4c605e912449 [HLSL] [DirectX] Invert the result of `firstbithigh` (#166419)
c145f2844b19 AMDGPU: Replace some undef uses in tests (#166813)
e341a9f47f31 [Github] Exclude Renovate from Updating OS Versions in GHA (#166811)
ecddaaeb3e3f [DirectX] Remove llvm.assume intrinsic (#166697)
aaddd8d38aa0 [OpenMP] Fix tests relying on the heap size variable
de2a86e5f0bd [LLDB] Run working STL data formatter tests with PDB (#166812)
321de63633be [VPlan] Unify casting unit testing (NFC).
2be5421da9aa [CodeGen] Add missing header guard to LibcallLoweringInfo.h (#166815)
732c7255bf0c [X86] narrowBitOpRMW - add additional uses of the StoredVal back to the DAG worklist (#166819)
0ca7d57d745e [NFCI][lldb][test] Enable GNU POSIX extensions where necessary (#166768)
f84c4c468353 [flang][Driver] Better error message when multiple actions are specified (#165575)
3c31cde97998 CodeGen: Handle bundled instructions in two-address-instructions pass (#166212)
f20619c610f5 [RISCV] More explicitly check that combineOp_VLToVWOp_VL removes the extends it is supposed to. (#166710)
52e8f3c97bf8 [DA] Check for overflow in strong SIV test (#166223)
f55b55c2a110 [CMake][Fuchsia] Build libclang_rt.builtins for arm-fuchsia (#166686)
948d39bfd683 [RISCV] Update SpacemiT-X60  vector reduction operations latencies (#152737)
509ee6baa6f4 [PatternMatch] Fix matching order for `m_c_Intrinsic` (#166047)
6fce53af846c [BOLT][AArch64] Skip as many zeros as possible in padding validation (#166467)
3be825053233 [clang-repl] Fixing vulnerabilities with respect to orc runtime (#165852)
5af27f8c208b [InstrProf] Fix frontend generated function hash (#165358)
d1387ed2729c CodeGen: More accurate mayAlias for instructions with multiple MMOs (#166211)
70f5fd47a497 [flang][debug] Add debug type support for procedure pointers (#166764)
210b9a58f68b [Github] Update GitHub Artifact Actions (major) (#166112)
50daf4d6005e Add @llvm.reloc.none intrinsic to LLVM release notes (#166805)
75c09b792433 [DirectX] Let data scalarizer pass account for sub-types when updating GEP type (#166200)
83930beb8d01 [CI] Ensure compatibility with Python 3.8
6ac458527d88 [clang][AST] Do not try to handle irrelevant cases in writeBareSourceLocation (#166588)
ba4abc61a1f5 [Support] Fix up cast function object definitions. NFC. (#166789)
fcc50fb091b7 [device-libs][comgr] - Add gfx1250 and gfx1251 support
5f08fb4d72f6 [IR] llvm.reloc.none intrinsic for no-op symbol references (#147427)
71927ddb63ac [CodeGen] Delete two ComputeValueVTs overloads (NFC) (#166758)
912e2653af8a merge main into amd-staging
bda72894bb5d [lldb][docs][NFC] Fix ClangModulesDeclVendor::AddModule parameter docs
faae161914ff [mlir][acc] Erase empty kernel_environment ops during canonicalization (#166633)
670c453aeb19 [Offload] Remove handling for device memory pool (#163629)
527b7a48c6e1 [CI][NFC] Refactor compute_platform_title into generate_test_report_lib
36d477850fac [ASan] Skip explicit check of 'xnack' feature for gfx1250 && gfx1251. (#166754)
55436aeb2e82 [CI] Add Ability to Explain Failures
792524e067ad Fix openacc bazel build (#166790)
7227030a82ec [HLSL] enable Wconversion by default for HLSL (#166617)
4d67e157682d [HLSL] add support for HLSLAggregateSplatCast and HLSLElementwiseCast to constant expression evaluator (#164700)
96d4cb080d8b [CI][NFC] Generalize _format_ninja_failures
0b153a9de9d9 Fix a build error
c08644caa684 [NFC][LLVM][Transforms/Utils] Fix indentation inside namespace (#166624)
2a2d7496cce5 [NFC][LLVM] Code cleanup in RegionPrinter (#166622)
a1640c1e89a8 [NFC][TableGen] Adopt CodeGenHelpers in InstrInfoEmitter (#166442)
d380c2a96df4 Fix bazel build issue caused by #165615 (#166788)
b1bd74e1cc1f [LLVM][DebugInfo] Allow ExtraData field to be a node reference (#165023)
1af0424fed2e [flang][OpenACC] simplify copy and combiner recipe generation (#164988)
fa050eadab2e Reland: CodeGen: Record MMOs in finalizeBundle (#166689)
28c6ed591464 [mlir][acc] Add ACCImplicitData pass for implicit data attributes (#166472)
54803f8fce5b [RISCV] Add test cases for widening add/sub with mismatched extends. NFC (#166700)
3d589a93efed [VPlan] Add splitAt unit test. NFC (#164636)
ff11b93bb8f5 [RISCV] Correct the CFA offsets for stack probing. (#166616)
43b69e760eb4 Filter out unemitted metadata before assertion in AIXAsmPrinter. (#165620)
e0736c0b4338 [MLIR] Add missing namespace qualifier in BufferizableOpInterface.td (#166781)
c6e8f63d8d7a merge main into amd-staging (#508)
a6893f9a113a [DA] Regenerate test checks (NFC) (#166736)
875646b1f3c1 [llvm] Proofread Extensions.rst (#166716)
cdfd2905fd0b [mlir] Remove redundant declarations (NFC) (#166714)
eb63a4aa9e0c [llvm] Remove redundant declarations (NFC) (#166713)
753d4bc57147 [lld] Remove redundant declarations (NFC) (#166712)
d5d697f35912 [clang] Remove redundant declarations (NFC) (#166711)
7ff8a5175428 [libc] Fix stale char_ptr for find_first_character_wide read (#166594)
277bd096890f Revert "Add dump info for VarDecl"
b8a814e4fce2 [HLSL] Add support for user semantics (#153424)
831a8b55cafe Add dump info for VarDecl
3a6875119080 [MLIR][XeGPU][Transform] add xegpu.set_desc_layout transform op (#165615)
9d1b578a2237 [RISCV] Shrink deleted dead ADDI's use if coalesced in RISCVInsertVSETVLI (#166729)
99bb7895fa83 [VPlan] Fix gcc -Wparentheses warning
75573041969a [mlir][linalg] Update vectorization of linalg.pack (#163539)
4830e638f939 [LLVM][CodeGen][AArch64] Improve lowering of boolean vector popcount operations. (#166401)
cbb9b0e08ed1 [AArch64] Lower v1i64 and v2i64 [S|U][MIN|MAX] to SVE when available (#166735)
e974c6577441 [CIR] Implement __builtin_object_size and __builtin_dynamic_object_size (#166191)
47d71b69b493 [BasicTTI] Only split vectors with even element counts in getCastInstrCost (#166528)
fd9dd4327f2a [mlir] Use LDBG to replace LLVM_DEBUG (NFC) (#166733)
f3413bdb5adb merge main into amd-staging
0663710a59e6 [X86] narrowBitOpRMW/combineTruncate - don't retain pointer info for unknown offset load/stores (#166752)
f6a44f45c2e9 merge main into amd-staging (#506)
e4467fbf3077 [clang][ExprConst] Handle dependent switch case statements (#166533)
ee0818a1f1fa [X86] Add test coverage for #166744 (#166745)
55fb1caf8a13 Revert "[mlir] Make remove-dead-values pass remove blocks arguments first" (#166746)
474237bcdd6d [mlir] Expose output strategies of TimingManager (#166548)
47cf5a1b828d [SPIRV] Support for the extension SPV_ALTERA_blocking_pipes (#138675)
22b6c491d69e [libc] Enable the FPU in Arm startup code (#166349)
06ec47055ad1 [SPIRV] Handle unknown intrinsics (#166284)
59f6f33bc3d6 Reapply "[utils][UpdateLLCTestChecks] Add MIR support to update_llc_test_checks.py." (#164965) (#166575)
3ea1ffde025c Fix bazel build issue caused in #166517 one last target (#166739)
f9360e36c1ae [clang] Make 'fileScopeAsmDecl' matcher public (#165319)
92da0ec4626d Fix bazel build issue caused in #166517 (some targets were still failing) (#166737)
eab44600fb7e [VPlan] Rename onlyFirst(Lane|Part)Used (NFC) (#166562)
fc179af5202f Fix bazel build issue caused in #166517 (#166734)
a24eac88eb53 [BOLT] Adding a unittest that covers Arm SPE PBT aggregation (#160095)
cad96ad703d9 [NFC] Refactor target intrinsic call lowering (#153204)
cc8478b38d9d Revert "[compiler-rt][HWAsan] Remove CHECK lines from test"
bf2653ed770f Disable flaky test TimeProfilerTest.ConstantEvaluationCxx20
f88071301053 [gn build] Port 22242ae072f8
22242ae072f8 [clang-tidy] Rename `cert-flp30-c` to `bugprone-float-loop-counter` (#166571)
8b3a124ad87d Revert "[InterleavedAccess] Construct interleaved access store with shuffles"
d1874047f590 [VPlan] Retrieve alignment from Load/StoreInst in constructors. nfc (#165722)
a928c6196100 [mlir] Make remove-dead-values pass remove blocks arguments first (#165725)
9e6a31f832a4 [WebAssembly] vf32 to vi8, vi16 lowering (#164644)
9f5811ec6bd5 [compiler-rt][HWAsan] Remove CHECK lines from test
1c756932e996 [DAGCombiner] Bail out if BitWidthDiff > BitWidth when folding cltz(and) - BitWidthDiff (#166607)
9100c9212db8 [AArch64] Enable masked load/store for Streaming-SVE with -march=armv8-a+sme (#163133)
cc8f7cd2521c [ORC][LibraryResolver] Fix ensureFilterBuilt assertion failure and concurrency issue. (#166510)
5e7f7a496ccb ARM: Add fp128 ldexp tests (#166619)
6986f125c901 [mlir] [NFC] Remove stray debug statement (#166696)
556dd019eb7f [RISCV] Expand multiplication by `(2/4/8 * 3/5/9 + 1) << N` with SHL_ADD (#166372)
868f23f8cc1a [LoongArch] Override hooks to enable sink-and-fold support in MachineSink (#163721)
49f55f499122 [mlir][ods] Enable granular pass registration. (#166532)
e700f157026b [clang] Unify -mspeculative-load-hardening as a benign compiler option (#166640)
95557e32945a [flang][cuda][NFC] Use the NVVM op for syncwarp (#166695)
9a0000b15022 Revert "[AMDGPU][MC] GFX9 - Support NV bit in FLAT instructions in pre-GFX90A" (#166693)
5fcd39acd5fe merge main into amd-staging
3361e40c3288 [AIX] Enable libc++ bots on AIX (#166650)
7c863ecf3fb7 merge main into amd-staging (#504)
260f9e9f67c0 [profcheck] Update profcheck-xfail.txt after recent fixes (#166685)
67b6fd04dd46 AMDGPU: Delete redundant recursive copy handling code (#157032)
5321f5c0b29c [compiler-rt][Profile] Temporarily Disable Test on AIX
77e3975236fc [LSCFG][profcheck] Add dummy branch weights for the dummy switch to dead exits (#164714)
b84784f9e63d [profcheck] Exclude GPU tests (#166681)
93ef57617d08 Add binop-select test to profcheck-xfail.txt (#166678)
e04f278ae1d1 merge main into amd-staging
47e450190cbb [flang][acc] Lower zero modifier for Copyout clause (#166660)
1262dce1fddf Add support for ref.func to AsmParser/MC (#163326)
bd9030e762c0 [debugserver] Move constants into TaskPortForProcessID (NFC) (#166670)
7d640c0b696c merge main into amd-staging (#503)
d2b43ffffcde [AIX] unsupport gcc triple test case on aix NFC (#166408)
158dfe9b3d61 [gn build] Port d584d00ed250
d584d00ed250 [lldb] Introduce SBFrameList for lazy frame iteration (#166651)
8321eaa037b9 [gn build] Port ac547a532a91
3665e7606e22 [gn build] Port 050cbd297ba4
050cbd297ba4 [CodeGen] Allow negative frame indicies in Register class. (#164459)
3b010c96ac23 [Clang] Add elementwise ldexp builtin function (#166296)
d18b79642926 [clang-doc] remove FullName from serialization (#166595)
46c948935dd9 [GlobalISel] Add `G_EXTRACT_VECTOR_ELT` for `computeKnownBits` (#164825)
1ff0098f7c9d [profcheck] Exclude Instrumentation tests (for now) (#166659)
efe8573127b5 [LV] Add extra tests for narrowing interleave groups with op chains.
597cd767d6ad Revert "[libc] Return errno from OFD failure paths in fcntl." (#166658)
2d5170594147 [webkit.UncountedLambdaCapturesChecker] Assertion failure with coroutine body (#165650)
9fc8ddd2c840 [VPlan] Move code narrowing ops feeding an interleave group to helper (NFCI)
ac547a532a91 Analysis: Add RuntimeLibcall analysis pass (#165196)
81dede888a35 [libc] Return errno from OFD failure paths in fcntl. (#166252)
521bafc203c8 [LLDB] Fix typo
b0b46167906c [VPlan] Handle single-scalar conds in VPWidenSelectRecipe. (#165506)
c1ca4a55d41b [VPlan] Strip redundant code in VPTransformState::get (NFC) (#166145)
5da2c09e6ad3 [X86][ISel] Fix VPTERNLOG matching ensuring the InnerOp is logicOp (#166591)
f80cec2786b2 merge main into amd-staging
5e46103f1b52 [LIR][profcheck] Reuse the loop's exit condition profile (#164523)
0c0b0ea887c8 [SPARC] Mark branches as being expensive in early Niagara CPUs (#166489)
c3b284919139 [libc] Allow openat and creat to return fd 0. (#166466)
0502314f7a41 [bazel] Fix bazel build (#166641)
e79528f7b82b [TLI] Add basic support for nextafter/nexttoward libcalls (#166250)
9e2f73fe9052 [libc] Add printf error handling (with fixes #2) (#166517)
54190970cf27 [LV] Add tests for narrowing interleave groups with casts.
5f1b9023a809 [BOLT][AArch64] Fix printing of relocation types (#166621)
163933e9e709 [gn build] Port 0469ff0a212d
1fc5c02aa56a [LVer][profcheck] explicitly set unknown branch weights for the versioned/unversioned selector (#164507)
78d649199b47 [InterleavedAccess] Construct interleaved access store with shuffles
f76c13223032 [SimplifyCFG] Fix weight calculation for `simplifySwitchOfPowersOfTwo` (#165956)
b0ae054a5686 [BOLT][AArch64] Fix LDR relocation type in ADRP+LDR sequence (#166391)
0469ff0a212d TableGen: Split RuntimeLibcallsEmitter into separate utility header (#166583)
6c4f96880823 [clang-format] Fix brace wrapping for Java records (#164711)
db6231b4c2e1 [AMDGPU][MC] GFX9 - Support NV bit in FLAT instructions in pre-GFX90A (#154237) targets
1041423393ff [clang][SourceManager] Reuse code when computing Column and Line numbers (#166593)
e2d2affc70a8 [AMDGPU][LowerBufferFatPointers] Fix crash with `select false` (#166471)
3d0a3674d9ae [libc] Make errno asserts noop on gpu targets (#166606)
28a279ce14f9 [lldb-dap] expand tilde in dap executable path (#162635)
00171b352def [NFC][TableGen] Adopt CodeGenHelpers in SDNodeInfoEmitter (#165622)
37fff6e17ee2 [NFC][LLVM][IR] Cleanup namespace usage in LLVM IR cpp files (#166477)
e7f7973899f7 [libc] Migrate wctype_utils to use wchar_t where applicable. (#166234)
120689e46679 [libc] Migrate ctype_utils to use char instead of int where applicable. (#166225)
1d471f8ce2ad merge main into amd-staging (#501)
c193eea86e9f [gn build] Port 056d2c12f756
bc55f4f4f2b4 [debugserver] Fix debugserver build on < macOS 10.15 (#166599)
d3caae1c07c2 [clang] Refactor clang's keyword enable/disable mechanism to allow lldb to re-use it (#165323)
ebeb36b12e46 [PowerPC] Implement vsx rotate left word instr (#160754)
af0b6b18a869 [ProfCheck][NFC] fix argument order for call to setExplicitlyUnknownBranchWeightsIfProfiled (#166601)
3c162ba247d3 [LLDB][NativePDB] Add non-overlapping fields in root struct (#166243)
d49c6707d073 [libcxxabi][demangle] Fix the cp-to-llvm.sh sync script to copy all headers (#166572)
efa7ab06ebf7 [profcheck] Add unknown branch weights to expanded cmpxchg loop. (#165841)
cb41408d3c2d [Xtensa] Fix S32C1I instruction encoding and copyPhysReg. (#165174)
f60e69315e9e [llvm] Emit canonical linkage correct function symbol (#166487)
9564b26f81f4 [lldb] Support a Stable ABI LLDB_PYTHON_EXT_SUFFIX (#166269)
a1f0fe140a0e [MsDemangle] Use NodeList over SmallVector for target names (#166586)
6312d2751144 [DirectX] Emit `hlsl.wavesize` function attribute as entry property metadata (#165624)
dd8892300e72 RuntimeLibcalls: Remove LibcallLoweringPredicate from RuntimeLibcallImpl (#166585)
056d2c12f756 RuntimeLibcalls: Split lowering decisions into LibcallLoweringInfo (#164987)
9b1719efa063 [lldb] Mark single-argument SourceLanguage constructors explicit (#166527)
0b72899f6db9 [libc][math] Refactor the `math_errhandling` macro definition (#166350)
95c875052550 [AArch64][GlobalISel] Added pmull/pmull64 intrinsic support (#165740)
d4e3a2327da1 [clang] [doc] Document that the ms_abi attribute works on aarch64 too (#166373)
3154a841be80 [AMDGPU] Autogenerate R600 packetizer checks (#166570)
6fec104b4573 [AMDGPU] Enable typechecks for __builtin_amdgcn_raw_ptr_buffer_atomic_fadd_v2f16 (#166547)
d7c1df38b99b [llvm] Proofread GoldPlugin.rst (#166509)
aea75d059f24 [ObjectYAML] Remove redundant declarations (NFC) (#166508)
0b29c3c1a1bd [Hexagon] Remove redundant declarations (NFC) (#166507)
ab02808c66b0 [Support] Simplify minIntN and isUIntN (NFC) (#166506)
99334f74ae1c [ADT] Add static_assert to llvm::to_address for function types (#166505)
0b5a00aab739 [CI][NFC] Reformat Python Files in .ci directory
22013cc2375d device-libs: Remove hardcoded datalayouts from IR files (#497)
bb367c14aea9 [polly] Fix unused variable warnings
ce5dac67ffdd [gn build] Port dd14eb8242d7
3bf0ce15f80d [gn build] Port 718a3b268fcf
9bb67f88ed28 [gn build] Port 51d0f6d6e172
ef6947b098e8 [gn build] Port 3ebed51e997b
a796d1836930 [gn build] Port 370058777be2
ff108f7486fa Fix failures introduced in #166032 (#166574)
3641e269b0fd [MsDemangle] Read entire chain of target names in special tables (#155630)
d568601d5a0c [NFC][TableGen] Adopt NamespaceEmitter in DirectiveEmitter (#165600)
fa6cc7eadedd [Clang] FunctionEffects: ignore (methods of) local CXXRecordDecls. (#166078)
4334b43c6593 [gn] port bb4ed55acdbc
52cb6e9d49f8 [ProfCheck][NFC] Make Function argument from branch weight setter optional (#166032)
87fb7b002db3 [flang] Adding NOTIFY specifier in image selector and add notify type checks (#148810)
338fb02c9878 [BOLT][NFC] Rename funtions with _negative suffix to _unknown when th… (#166536)
9762ab0c3d25 [MLIR][NVVM] Fix the lowering of mbarrier.test.wait (#166555)
3a84aef64a19 [PowerPC][NFC] auto gen checks vec rounding tests (#166435)
6f7e70deba2e merge main into amd-staging
ad8f6b44be14 DAG: Avoid some libcall string name comparisons (#166321)
14c76437ee1a [clang][bytecode] Check types when loading values (#165385)
63d6e3eb4664 [DebugInfo] Assign best possible debugloc to bundle (#164573)
3426f9c4811d [clang][NFC] Rename stale TypeSourceInfo DI variables (#166082)
dd14eb8242d7 [RISCV] Introduce pass to promote double constants to a global array (#160536)
5fedb7c58881 [Clang][ARM] Fix tests using thumb instead arm arch on cc1 (#166416)
6e8375fb6cf0 merge main into amd-staging (#498)
438a18c1e105 [X86] Add test coverage for #166534 (#166552)
ba1dbdd44a1d Revert "[utils][UpdateLLCTestChecks] Add MIR support to update_llc_test_checks.py." (#166549)
305cf623d7ab [Polly] Check for ISL errors after schedule optimization (#166551)
1de55c9693f3 [VPlan] Avoid sinking allocas in sinkScalarOperands (#166135)
b675c0c498e4 [RISCV] Add a test for multiple save locations of a callee-saved register (#164479)
a389472e0b63 [Clang][NFC] Refactor SemaCXX/dllexport.cpp to use -verify= instead of macros (#165855)
98ca2e85450c [clang-tidy][NFC] Fix broken link in `bugprone-default-operator-new-on-overaligned-type` (#166546)
0d77cba6e165 [libc][math] Refactor exp2m1f16 implementation to header-only in src/__support/math folder. (#162019)
04c01f066032 [gn] port 0c7300923638403
a4105707eeaf [llvm][LoongArch] Introduce LASX and LSX conversion intrinsics (#157818)
c782ed3440b5 [utils][UpdateLLCTestChecks] Add MIR support to update_llc_test_checks.py. (#164965)
6c640b86e6e0 [mlir][LLVM] Fix unsupported FP lowering in `VectorConvertToLLVMPattern` (#166513)
a38e0942407e [mlir] Dialect Conversion: Add support for post-order legalization order (#166292)
c1dc064ba063 [CIR] Add support for storing into _Atomic variables (#165872)
e8564830c19e [clang-tidy][doc] add more information in twine-local's document (#166266)
fedd3b0399e7 [clang][bytecode] Remove dummy variables once they are proper globals (#166174)
d249e67a6a7c [libc][math] Disable `FEnvSafeTest.cpp` if AArch64 target has no FP support (#166370)
d1142e807da8 merge main into amd-staging
a87d2e9fceba merge main into amd-staging (#496)
5821b09e5f34 [clang][bytecode] Print primitive arrays in Descriptor::dumpFull() (#166393)
a65867ac319b [BOLT][AArch64] Fix search to proceed upwards from memcpy call (#166182)
86d9e1c035fe [clang] Delete duplicate code in sourcemanager (#166236)
28e024fb9d42 [X86] narrowBitOpRMW - allow additional uses of the BTC/R/S result (#166376)
fb49adb6ea8f [AMDGPU] Another test for missing S_WAIT_XCNT (#166154)
833983918d2f Revert "CodeGen: Record MMOs in finalizeBundle" (#166520)
35ee3c6f72ba [MLIR][NVVM] Update mbarrier Ops to use AnyTypeOf[] (2/n) (#165993)
cc9ad9afc6d2 [Clang] Add constexpr support for AVX512 permutex2 intrinsics (#165085)
0314b939d695 [Headers][X86] avx ifma - move constexpr to the end of the function attribute lists. NFC. (#166523)
7b3fe5fd42f1 [LV][NFC] Remove undef values in some test cases (#164401)
8eacea993408 [MLIR][ODS] Re-enable direct implementation of type interfaces with method bodies (#166335)
628d53aba532 [InstCombine] Enable FoldOpIntoSelect and foldOpIntoPhi when the Op's other parameter is non-const (#166102)
5b5d0a831967 [libc++][NFC] Make __type_info_implementations a namespace (#166339)
51d0f6d6e172 [clang-tidy] Rename `cert-dcl58-cpp` to `bugprone-std-namespace-modification` (#165659)
988c1b1c8dc8 [libc++] Remove <cstdlib> include from <exception> (#166340)
98f0139f5bd9 Fix bazel build issue caused by #166259 (#166519)
9016c60c685e [clang] Call ActOnCaseExpr even if the 'case' is missing (#166326)
66f52ca9a09c test: correct typo in RUN line (#166511)
304d2ff4d998 CodeGen: Record MMOs in finalizeBundle (#166210)
19a9de06cd55 [clang] Accept empty enum in MSVC compatible C (#159981)
4c2a9c4ba379 [msan][test] Add some avx512bf16 tests (#166219)
87b1d3537ae6 [AMDGPU][NFC] Avoid copying MachineOperands (#166293)
044e0f041d5c Revert "IR: Remove null UseList checks in hasNUses methods (#165929)" (#166500)
849038cad16f AMDGPU: Do not infer implicit inputs for !nocallback intrinsics  (#131759)
6111ff16df37 [RISCV] Implement shouldFoldMaskToVariableShiftPair (#166159)
b5f200129ad9 [CodeGen] Register-coalescer remat fix subreg liveness (#165662)
a487a79457f6 merge main into amd-staging
cdf52a1325e6 [msan][NFCI] Generalize handleVectorPmaddIntrinsic() (#166282)
c5aaee0bb07b Revert commit d8e5698 and 15b19c7 (#166498)
f291f335c962 [MLIR][XeGPU] Support order attribute and add pattern for vector.transpose in WgToSg Pass (#165307)
952d4b4c0bc9 [CIR] Fix assignment ignore in ScalarExprEmitter (#166118)
0c7300923638 [WebAssembly] TableGen-erate SDNode descriptions (#166259)
9cd1e4067873 [HLSL] Layout Initalizer list in Column order via index conversion (#166277)
a2977dea61d7 [libc++][NFC] Removed unsupported compilers from tests (#166403)
6c9b5943f846 [MLIR] Fix generate-test-checks.py to not remove every blank lines (#166493)
5c8bcf7dc746 [flang][cuda][NFC] Move CUDA intrinsics lowering to a separate file (#166461)
3d8e338fa8cb [OpenMP] [Xteam] Restrict conditions for occupancy based opts. (#489)
d998f92a002b [CodeGen] MachineVerifier to check early-clobber constraint (#151421)
f43a6c934709 merge main into amd-staging (#495)
d6fdfe0a27d3 CodeGen: Record tied virtual register operands in finalizeBundle (#166209)
cfca2297822d AMDGPU: Add and clarify reserved address spaces (#166486)
4b367e0b85aa [WebAssembly] Use IRBuilder in FixFunctionBitcasts (NFC) (#164268)
1458d313a1ca [SLU][profcheck] Propagate profile for branches on injected conditions. (#164476)
397415792929 AMDGPU: Pre-commit a test (#166414)
1d0aa6c2ad62 [BOLT] Fix impute-fall-throughs (#166305)
15b19c732172 [libc] Fix fprintf_test assuming specific errnos. (#166479)
b296e8fdf847 [ProfCheck] Disable X86 AMX Test Case
2b60b6d9639c [llvm][mustache] Avoid extra allocations in parseSection (#159199)
4209e41c4c1b [libc] Fix printf long double bugs (#166474)
02f5a1aa07f1 [ADT] Introduce Casting function objects (#165803)
d54793113798 [SLU][profcheck] create likely branch weights for guard->branch (#164271)
3ebed51e997b [Clang][LLDB] Refactor trap reason demangling out of LLDB and into Clang (#165996)
fa5cd27ef0fc [profcheck] Add unknown branch weights to expand LL/SR loop. (#166273)
6d4e75cc931a [MISched][NFC] Rename isUnbufferedGroup to isReservedGroup (#166439)
d8e56988f87e [libc] Add printf error handling (with fixes) (#166382)
8648beff758d [NFC][libclc] Rename clc_cbrt.inc to clc_cbrt.h (#166330)
0fd029858a81 [libc] Add chown and getgid implementations (#166434)
2b4ac6629782 AMDGPU: Cleanup and modernize limit-coalesce.mir test (#166465)
dc3b5141cdf7 [llvm][mustache] Optimize accessor splitting with a single pass (#159198)
92aa68dee249 merge main into amd-staging
4cdeb7d292bf [libc++] Remove guards for builtins for reference binding traits (#166288)
ac5b6151976c [BPF] Remove dead code related to __bpf_trap global var (#166440)
a51d219ee7ac [sanitizer_common] Add arm64e module type (#166018)
d4222bf9e217 [libc++] Use saturation builtins directly for {add,sub}_sat (#165228)
c93df83b0469 [opt-viewer] Account for optimization records YAML files generated by LTO (#135059)
cf73a0b102c0 [AArch64] Copy implicit def operands when creating LDP. (#164253)
b21949eb3403 Revert "[mlir][memref]: Collapse strided unit dim even if strides are dynamic" (#166448)
025e431e7450 [LoopFusion] Forget loop and block dispositions after latch merge (#166233)
8cd22447af23 AArch64: correct `preserve_most` and `preserve_all` on Windows (#166436)
ce091da5df6c [AMDGPU] Mark WMMA machine instructions as convergent (#165602)
1c85981e3770 [llvm][mustache] Avoid redundant saves in accessor splitting (#159197)
6e2dcdb17c0e Fix implicit truncation of `select` non-bool vector conditions (#166279)
9703bda95b08 [mlir][xegpu] Add OptimizeBlockLoads pass.  (#165483)
7f4063019d74 merge main into amd-staging (#488)
2141edf506ba [lldb] Skip tests on older versions of clang
4f428d30e4d8 [clang][lit] Add SPIR-V to some OpenMP offload tests (#165775)
240fe7e8445e [CIR][NFC] EHScope & Cleanups Iterators and operators overloading (#165317)
6f91f588d98a build_llvm_release.bat: Put the 32-bit sanitizers back (#166437)
718818a5cb4c [MLIR][Linalg][Transform] Expose more args in VectorizeChildren[...] op's Python bindings (#166134)
2a65fab965d6 [z/OS] Improve compiler options on z/OS (#166415)
8f683c3e4b6f [AMDGPU] NFC, delete promote-alloca testcase (#166297)
a99e13271e25 [Docs][clang-tools-extra] Convert maintainers list to .rst format (#165171)
7272a6c8882d [BPF] Avoid relocation for jumptable entries (#166301)
e03fac127c8d [X86][NewPM] Port X86PartialReduction to NewPM
4776451693f4 [X86][NewPM] Port lower-amx-intrinsics to NewPM
0ae0ac0f004d [cmake] Pass PYTHON_EXECUTABLE to native builds (#163574)
5be12e1c95e8 [LLDB] Run API tests with PDB too (#149305)
e5f191e17172 DeclareRuntimeLibcalls: Add registered target checks to test run lines
dc94f2cbadfd [Offload] Add device UID (#164391)
92a1eb37122f AArch64: Regenerate cost model tests
fe5c3cbbd02f Added Conditions of SM90 and ISA7.8 for Using cvt.ftz.f32.bf16 Instruction (#165774)
7398591148f4 [CodeGen] Add skipFunction() check to MachineFunctionSplitter (#166260)
2bc22ea02edd [NFC][SLU] Update SimpleLoopUnswitch/guards.ll (#166285)
e29ee270615f [lld][WebAssembly] Allow `--no-stack-first` in addition to `--stack-first` (#166384)
290ff955f07f [VPlan] Verify incoming values of VPIRPhi matches before checking (NFC)
2e8543c73555 [X86] Improve variable 8-bit shifts on AVX512BW (#164136)
831e79adff45 DAG: Merge all sincos_stret emission code into legalizer (#166295)
fe106b6e7308 BasicTTI: Cleanup multiple result intrinsic handling (#165970)
8ee1803538f9 [CIR] Upstream Builtin ExpOp (#166061)
fb21f16fe6fd RuntimeLibcalls: Add stub API for getting function signatures (#166290)
6217f351bb88 [NFC][AMDGPU] use DAG.UpdateNodeOperands update chain (#166396)
8208591f274c [RISCV] Use TargetConstant for the immediate RISCVISD::SHL_ADD nodes. (#166312)
208b7360152b [X86] select-copy.mir - regenerate test checks. NFC (#166405)
8aff0d99d608 [test] Avoid writing test output to readonly dir (#166404)
1409db663139 [LifetimeSafety] Fix Python path for Windows compatibility (#166291)
1aa86ca521b8 [LoopUnroll] Fix division by zero (#166258)
39221718519f [clang][Driver][HIP] Change OffloadingActionBuilder to respect the --no-gpu-bundle-output flag (#163834)
4ce58833d365 [SimplifyCFG] Fix value enumeration of a full range (#166379)
a02e5740119a [libc] Add faccessat entrypoints for aarch64 and riscv (#165869)
ecd67a7a95b3 [CIR] Upstream CXXDefaultArgExpr for AggregateExpr (#165991)
0e0ec981255d Revert "[C2y] Support WG14 N3457, the __COUNTER__ macro (#162662)"
6601c384d36e Fix getting section info in large mach-o files. (#165940)
8d8e9eba3b4f merge main into amd-staging
0ce03c2be4c4 [flang][cuda] Add interface and lowering for atomicadd_r4x2 and atomicadd_r4x4 (#166308)
67ce4aba2617 [llvm][mustache] Use single pass when tokenizing (#159196)
37825ad4f6c5 [LoopUnroll] Prevent LoopFullUnrollPass from performing partial unrolling when trip counts are unknown (#165013)
af9a4263a1a2 [LAA] Only use inbounds/nusw in isNoWrap if the GEP is dereferenced. (#161445)
83d1599871b2 [lldb-dap] Addressing orphaned processes in tests. (#166205)
89ec96b8b4f4 [HLSL] Implement the f16tof32() intrinsic (#165860)
a7ba6de4bdae [OpenMP] [Xteam] Added envar LIBOMPTARGET_AMDGPU_XTEAM_BLOCKSIZE (#482)
2dc0fa1000d6 [Flang] Nested directives are comments (#166348)
cc3ad201ecd5 [analyzer] Revert incorrect LazyCoumpoundVal changes (#163461)
78769d51c6b1 [LLDB] Don't check for libcxx if LLDB_ENFORCE_STRICT_TEST_REQUIREMENTS is off
ed7d6c3511df [ADT] Deprecate a soft-deprecated APInt constructor (#166314)
4749bf56a65e [lldb] When starting in a hidden frame, don't skip over hidden frames when navigating up/down (#166394)
a50d036c0bf4 [NFC] [Build Fix] Fix failing test case due to missing host arch. (#166392)
6c563dc6a212 [mlir][XeGPU] Add optional layout attribute to LoadGather StoreScatter ops (#163414)
71022d1ed6f1 [Clang] [Docs] Add some CMake example code for linking against libclang (#166268)
2286118e6f2c [SPIRV] Enable `bfloat16` arithmetic (#166031)
534814134477 Fix rare race condition in asan dm (#480)
bcb3d2f51222 build_llvm_release.bat fixes (#166385)
42b25085ade7 merge main into amd-staging (#486)
50faea28fb93 [llvm] Use conventional enum declarations (NFC) (#166318)
502742b53868 [llvm] Proofread MergeFunctions.rst (#166317)
370058777be2 [BinaryFormat] Remove redundant declarations (NFC) (#166316)
c2269c842d28 [ADT] Move llvm::to_address to STLForwardCompat.h (NFC) (#166315)
a2495ff991bd [NFC][TableGen] Emit empty lines after/before namespace scope (#166217)
5ba746d9a0f7 [NFC][TableGen] Use namespace qualifier to define `RecordKeeperImpl` (#166220)
3170345e20b7 [X86] Fix LEA compression on 64 bit (#166334)
b25868157df7 [NFC] add LLVM_ABI to function getMemcmp declaration (#166192)
718a3b268fcf [BOLT][AArch64] Run LDR relaxation (#165787)
bdf02486a3b1 [clang][Driver] Fix crash in --offload-new-driver and -save-temps. (#165606)
2237a18f25dc [SPIRV] Enable OpenCL max_work_group_size translation via `SPV_INTEL_kernel_attributes` (#165891)
a01e4da6d643 [PowerPC] Ensure correct codgen for MMA functions for cpu=future (#165791)
ed45c0571eb3 [MLIR][XeGPU] fix load/store/prefetch op offset verifier (#166137)
389aa46a9940 [mlir][DataFlow] Add visitBlockTransfer hook to dense analyses (#166263)
12f392cff10f [AArch64][SME] Support `aarch64-split-sve-objects` with VLAs/realignment (#163816)
89c261703948 [X86] bittest-big-integer.ll - add test showing multiple uses of the RMW store chain AND its stored value (#166366)
2e89b7190630 [C23] Correctly handle missing embed with -MG (#166188)
f771f1ee2697 [llvm-dwarfdump][CMake] Link against BinaryFormat (#166364)
8a84b285f67c [SimplifyCFG] Eliminate dead edges of switches according to the domain of conditions (#165748)
747050bcceca [MLIR][XeGPU][VectorToXeGPU] Lower vector.load/store/transfer_read/transfer_write to new offsets syntax (#162095)
0307147105d5 [NFC][SPIRV] Add AMDGCN SPIR-V specific defaults to the BE (#165815)
28a20b4af944 [VectorCombine] Avoid inserting freeze when scalarizing extend-extract if all extracts would lead to UB on poison. (#164683)
fc5d7450a612 merge main into amd-staging
ee944968e72a merge main into amd-staging (#484)
8cb0c0ce1be9 Revert "Fix bazel build issue after #166157" (#166359)
bea31dd373e3 Fix bazel build issue after #166157 (#166358)
51269e220da6 [bazel] Fix bazel build for bb4ed55acdbc7f48bc978147189e8106e3ea42f8
8636c40f4e54 [lldb][test] Skip registration-unique.test on Windows
dbce71382c65 [AMDGPU] Skip debug instructions when eliminating S_SET_GPR_IDX_ON/OFF (#160715)
0a95a86634a8 [VPlan] Fix first-lane comment in sinkScalarOperands (NFC) (#166347)
f037f413506a [IR] Add new function attribute nocreateundeforpoison (#164809)
f59beca45948 [X86] bittest-big-integer.ll - add test showing dependent BTC/BT/BTS sequence on same i128 all with unknown bit indices (#166351)
99a1fcad5de6 [UTC] Update AMDGPU asm regexp for private functions (#166169)
0c653514cc2f [mlir][rocdl] Add GlobalLoadAsyncToLDS operation (#165374)
c80faaefe10f [lldb][test] registration-unique.test: don't hardcode the order of recognizer registration
f8656ed45575 [llvm][dwarfdump] Add --child-tags option to filter by DWARF tags (#165720)
85c782743587 [ASan][Test-Only] Allow read of size 2 in strcmp.c test (#166179)
c4ac31dc52e0 [X86] narrowBitOpRMW - use reachesChainWithoutSideEffects instead of direct chain matching (#165870)
5b2f9b53bdb3 [SimplifyCFG]: Switch on umin replaces default (#164097)
f7be258c2893 [BOLT][NFC] Clean up the outdated option --write-dwp in doc (#166150)
97947f150f55 [MLIR][NVVM] Update Op verifiers to prevent ungraceful exits (#165677)
ab487b6378cf [BitcodeReader][NFC] Tidy getEnableSplitLTOUnitAndUnifiedFlag (#165732)
0cae0af520f7 [VPlan] Shorten insert-idiom in sinkScalarOperands (NFC) (#166343)
73e3d788906a [flang][test] Fix gcc-triple test when default target is not x86_64
dec6e7380c69 [LIT][Cygwin] Mark `ulimit -f` unsupported for Cygwin (#165849)
84cc2b0ebc2b [LIT] replace `lit.util.mkdir` with `pathlib.Path.mkdir` (#163948)
9ee9fb0d4090 [X86] narrowBitOpRMW - add handling for single bit insertion patterns (REAPPLIED) (#166337)
bb4ed55acdbc [lldb][Runtime] Move VerboseTrapFrameRecognizer into CPPLanguageRuntime (#166157)
c02bdd466a1c [AMDGPU] Fix handling of FP in cs.chain functions (#161194)
baf55f98a4f8 merge main into amd-staging
f74e90961f51 [mlir][memref]: Collapse strided unit dim even if strides are dynamic (#157330)
6ad25c5912fc [AArch64] Improve the cost model for extending mull (#125651)
954730566f56 merge main into amd-staging (#483)
25a592cc63f0 [MLIR][NVVM] Update redux.sync op (#166125)
04619db16ba8 [RISCV] Add short forward branch support for `mul` instruction (#166300)
546a783d81d3 [Attributor] Fix sanitizer for getAssumedMinTrailingZeros()
52fdcf94a39a [RISCV][NFC] Match `3/5/9 * 3/5/9 * 2^N` without a loop (#165547)
0ba7bfc34fa2 [DFAJumpThreading] Enable DFAJumpThread by default. (#157646)
93e860e69477 IR: Remove null UseList checks in hasNUses methods (#165929)
993a38fa539d [MLIR][Affine] Extend getVectorReductionOp to support xor/maxnumf/minnumf (#163310)
95d6caa5d436 [flang][cuda] Add interfaces and lowering for atomicaddvector (#166275)
57730f6cdb32 [flang][cuda] Switch to inline ptx for barrier_arrive (#166261)
a3a99c3996ff [compiler-rt][x86] Don't use assert.h when building without a libc (#165384)
00ee53cc7b55 [Attributor] Propagate alignment through ptrmask (#150158)
c902102cdd38 merge main into amd-staging
264a8fca68dc merge main into amd-staging (#481)
bb563b196f0e [OpenMP][AIX] Not to create symbolic links to libomp.so on AIX (#165890)
2b45efe92013 [AMDGPU] NFC, move testcase, only test output of promote-alloca with vector-combine (#166289)
d470bfe0d464 [clang] Mark labels referenced when used in named break or continue (#166033)
c63cb50080fb [RISCV][GISel] Support select indexed vector load store intrinsics (#165876)
70ff2c9a32ec [doc][RISCV] Add XSfvfexp* and XSfvfexpa* into RISCVUsage.rst (#166198)
4d98ee2a2224 ARM: Add watchos run line to llvm.sincos test (#166271)
749a0e1e410b update_llc_test_checks: Recognize thumbv7k triples (#166270)
03e78a9976cf [Clang][Sema] Check null after ExtractTypeForDeductionGuide (#165776)
bc08e69959ec [clang][modules] Fix crash in enum visibility lookup for C++20 header units (#166272)
a22d1c222554 Revert "[Polly] Introduce PhaseManager and remove LPM support (#125442)"
02d93f7abbd5 [RISCV] Adding `vlenb` register as callee register (#165796)
c79cad200a2b merge main into amd-staging
10cd7e68533a merge main into amd-staging (#478)
f7fff18ad096 [mlir][OpenACC] add unstructured attributes for acc.loop with early exits (#164990)
c77b614564c6 ARM: Add more ABIs to llvm.sincos test (#166264)
590a2b0a1f32 Revert "ARM: Remove unnecessary manual ABI lowering for sincos_stret (#166040)" (#166262)
d4c41b7fa30b [mlir][Transforms] Dialect Conversion: Convert entry block only (#165180)
285b57b1a68e Update BOLT's README.md example optimization flag (#166251)
73ef4dd48fa3 [flang][cuda] Add missing semi-colon in inlined ptx (#166254)
615299934489 [CodeGen][ARM64EC] Don't treat guest exit thunks as indirect calls (#165885)
2a42a85f5bf8 [MLIR][NVVM] Add support for Convert Ops with rs-rounding mode (#165736)
475c632b17a9 [flang][cuda] Use local scope to avoid duplicate definition (#166249)
562e3bfcd45c [BOLT] Add an option for constant island cloning (#165778)
c0cb5133489f [lld][WebAssembly] Use writePtrConst helper function (#166228)
148a42bdd2f2 [sanitizer-common] [Darwin] Provide warnings for common sandbox issues (#165907)
01221874e41e [SLU][profcheck] Use the original branch weigths in `buildPartialInvariantUnswitchConditionalBranch` (#164270)
cf11ff9084ca Exclude another StructurizeCFG test from profcheck (#166248)
e987ab11a6f3 [Polly] Introduce PhaseManager and remove LPM support (#125442)
0e3612e75138 [scudo] Add config option to modify get usable size behavior (#158710)
5d9d89092e0f [NVPTX] Add more clear error message for using invalid syncscope (#165737)
ecaaebf8f0f7 X86: Correct IR type used for sincos_stret return value (#166240)
3c2c9d5bc1e2 DAG: Cleanup string bool attribute check for disable-tail-calls (#166237)
a522ae3ef6e1 ARM: Remove unnecessary manual ABI lowering for sincos_stret (#166040)
ed53c413ef51 [profcheck] Exclude more tools tests (#166239)
2c8781de6489 Revert "[Clang] Make the AS of llvm.compiler.used & llvm.used elements addrspace(0)" (#166242)
f62ff29157cb [mlir][spirv] Fix Intel SubgroupBlock* op tests (#166185)
ac21fde46488 [flang][cuda] Add support for f16 atomicadd (#166229)
f77ce52b56d0 [Driver][NFC] Don't specify offloading model in help text for -Xarch_device/host (#165503)
a8de6499c240 [NFC][LLDB][BoundsSatety] Add `InstrumentationRuntime::MatchAllModules` (#166001)
5054220bf322 Revert "[Clang] Make the AS of llvm.compiler.used & llvm.used elements addrspace(0) (#164432)"
1d8d8dc6b53a [gn build] Port 1621486d676f
bc0d0cf3ac9b [Clang] FunctionEffect analysis was missing a CXXBindTemporaryExpr's implicit call to a destructor. (#166110)
5e8a0d64952f [RISCV] Use reportFatalUsageError in a few places (#166218)
1621486d676f [lldb-dap] Add support for launching supported clients (#165941)
6fe3eccdf44f [llvm][DebugInfo] Emit 0/1 for constant boolean values (#151225)
25da15f9b1f9 [lldb] Fix indentation when printing stop hooks (#165945)
e5d9644bca21 [NFC] [DirectX] Fix warning about parentheses for assertion in DXContainerGlobals.cpp (#166231)
9ff31be2f2c1 [clang][builtins] Add stdckdint.h to the modulemap. (#166230)
96f093770d21 [dsymutil] Add option to copy swiftmodules built from interface (#165293)
346da3dfd3e7 Revert "[libc] Add printf error handling" (#166232)
68c4c83bcbf9 [MLIR][XeGPU] Matrix load/store subgroup distribution (#165008)
ca00234c09e0 [RISCV] Correct comments in rv64-stackmap.ll to not use X86 register name. NFC (#165912)
0623497a0fcd [RISCV] Mark FLH as canFoldAsLoad. (#165974)
c081fb058831 [RISCV] Removed unused OPERAND_SIMM8. NFC (#166215)
ccc473254fd2 [Clang] FunctionEffects: properly extract the type of a bound member member function from a CallExpr. (#166101)
f02b66105454 [libc++] Add move constructor & assignment to `exception_ptr` (#164281)
dccced25a014 [libc++][docs] Fix documentation of `REQUIRES: std-at-least-*` (#166226)
009706ff6295 [Github] Use truncated body in llvm-bugs.yml
42b608cdd6cd [lldb] Skip tests on older versions of clang
d4ca474ca310 [libc++][docs] Fix documentation of `REQUIRES: std-at-least-*` (#166221)
15bbdd143cb1 [lldb][windows] print an error if python.dll is not in the DLL search path (#164893)
a8ea7f4580b4 Reapply: [AMDGPU][UnifyDivergentExitNodes][StructurizeCFG] Add support for callbr instruction with inline-asm (#152161) (#166195)
dd45c060ffe0 [clang][NFC] Fix BuildExtVectorType parameter name (#166208)
ca69a8d2f403 [clang-format] Fix ColumnLimit violation while aligning (#165627)
0c707c9713f0 [libc] Add printf error handling (#162876)
3a0c534032aa [ORC] Fix -Wunused-function warning on windows (#166207)
0a28c07f2f51 workflows/release-binaries: Disable LTO/PGO for testing macOS job in PRs (#165801)
8fd1bf2f8c9e [BPF] Remove unused weak symbol __bpf_trap (#166003)
d200df0557b7 [libcxx] Remove Redundant Reset in ~basic_string (#164718)
fa2c5fe21df4 [Github] Pin Remaining Github Actions to SHAs (#166194)
e302a6bf3cef merge main into amd-staging
ecdd66035632 [clang] Report Diagnostic when builtin vector has negative size (#166055)
425fe3362d49 [lldb] Fix unaligned writes in ObjectFileELF (#165759)
84a9ed25e816 [clangd] Preserve qualified names in "override pure virtual methods" tweak (#163726)
d65e712e30a8 [clang] Make "__GCC_HAVE_DWARF2_CFI_ASM" a proper predefined macro (#165731)
2de5a17e9c8e [CIR] Upstream FPToFP Builtin CeilOp (#166052)
10349ca13906 [gn build] Port 6747ea050dfc
6747ea050dfc [CAS] Add UnifiedOnDiskCache and OnDiskCAS (#114103)
e8765401d49c [AMDGPU][GlobalISel] Add RegBankLegalize support for G_FENCE (#165939)
c4…
@dhernandez0 dhernandez0 self-assigned this Nov 10, 2025
@dhernandez0 dhernandez0 requested a review from causten as a code owner November 10, 2025 13:35
@dhernandez0 dhernandez0 changed the title November upstream merge [DRAFT] November upstream merge Nov 10, 2025
@dhernandez0 dhernandez0 changed the title [DRAFT] November upstream merge November upstream merge Nov 11, 2025
@umangyadav
Copy link
Member

diff -rup llvm-project/llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp rocMLIR-2122/external/llvm-project/llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp
--- llvm-project/llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp	2025-11-11 11:15:33.302593110 +0100
+++ rocMLIR-2122/external/llvm-project/llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp	2025-11-11 13:14:15.569060089 +0100
@@ -66,11 +66,12 @@ static cl::opt<unsigned> PromoteAllocaTo
     cl::desc("Maximum byte size to consider promote alloca to vector"),
     cl::init(0));
 
+// TODO(rocmlir): temporary fix for backend bug
 static cl::opt<unsigned> PromoteAllocaToVectorMaxRegs(
     "amdgpu-promote-alloca-to-vector-max-regs",
     cl::desc(
         "Maximum vector size (in 32b registers) to use when promoting alloca"),
-    cl::init(32));
+    cl::init(16));
 
 // Use up to 1/4 of available register budget for vectorization.
 // FIXME: Increase the limit for whole function budgets? Perhaps x2?
Only in llvm-project/llvm/test/CodeGen/AMDGPU: amdgpu-num-agpr.ll
Only in llvm-project/llvm/test/CodeGen/AMDGPU: min-waves-per-eu-not-respected.ll
diff -rup llvm-project/llvm/test/CodeGen/AMDGPU/promote-alloca-max-regs.ll rocMLIR-2122/external/llvm-project/llvm/test/CodeGen/AMDGPU/promote-alloca-max-regs.ll

This TODO is this not fixed by upstream yet ?

Only in llvm-project/llvm/test/CodeGen/AMDGPU: amdgpu-num-agpr.ll

Why does amdgpu-num-agpr.ll not exist in rocMLIR

@umangyadav
Copy link
Member

umangyadav commented Nov 11, 2025

--- llvm-project/mlir/include/mlir/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.h	2025-02-27 12:36:13.465135726 +0100
+++ rocMLIR-2122/external/llvm-project/mlir/include/mlir/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.h	2025-11-11 11:14:16.535541433 +0100
@@ -27,7 +27,8 @@ class Pass;
 /// populateAMDGPUMemorySpaceAttributeConversions().
 void populateAMDGPUToROCDLConversionPatterns(LLVMTypeConverter &converter,
                                              RewritePatternSet &patterns,
-                                             amdgpu::Chipset chipset);
+                                             amdgpu::Chipset chipset,
+                                             bool hackForDirectToLDS);
 
 /// Remap AMDGPU memory spaces to LLVM address spaces
 /// by mapping amdgpu::AddressSpace::fat_raw_buffer to ptr addrspace(7),

Do we still require hack ?

-  module.walk([&](func::CallOp op) {
-    auto callee = symtab.lookup<func::FuncOp>(op.getCallee());
-    if (!callee) {
-      op.emitError() << "cannot find callee '" << op.getCallee() << "' in "
-                     << "symbol table";
-      didFail = true;
-      return;
-    }
-    if (!options.filterFn(&callee))
-      return;
-    if (callee.isExternal() || callee.isPublic())
-      return;
+  module.walk([&](CallOpInterface op) {
+    // TODO(rocmlir) CallOpInterface does not have a getCallee() method.
+    // We should enable this back once we add this method.
+    //
+    // auto callee = symtab.lookup<func::FuncOp>(op.getCallee());
+    // if (!callee) {
+    //   op.emitError() << "cannot find callee '" << op.getCallee() << "' in "
+    //                  << "symbol table";
+    //   didFail = true;
+    //   return;
+    // }
+    // if (!options.filterFn(&callee))
+    //   return;
+    // if (callee.isPublic() && !options.modifyPublicFunctions)
+    //   return;
+    // if (callee.isExternal())
+    //   return;
+    // FIXME validate callee in the symbol table.

Is this some new change ? There is getCallableForCallee for the callOpInterface.

You may want to clean this change :

diff -rup llvm-project/mlir/test/lit.site.cfg.py.in rocMLIR-2122/external/llvm-project/mlir/test/lit.site.cfg.py.in
--- llvm-project/mlir/test/lit.site.cfg.py.in	2025-11-11 11:15:35.834562546 +0100
+++ rocMLIR-2122/external/llvm-project/mlir/test/lit.site.cfg.py.in	2025-11-11 11:14:35.595303785 +0100
@@ -82,4 +82,4 @@ import lit.llvm
 lit.llvm.initialize(lit_config, config)
 
 # Let the main config do the real work.
-lit_config.load_config(config, "@MLIR_SOURCE_DIR@/test/lit.cfg.py")
+lit_config.load_config(config, "@MLIR_SOURCE_DIR@/test/lit.cfg.py")
\ No newline at end of file

What's the motivation behind this "modifyPublicFunctions" ?

 /// Replace buffers that are returned from a function with an out parameter.
diff -rup llvm-project/mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td rocMLIR-2122/external/llvm-project/mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td
--- llvm-project/mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td	2025-11-11 11:16:05.246203784 +0100
+++ rocMLIR-2122/external/llvm-project/mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td	2025-11-11 11:14:35.447305709 +0100
@@ -258,6 +258,9 @@ def BufferResultsToOutParamsPass
               /*default=*/"false", "Hoist static allocations to call sites.">,
        Option<"hoistDynamicAllocs", "hoist-dynamic-allocs", "bool",
               /*default=*/"false", "Hoist dynamic allocations to call sites.">,
+       Option<"modifyPublicFunctions", "modify-public-functions", "bool",
+              /*default=*/"false", "Modify function signatures of public "
+              "functions.">,
   ];
   let dependentDialects = ["memref::MemRefDialect"];
 }

@umangyadav
Copy link
Member

umangyadav commented Nov 11, 2025

I see lot of build warnings. Check if you can remove them in a subsequent PR to not add more diffs in this one.

@dhernandez0
Copy link
Contributor Author

diff -rup llvm-project/llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp rocMLIR-2122/external/llvm-project/llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp
--- llvm-project/llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp	2025-11-11 11:15:33.302593110 +0100
+++ rocMLIR-2122/external/llvm-project/llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp	2025-11-11 13:14:15.569060089 +0100
@@ -66,11 +66,12 @@ static cl::opt<unsigned> PromoteAllocaTo
     cl::desc("Maximum byte size to consider promote alloca to vector"),
     cl::init(0));
 
+// TODO(rocmlir): temporary fix for backend bug
 static cl::opt<unsigned> PromoteAllocaToVectorMaxRegs(
     "amdgpu-promote-alloca-to-vector-max-regs",
     cl::desc(
         "Maximum vector size (in 32b registers) to use when promoting alloca"),
-    cl::init(32));
+    cl::init(16));
 
 // Use up to 1/4 of available register budget for vectorization.
 // FIXME: Increase the limit for whole function budgets? Perhaps x2?
Only in llvm-project/llvm/test/CodeGen/AMDGPU: amdgpu-num-agpr.ll
Only in llvm-project/llvm/test/CodeGen/AMDGPU: min-waves-per-eu-not-respected.ll
diff -rup llvm-project/llvm/test/CodeGen/AMDGPU/promote-alloca-max-regs.ll rocMLIR-2122/external/llvm-project/llvm/test/CodeGen/AMDGPU/promote-alloca-max-regs.ll

This TODO is this not fixed by upstream yet ?

Only in llvm-project/llvm/test/CodeGen/AMDGPU: amdgpu-num-agpr.ll

Why does amdgpu-num-agpr.ll not exist in rocMLIR

This is the workaround we introduced here: #2028 AFAIK the fix is not done yet. This is the ticket: https://ontrack-internal.amd.com/browse/SWDEV-559837 They are actively working no it (see the recent comments).

The file amdgpu-num-agpr.ll expects the warnings related to waves-per-eu that we remove. So, instead of fixing the test I thought it was easier to remove it. I'm not even sure why we remove the warning, we could just let it be printed as well.

@dhernandez0
Copy link
Contributor Author

Do we still require hack ?

Yes, we do, @pabloantoniom is working on a better workaround: https://github.com/ROCm/rocMLIR-internal/issues/2062
This is the backend ticket: https://ontrack-internal.amd.com/browse/SWDEV-521121

Is this some new change ? There is getCallableForCallee for the callOpInterface.

This is not new, you can see this in develop:

// TODO CallOpInterface does not have a getCallee() method.

You may want to clean this change :

Sure, I will.

What's the motivation behind this "modifyPublicFunctions" ?

They changed the behavior of the pass to avoid modifying public functions. Someone fixed that with this new param modifyPublicFunctions the day after I took the commit hash, here's the PR: llvm/llvm-project#167248
So, I'm just partially taking their PR changes, so that we don't have to update Pipelines.cpp in the future upstream merge, because we already use modifyPublicFunctions.

 /// Replace buffers that are returned from a function with an out parameter.
diff -rup llvm-project/mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td rocMLIR-2122/external/llvm-project/mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td
--- llvm-project/mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td	2025-11-11 11:16:05.246203784 +0100
+++ rocMLIR-2122/external/llvm-project/mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td	2025-11-11 11:14:35.447305709 +0100
@@ -258,6 +258,9 @@ def BufferResultsToOutParamsPass
               /*default=*/"false", "Hoist static allocations to call sites.">,
        Option<"hoistDynamicAllocs", "hoist-dynamic-allocs", "bool",
               /*default=*/"false", "Hoist dynamic allocations to call sites.">,
+       Option<"modifyPublicFunctions", "modify-public-functions", "bool",
+              /*default=*/"false", "Modify function signatures of public "
+              "functions.">,
   ];
   let dependentDialects = ["memref::MemRefDialect"];
 }

@umangyadav
Copy link
Member

I'm not even sure why we remove the warning, we could just let it be printed as well.

It is disabled because otherwise when MIGraphX compiles and tunes, it will dump lot of unnecessary warnings.

@umangyadav
Copy link
Member

umangyadav commented Nov 13, 2025

Sure, I will.

Can you update diff files in this PR description after you've made all the required chanes ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants