[VPlan] Get Addr computation cost with scalar type if it is uniform for gather/scatter. #150371
Conversation
@llvm/pr-subscribers-vectorizers @llvm/pr-subscribers-llvm-transforms

Author: Elvis Wang (ElvisWang123)

Changes

This patch queries `getAddressComputationCost()` with the scalar type if the address is uniform, which makes the gather/scatter cost more accurate.

In the current LV, a non-consecutive VPWidenMemoryRecipe (gather/scatter) accounts for the cost of the address computation. But there are cases where the address is uniform across all lanes, so it can be computed with a scalar type and a broadcast.

I have a follow-up optimization that converts gather/scatter with a uniform memory access into a scalar load/store + broadcast (and a select if needed). With that optimization, we can remove this temporary change.

This patch is preparation for #149955.

Full diff: https://github.com/llvm/llvm-project/pull/150371.diff

2 Files Affected:
diff --git a/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp b/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
index 3ce9d29d34553..7adb87f4557f8 100644
--- a/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
+++ b/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
@@ -6932,6 +6932,12 @@ static bool planContainsAdditionalSimplifications(VPlan &Plan,
auto Iter = vp_depth_first_deep(Plan.getVectorLoopRegion()->getEntry());
for (VPBasicBlock *VPBB : VPBlockUtils::blocksOnly<VPBasicBlock>(Iter)) {
for (VPRecipeBase &R : *VPBB) {
+ if (auto *MR = dyn_cast<VPWidenMemoryRecipe>(&R)) {
+ // The address computation cost can be query as scalar type if the
+ // address is uniform.
+ if (!MR->isConsecutive() && vputils::isSingleScalar(MR->getAddr()))
+ return true;
+ }
if (auto *IR = dyn_cast<VPInterleaveRecipe>(&R)) {
auto *IG = IR->getInterleaveGroup();
unsigned NumMembers = IG->getNumMembers();
diff --git a/llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp b/llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
index 57b713d3dfcb9..e8a3951bbeb20 100644
--- a/llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
+++ b/llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
@@ -3083,9 +3083,18 @@ InstructionCost VPWidenMemoryRecipe::computeCost(ElementCount VF,
const Value *Ptr = getLoadStorePointerOperand(&Ingredient);
assert(!Reverse &&
"Inconsecutive memory access should not have the order.");
- return Ctx.TTI.getAddressComputationCost(Ty) +
- Ctx.TTI.getGatherScatterOpCost(Opcode, Ty, Ptr, IsMasked, Alignment,
- Ctx.CostKind, &Ingredient);
+ InstructionCost Cost = 0;
+
+ // If the address value is uniform across all lane, then the address can be
+ // calculated with scalar type and broacast.
+ if (vputils::isSingleScalar(getAddr()))
+ Cost += Ctx.TTI.getAddressComputationCost(Ty->getScalarType());
+ else
+ Cost += Ctx.TTI.getAddressComputationCost(Ty);
+
+ return Cost + Ctx.TTI.getGatherScatterOpCost(Opcode, Ty, Ptr, IsMasked,
+ Alignment, Ctx.CostKind,
+ &Ingredient);
}
InstructionCost Cost = 0;
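A minimal standalone sketch of the cost selection this hunk introduces, with made-up numbers standing in for the TTI queries (assumed costs, not real target data; ModelTTI and modelGatherCost are hypothetical names, not part of the patch):

#include <iostream>

// Toy stand-ins for the TTI queries used above; the numbers are invented.
struct ModelTTI {
  int getAddressComputationCost(bool IsVectorTy) const {
    return IsVectorTy ? 10 : 1; // assume vector address math is pricier
  }
  int getGatherScatterOpCost() const { return 4; } // assumed fixed cost
};

// Mirrors the new logic: charge the scalar address-computation cost when the
// address is uniform across lanes, since it can be computed once and broadcast.
int modelGatherCost(const ModelTTI &TTI, bool AddrIsSingleScalar) {
  int Cost = TTI.getAddressComputationCost(/*IsVectorTy=*/!AddrIsSingleScalar);
  return Cost + TTI.getGatherScatterOpCost();
}

int main() {
  ModelTTI TTI;
  std::cout << "uniform address:     " << modelGatherCost(TTI, true) << "\n";  // 1 + 4 = 5
  std::cout << "non-uniform address: " << modelGatherCost(TTI, false) << "\n"; // 10 + 4 = 14
  return 0;
}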
// If the address value is uniform across all lane, then the address can be
// calculated with scalar type and broacast.
if (vputils::isSingleScalar(getAddr()))
  Cost += Ctx.TTI.getAddressComputationCost(Ty->getScalarType());
Looks fine for RISC-V but is this accurate for e.g. x86? I think the address might still be broadcasted? cc @alexey-bataev
Yes, I think this is too optimistic
Closing since it is too optimistic. Will try another method to fix the regression caused by #149955.
Hi, after thinking about this a bit more I think this PR is actually NFC, and is probably the easiest way forward.
I really can't think of a better way to avoid the regression in #149955. Is it possible to reopen this PR? Hopefully all of this will become simpler once the widening decisions are driven by VPlan/done as VPlan transformations themselves. Pasting in the TTI hooks for reference:

InstructionCost ARMTTIImpl::getAddressComputationCost(Type *Ty,
                                                      ScalarEvolution *SE,
                                                      const SCEV *Ptr) const {
  // Address computations in vectorized code with non-consecutive addresses will
  // likely result in more instructions compared to scalar code where the
  // computation can more often be merged into the index mode. The resulting
  // extra micro-ops can significantly decrease throughput.
  unsigned NumVectorInstToHideOverhead = 10;
  int MaxMergeDistance = 64;

  if (ST->hasNEON()) {
    if (Ty->isVectorTy() && SE &&
        !BaseT::isConstantStridedAccessLessThan(SE, Ptr, MaxMergeDistance + 1))
      return NumVectorInstToHideOverhead;

    // In many cases the address computation is not merged into the instruction
    // addressing mode.
    return 1;
  }
  return BaseT::getAddressComputationCost(Ty, SE, Ptr);
}

InstructionCost
AArch64TTIImpl::getAddressComputationCost(Type *Ty, ScalarEvolution *SE,
                                          const SCEV *Ptr) const {
  // Address computations in vectorized code with non-consecutive addresses will
  // likely result in more instructions compared to scalar code where the
  // computation can more often be merged into the index mode. The resulting
  // extra micro-ops can significantly decrease throughput.
  unsigned NumVectorInstToHideOverhead = NeonNonConstStrideOverhead;
  int MaxMergeDistance = 64;

  if (Ty->isVectorTy() && SE &&
      !BaseT::isConstantStridedAccessLessThan(SE, Ptr, MaxMergeDistance + 1))
    return NumVectorInstToHideOverhead;

  // In many cases the address computation is not merged into the instruction
  // addressing mode.
  return 1;
}

InstructionCost X86TTIImpl::getAddressComputationCost(Type *Ty,
                                                      ScalarEvolution *SE,
                                                      const SCEV *Ptr) const {
  // Address computations in vectorized code with non-consecutive addresses will
  // likely result in more instructions compared to scalar code where the
  // computation can more often be merged into the index mode. The resulting
  // extra micro-ops can significantly decrease throughput.
  const unsigned NumVectorInstToHideOverhead = 10;

  // Cost modeling of Strided Access Computation is hidden by the indexing
  // modes of X86 regardless of the stride value. We dont believe that there
  // is a difference between constant strided access in gerenal and constant
  // strided value which is less than or equal to 64.
  // Even in the case of (loop invariant) stride whose value is not known at
  // compile time, the address computation will not incur more than one extra
  // ADD instruction.
  if (Ty->isVectorTy() && SE && !ST->hasAVX2()) {
    // TODO: AVX2 is the current cut-off because we don't have correct
    // interleaving costs for prior ISA's.
    if (!BaseT::isStridedAccess(Ptr))
      return NumVectorInstToHideOverhead;
    if (!BaseT::getConstantStrideStep(SE, Ptr))
      return 1;
  }
  return BaseT::getAddressComputationCost(Ty, SE, Ptr);
}
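For reference on how these hooks use their arguments, here is a tiny standalone model (an assumption-laden sketch mirroring the ARM/AArch64 pattern above, not the real TargetTransformInfo API; ModelSCEV and modelAddressComputationCost are invented names): the large per-access overhead is only returned for a vector type when a pointer SCEV is supplied and the stride is not a known small constant.

#include <iostream>

// Stand-in for llvm::SCEV that only records whether the stride is a known
// small constant.
struct ModelSCEV { bool SmallConstantStride; };

// Mirrors: if (Ty->isVectorTy() && SE && !isConstantStridedAccessLessThan(...))
//            return NumVectorInstToHideOverhead;
//          return 1;
int modelAddressComputationCost(bool TyIsVector, const ModelSCEV *PtrSCEV) {
  const int NumVectorInstToHideOverhead = 10;
  if (TyIsVector && PtrSCEV && !PtrSCEV->SmallConstantStride)
    return NumVectorInstToHideOverhead;
  return 1;
}

int main() {
  ModelSCEV UnknownStride{false};
  // With a SCEV for an unknown stride, the type matters: 10 vs 1.
  std::cout << modelAddressComputationCost(true, &UnknownStride) << "\n";  // 10
  std::cout << modelAddressComputationCost(false, &UnknownStride) << "\n"; // 1
  // VPWidenMemoryRecipe::computeCost passes no SCEV today, so both queries
  // give the same baseline result.
  std::cout << modelAddressComputationCost(true, nullptr) << "\n";         // 1
  std::cout << modelAddressComputationCost(false, nullptr) << "\n";        // 1
  return 0;
}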
…or gather/scatter.

This patch queries `getAddressComputationCost()` with the scalar type if the address is uniform, which makes the gather/scatter cost more accurate. In the current LV, a non-consecutive VPWidenMemoryRecipe (gather/scatter) accounts for the cost of the address computation, but there are cases where the address is uniform across lanes, so it can be computed with a scalar type and a broadcast. I have a follow-up optimization that converts gather/scatter with a uniform memory access into a scalar load/store + broadcast. With that optimization, we can remove this temporary change.
Thanks @lukel97, will reopen this.
// The address computation cost can be query as scalar type if the
// address is uniform.
if (!MR->isConsecutive() && vputils::isSingleScalar(MR->getAddr()))
  return true;
Were you seeing any cost model assertions with this PR? I guess the VPlan cost model never passed in the pointer SCEV to begin with so is it already out of sync on other targets?
After the rebase, this isn't needed anymore. Removed, thanks!
d56bab9 to 8989f33
LGTM.
Btw, I think this makes more sense if we change the Type argument to getAddressComputationCost to the type of the pointer, not the value. But we don't do this consistently today.
LoopVectorizationCostModel::getMemInstScalarizationCost passes in the pointer type:

  // Get the cost of the scalar memory instruction and address computation.
  InstructionCost Cost =
      VF.getFixedValue() * TTI.getAddressComputationCost(PtrTy, SE, PtrSCEV);
X86 calls it PtrTy:
InstructionCost getAddressComputationCost(Type *PtrTy, ScalarEvolution *SE,
But VectorCombine passes in the element type:
ScalarizedCost += TTI.getAddressComputationCost(VecTy->getElementType());
As does VPWidenMemoryRecipe::computeCost.
I think we should fix this in a separate PR.
// If the address value is uniform across all lane, then the address can be
// calculated with scalar type and broacast.
Suggested change:

-// If the address value is uniform across all lane, then the address can be
-// calculated with scalar type and broacast.
+// If the address value is uniform across all lanes, then the address can be
+// calculated with scalar type and broadcast.
It should be possible to add a test for this?
I think this is NFC with the current TTIs, since they only use the type when the SCEV argument is passed. We don't pass it in VPWidenMemoryRecipe::computeCost. It makes a difference with #149955 though, because RISC-V will start checking the type without the SCEV. Should the changes be bundled up into that PR?
This patch queries `getAddressComputationCost()` with the scalar type if the address is uniform, which makes the gather/scatter cost more accurate.

In the current LV, a non-consecutive VPWidenMemoryRecipe (gather/scatter) accounts for the cost of the address computation. But there are cases where the address is uniform across all lanes, so it can be computed with a scalar type and a broadcast.

I have a follow-up optimization that converts gather/scatter with a uniform memory access into a scalar load/store + broadcast (and a select if needed). With that optimization, we can remove this temporary change.

This patch is preparation for #149955.
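For illustration only, here is a hypothetical C loop (not taken from the patch or its tests) of the kind this targets: the conditional load reads through a pointer that does not depend on the induction variable, so if it is vectorized as a masked gather, every lane uses the same address, and the follow-up optimization described above could replace it with a scalar load plus a broadcast and a select under the mask.

// Hypothetical example of a predicated load with a uniform address.
void foo(int *dst, const int *cond, const int *src, int n) {
  for (int i = 0; i < n; ++i) {
    if (cond[i])
      dst[i] = *src; // 'src' is loop-invariant: the gather's address is the
                     // same for every vector lane.
  }
}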