
Conversation

@david-arm
Contributor

We were incorrectly using the TTI::TCK_RecipThroughput cost kind and ignoring the kind set in the context.

@llvmbot
Member

llvmbot commented Aug 12, 2025

@llvm/pr-subscribers-vectorizers
@llvm/pr-subscribers-llvm-transforms

Author: David Sherwood (david-arm)

Changes

We were incorrectly using the TTI::TCK_RecipThroughput cost kind and ignoring the kind set in the context.


Full diff: https://github.com/llvm/llvm-project/pull/153216.diff

1 file affected:

  • (modified) llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp (+1-2)
diff --git a/llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp b/llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
index e34cab117f321..a121f4f54845c 100644
--- a/llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
+++ b/llvm/lib/Transforms/Vectorize/VPlanRecipes.cpp
@@ -2944,7 +2944,6 @@ InstructionCost VPReplicateRecipe::computeCost(ElementCount VF,
   // transform, avoid computing their cost multiple times for now.
   Ctx.SkipCostComputation.insert(UI);
 
-  TTI::TargetCostKind CostKind = TTI::TCK_RecipThroughput;
   Type *ResultTy = Ctx.Types.inferScalarType(this);
   switch (UI->getOpcode()) {
   case Instruction::GetElementPtr:
@@ -2970,7 +2969,7 @@ InstructionCost VPReplicateRecipe::computeCost(ElementCount VF,
     auto Op2Info = Ctx.getOperandInfo(getOperand(1));
     SmallVector<const Value *, 4> Operands(UI->operand_values());
     return Ctx.TTI.getArithmeticInstrCost(
-               UI->getOpcode(), ResultTy, CostKind,
+               UI->getOpcode(), ResultTy, Ctx.CostKind,
                {TargetTransformInfo::OK_AnyValue, TargetTransformInfo::OP_None},
                Op2Info, Operands, UI, &Ctx.TLI) *
            (isSingleScalar() ? 1 : VF.getFixedValue());
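
To illustrate the shape of the change, here is a minimal, self-contained sketch; the enum, context struct, and cost query below are stand-ins I made up, not the real TTI or VPlan types. The point is only that the caller-selected cost kind already lives in the context, so the recipe's cost computation should pass that kind to TTI rather than a hard-coded TCK_RecipThroughput.

  #include <cstdio>

  // Hypothetical stand-ins loosely modelled on TTI::TargetCostKind and the
  // VPlan cost context; not the actual LLVM definitions.
  enum class TargetCostKind { RecipThroughput, Latency, CodeSize, SizeAndLatency };

  struct CostContext {
    TargetCostKind CostKind; // chosen once by the caller, e.g. CodeSize under -Oz
  };

  // Dummy cost query: pretend code-size costs differ from throughput costs.
  static unsigned queryArithmeticCost(TargetCostKind Kind) {
    return Kind == TargetCostKind::CodeSize ? 1u : 4u;
  }

  // Before the patch: a locally hard-coded kind ignores what the context asked for.
  static unsigned computeCostBefore(const CostContext &Ctx) {
    TargetCostKind CostKind = TargetCostKind::RecipThroughput; // hard-coded
    (void)Ctx;
    return queryArithmeticCost(CostKind);
  }

  // After the patch: honour the kind carried by the context.
  static unsigned computeCostAfter(const CostContext &Ctx) {
    return queryArithmeticCost(Ctx.CostKind);
  }

  int main() {
    CostContext Ctx{TargetCostKind::CodeSize};
    std::printf("before=%u after=%u\n", computeCostBefore(Ctx), computeCostAfter(Ctx));
    return 0;
  }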


Contributor

@fhahn left a comment


Thanks. Would be great to have a test case for this. I can check if I can surface anything

@david-arm
Contributor Author

> Thanks. Would be great to have a test case for this. I can check if I can surface anything

I don't think this is even possible at the moment, because this will currently only apply if you build with -Oz -fvectorize, but you can't even write a test case for this because we hit the error at the end of LoopVectorizationCostModel::computeMaxVF:

  reportVectorizationFailure(
      "Cannot optimize for size and vectorize at the same time.",

I suppose that really means this patch is NFC and should be harmless. I just wanted to change the CostKind for completeness.
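
For context, the configuration being described is roughly the following; the source and flags are my own illustrative sketch, not a test from this PR.

  // Hypothetical example only; compile with: clang++ -Oz -fvectorize -S example.cpp
  // With an unknown trip count, the vectorizer needs either a scalar epilogue or
  // tail folding, which is where the "Cannot optimize for size and vectorize at
  // the same time." diagnostic can be hit when optimizing for size.
  void saxpy(float *dst, const float *src, float a, int n) {
    for (int i = 0; i < n; ++i)
      dst[i] += a * src[i];
  }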

@fhahn
Contributor

fhahn commented Aug 14, 2025

> > Thanks. Would be great to have a test case for this. I can check if I can surface anything
>
> I don't think this is even possible at the moment, because this will currently only apply if you build with -Oz -fvectorize, but you can't even write a test case for this because we hit the error at the end of LoopVectorizationCostModel::computeMaxVF:
>
>   reportVectorizationFailure(
>       "Cannot optimize for size and vectorize at the same time.",
>
> I suppose that really means this patch is NFC and should be harmless. I just wanted to change the CostKind for completeness.

Hm, I think we should definitely hit VPReplicateRecipe::computeCost with different CostKinds, e.g. for this test https://github.com/llvm/llvm-project/blob/main/llvm/test/Transforms/LoopVectorize/AArch64/optsize_minsize.ll#L221

For Os/Oz, the requirement is that there is no scalar tail + no runtime checks IIRC, and the message is for that case.

@david-arm
Contributor Author

> > > Thanks. Would be great to have a test case for this. I can check if I can surface anything
> >
> > I don't think this is even possible at the moment, because this will currently only apply if you build with -Oz -fvectorize, but you can't even write a test case for this because we hit the error at the end of LoopVectorizationCostModel::computeMaxVF:
> >
> >   reportVectorizationFailure(
> >       "Cannot optimize for size and vectorize at the same time.",
> >
> > I suppose that really means this patch is NFC and should be harmless. I just wanted to change the CostKind for completeness.
>
> Hm, I think we should definitely hit VPReplicateRecipe::computeCost with different CostKinds, e.g. for this test https://github.com/llvm/llvm-project/blob/main/llvm/test/Transforms/LoopVectorize/AArch64/optsize_minsize.ll#L221
>
> For Os/Oz, the requirement is that there is no scalar tail + no runtime checks IIRC, and the message is for that case.

Nope, we never create replicate recipes at all there, as the debug output shows:

LV: Not considering vector loop of width 2 because it would cause replicated blocks to be generated, which isn't allowed when optimizing for size.
LV: Not considering vector loop of width 4 because it would cause replicated blocks to be generated, which isn't allowed when optimizing for size.
LV: Not considering vector loop of width 8 because it would cause replicated blocks to be generated, which isn't allowed when optimizing for size.
LV: Not considering vector loop of width 16 because it would cause replicated blocks to be generated, which isn't allowed when optimizing for size.

@fhahn
Contributor

fhahn commented Aug 14, 2025

We never create replicating replicate recipes, but we can generate single-scalar replicate recipes, e.g. a load of a uniform address here https://llvm.godbolt.org/z/6bvnrY965
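
The godbolt example is not reproduced here, but the shape I assume it has is something like the sketch below: the load's address is loop-invariant, so the vectorizer can keep it as a single scalar load (a single-scalar replicate recipe) instead of replicating it per lane.

  // Assumed shape only -- the actual code behind the godbolt link may differ.
  int Bias; // global, so &Bias is a uniform (loop-invariant) address
  void addBias(int *dst, int n) {
    for (int i = 0; i < n; ++i)
      dst[i] += Bias; // the load from &Bias stays in the loop unless aliasing is disproved
  }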

@david-arm
Contributor Author

> We never create replicating replicate recipes, but we can generate single-scalar replicate recipes, e.g. a load of a uniform address here https://llvm.godbolt.org/z/6bvnrY965

OK, I can try playing around with variations of this, but the test case shown above certainly doesn't exercise the code I've changed, because the CLONE recipes are only for loads and GEPs. The loads aren't covered because we fall back on the legacy cost model (which I have actually fixed in #153218), and the GEPs always return a cost of 0. I can either combine this PR with #153218, or land #153218 first.

Contributor

@fhahn left a comment


LGTM, thanks. I checked on a large input set for both X86 and AArch64 and there were no differences, so it's likely not really feasible to write a test case.

@david-arm
Contributor Author

Ran make check-all downstream and it looks fine.

@david-arm merged commit 7ee6cf0 into llvm:main Aug 18, 2025
12 checks passed
