Is there a way to disable the fused Act(Conv+Bias+Z) operation? #15048
-
Yes, you can disable the fused Act(Conv+Bias+Z) operation by controlling how the layers are constructed and fused. In your case, it seems you're using a model like PP-HGNetV2 from PaddleClas, where blocks such as ConvBNAct or DiverseBranchBlock can fuse convolution, bias, activation, and potentially an additional z term (e.g., an identity or residual input).

By default, however, these fusions are manually configured during model definition and parameter fusion (e.g., via reparameterization in DiverseBranchBlock or affine scaling in LearnableAffineBlock). To disable the fused Act(Conv+Bias+Z) pattern, consider the following:
If you are using your own custom module or modifying a model like PPHGNetV2, ensure that in forward passes you avoid:
- applying the activation directly to a combined expression such as act(conv(x) + bias + z); instead compute the convolution, bias/affine term, residual add, and activation as separate ops;
- triggering the reparameterization/fusion step (e.g., folding BatchNorm or affine parameters into the convolution kernel) before inference or export.
A sketch of an unfused block follows below.
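For instance, here is a minimal sketch of a block that keeps every step as its own op; the `UnfusedConvBlock` name and layer choices are illustrative assumptions, not code from PaddleClas:

```python
import paddle.nn as nn


class UnfusedConvBlock(nn.Layer):
    """Hypothetical block: conv, norm, residual add, and activation stay
    separate ops, so no Act(Conv+Bias+Z) pattern is formed by construction."""

    def __init__(self, in_ch, out_ch):
        super().__init__()
        # bias_attr=False keeps the bias out of the conv, so nothing is folded in
        self.conv = nn.Conv2D(in_ch, out_ch, kernel_size=3, padding=1,
                              bias_attr=False)
        self.bn = nn.BatchNorm2D(out_ch)
        self.act = nn.ReLU()

    def forward(self, x, z=None):
        y = self.conv(x)    # convolution only
        y = self.bn(y)      # BN kept as its own op, not reparameterized into conv
        if z is not None:
            y = y + z       # residual/identity add as an explicit separate op
        return self.act(y)  # activation applied last, on its own
```

Because the add and the activation remain distinct ops, there is no single fused Act(Conv+Bias+Z) kernel for a graph pass or exporter to emit.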
Since you mentioned that you only have Act(a1 * Conv + Bias) and not the extra z term, you're likely using a model configuration without explicit residual or fused affine/identity branches. That's expected if you're not using, for instance, DiverseBranchBlock with all branches enabled. If needed, you can isolate Act(Conv + Bias) further by replacing composite layers with atomic ones (Conv2D, BatchNorm2D, ReLU) and controlling exactly what gets fused; a sketch of that case follows below. Let me know your specific model definition if you'd like exact code replacements.

Response generated by 🤖 feifei-bot | chatgpt-4o-latest
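For the Act(a1 * Conv + Bias) case specifically, here is a hedged sketch of keeping the affine scale and bias unfused; `SeparateAffineAct` is a hypothetical, simplified stand-in for what a LearnableAffineBlock-style layer does, not the actual PaddleClas implementation:

```python
import paddle.nn as nn


class SeparateAffineAct(nn.Layer):
    """Hypothetical layer keeping a1 * Conv + Bias unfused: the learnable
    scale and bias are applied as explicit ops rather than being folded
    into the convolution kernel via reparameterization."""

    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.conv = nn.Conv2D(in_ch, out_ch, kernel_size=3, padding=1,
                              bias_attr=False)
        # scalar learnable affine parameters, i.e. a1 and the bias term
        self.scale = self.create_parameter(
            shape=[1], default_initializer=nn.initializer.Constant(1.0))
        self.bias = self.create_parameter(
            shape=[1], default_initializer=nn.initializer.Constant(0.0))
        self.act = nn.ReLU()

    def forward(self, x):
        y = self.conv(x)                # Conv
        y = self.scale * y + self.bias  # a1 * Conv + Bias, left as separate ops
        return self.act(y)              # Act applied on its own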
-
For me, Act(a1 * Conv + Bias) exists, but I don't have the + z operation.