[NVBug 5659126] The same workaround for RMSNorm exporting for diffusers >=0.35.0 (#642)

shengliangxu · web-flow · commit cfb247a1f3e8 · 2025-12-04T05:16:36.000Z
## What does this PR do?

**Type of change:** Bug fix

**Overview:** Applies the same RMSNorm ONNX export workaround to the `diffusion_trt.py` script.
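
The workaround rebinds `torch.nn.RMSNorm` at the top of the script, before the remaining imports. A minimal sketch of why that ordering likely matters, using stand-in classes and a stand-in namespace (all names below are hypothetical and only mimic the real `torch` and `diffusers` objects): code that looks the class up at call time sees the replacement, while a name bound before the patch keeps the original.

```python
from types import SimpleNamespace


class TorchRMSNorm:
    """Stand-in for torch.nn.RMSNorm (hypothetical, not the real class)."""


class DiffuserRMSNorm:
    """Stand-in for diffusers.models.normalization.RMSNorm (hypothetical)."""


# Stand-in for the torch.nn namespace.
nn = SimpleNamespace(RMSNorm=TorchRMSNorm)

# A name bound before the patch keeps pointing at the original class,
# like `from torch.nn import RMSNorm` executed early in some module.
early_binding = nn.RMSNorm


def build_norm():
    # The attribute lookup happens at call time, like `torch.nn.RMSNorm(...)`
    # inside a model constructor.
    return nn.RMSNorm()


# The patch: rebind the attribute on the namespace.
nn.RMSNorm = DiffuserRMSNorm

late = build_norm()      # constructed after the patch, via attribute lookup
early = early_binding()  # constructed through the pre-patch binding
```

Under these assumptions, `late` is an instance of the replacement class and `early` is still the original, which is why a patch of this kind has to run before any module that might capture the class by name.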


## Testing

python diffusion_trt.py --model flux-dev --benchmark --skip-image

Signed-off-by: Shengliang Xu <shengliangx@nvidia.com>
diff --git a/examples/diffusers/quantization/diffusion_trt.py b/examples/diffusers/quantization/diffusion_trt.py
@@ -18,6 +18,18 @@
 
 import numpy as np
 import torch
+
+# This is a workaround for making the onnx export of models that use the torch RMSNorm work. We will
+# need to move on to use dynamo based onnx export to properly fix the problem. The issue has been hit
+# by both external users https://github.com/NVIDIA/TensorRT-Model-Optimizer/issues/262, and our
+# internal users from MLPerf Inference.
+#
+if __name__ == "__main__":
+    from diffusers.models.normalization import RMSNorm as DiffuserRMSNorm
+
+    torch.nn.RMSNorm = DiffuserRMSNorm
+    torch.nn.modules.normalization.RMSNorm = DiffuserRMSNorm
+
 from onnx_utils.export import (
     _create_trt_dynamic_shapes,
     generate_dummy_inputs_and_dynamic_axes_and_shapes,