Skip to content

Commit 4dfdc32

Browse files
authored
[Triton] Add SCCP to the end of make_ttgir (#7069)
Speculative placement of the pass. It needs to run after some IR simplification but before canonicalizer, typically.
1 parent 4742de0 commit 4dfdc32

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

third_party/nvidia/backend/compiler.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -291,6 +291,7 @@ def make_ttgir(mod, metadata, opt, capability):
291291
if capability // 10 >= 9:
292292
nvidia.passes.ttnvgpuir.add_tma_lowering(pm)
293293
nvidia.passes.ttnvgpuir.add_fence_insertion(pm)
294+
passes.common.add_sccp(pm)
294295
passes.common.add_canonicalizer(pm)
295296
pm.run(mod)
296297
metadata["cluster_dims"] = (cluster_info.clusterDimX, cluster_info.clusterDimY, cluster_info.clusterDimZ)

0 commit comments

Comments
 (0)