Commit 9c049d9

Improve comments for target GPU_ARCH
1 parent 8b72347 commit 9c049d9

1 file changed: +5 -3 lines changed

op_builder/evoformer_attn.py

Lines changed: 5 additions & 3 deletions
@@ -16,9 +16,11 @@ def __init__(self, name=None):
         name = self.NAME if name is None else name
         super().__init__(name=name)
         self.cutlass_path = os.environ.get("CUTLASS_PATH")
-        # Target GPU architecture
-        # Current useful values: >70, >75, >80, see gemm_kernel_utils.h
-        # For modern GPUs, >80 is obfiously the right value
+        # Target GPU architecture.
+        # Current useful values are: 70, 75, 80.
+        # For modern GPUs, 80 is the right value.
+        # No specializations of the kernel beyond Ampere are implemented
+        # See gemm_kernel_utils.h (also in cutlass example for fused attention) and cutlass/arch/arch.h
         self.gpu_arch = os.environ.get("DS_EVOFORMER_GPU_ARCH")
 
     def absolute_name(self):
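
The new comment says the useful values of DS_EVOFORMER_GPU_ARCH are 70, 75 and 80, written without the leading ">" used in the old comment. A minimal sketch of how a caller might check the variable before building is given below. The helper name resolve_gpu_arch and the SUPPORTED_ARCHS tuple are hypothetical and not part of this commit or of the builder; treating the three values as a strict allow-list is an assumption taken from the comment text only.

```python
import os

# Values listed in the updated comment. Using them as a strict allow-list
# is an assumption of this sketch, not behavior confirmed by the builder.
SUPPORTED_ARCHS = (70, 75, 80)


def resolve_gpu_arch(env=None):
    """Hypothetical helper: parse DS_EVOFORMER_GPU_ARCH, or return None if unset."""
    env = os.environ if env is None else env
    raw = env.get("DS_EVOFORMER_GPU_ARCH")
    if raw is None or raw.strip() == "":
        return None  # variable not set; the caller decides on a default
    try:
        arch = int(raw)
    except ValueError:
        # Catches old-style values such as ">80" from the previous comment.
        raise ValueError(
            f"DS_EVOFORMER_GPU_ARCH must be an integer, got {raw!r}"
        ) from None
    if arch not in SUPPORTED_ARCHS:
        raise ValueError(
            f"DS_EVOFORMER_GPU_ARCH={arch} is not one of {SUPPORTED_ARCHS}"
        )
    return arch
```

Under these assumptions, a GPU newer than Ampere would still be configured with 80, since the comment states that no kernel specializations beyond Ampere are implemented.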
