Skip to content

Commit 9f82e6c

Browse files
htyumeta-codesync[bot]
authored andcommitted
[TLX] Fix an issue in the blackwell gemm tutorial (#712)
Summary: Pull Request resolved: #712 Reviewed By: dshi7 Differential Revision: D88216358 Pulled By: htyu fbshipit-source-id: bdd18b1eca23ace237fe844e43e89de6e09689d1
1 parent 3584a9a commit 9f82e6c

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

third_party/tlx/tutorials/blackwell-gemm-ws_test.py

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -295,8 +295,7 @@ def benchmark(M, N, K, provider):
295295
ms, min_ms, max_ms = triton.testing.do_bench(lambda: torch.matmul(a, b), quantiles=quantiles, warmup=2000,
296296
rep=2000)
297297
if provider == "triton_persistent":
298-
ms, min_ms, max_ms = triton.testing.do_bench(lambda: matmul(a, b, False), quantiles=quantiles, warmup=2000,
299-
rep=2000)
298+
ms, min_ms, max_ms = triton.testing.do_bench(lambda: matmul(a, b), quantiles=quantiles, warmup=2000, rep=2000)
300299

301300
perf = lambda ms: 2 * M * N * K * 1e-12 / (ms * 1e-3)
302301
return perf(ms), perf(max_ms), perf(min_ms)

0 commit comments

Comments
 (0)