Skip to content

Commit a4ed93e

Browse files
aseembits93claude
andcommitted
feat: add gpu flag for CUDA event-based timing
Add a `gpu` parameter to instrument tests with torch.cuda.Event timing instead of time.perf_counter_ns() for measuring GPU kernel execution time. Falls back to CPU timing when CUDA is not available/initialized. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
1 parent 953b0fb commit a4ed93e

File tree

2 files changed

+829
-81
lines changed

2 files changed

+829
-81
lines changed

0 commit comments

Comments
 (0)