Commit a4ed93e
feat: add gpu flag for CUDA event-based timing
Add a `gpu` parameter to instrument tests with torch.cuda.Event timing
instead of time.perf_counter_ns() for measuring GPU kernel execution time.
Falls back to CPU timing when CUDA is not available/initialized.
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>1 parent 953b0fb commit a4ed93e
File tree
2 files changed
+829
-81
lines changed- codeflash/code_utils
- tests
2 files changed
+829
-81
lines changed
0 commit comments