Commit 03a0321
authored
[Frontend] Avoid
This takes `attention_kernel.get_capture_scope()` from 0.5 ms to 47 ns
or a little over a 10,000x speedup.
I see a combined 250ms compile time improvement in the gluon attention
example benchmarks.inspect.getclosurevars (triton-lang#8831)1 parent 9b27dff commit 03a0321
1 file changed
+5
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
489 | 489 | | |
490 | 490 | | |
491 | 491 | | |
492 | | - | |
| 492 | + | |
| 493 | + | |
| 494 | + | |
| 495 | + | |
| 496 | + | |
493 | 497 | | |
494 | 498 | | |
495 | 499 | | |
| |||
0 commit comments