Commit 84de9d6
[SYCL] Improve sycl-post-link performance with -split=kernel (#6689)
Right now we compute a new call graph on every call to
extractCallGraph. extractCallGraph is called every time we split a
module, so with -split=kernel that is once per kernel. For modules
with many kernels, this can take a very long time. We only need to
compute the call graph once because the input IR doesn't seem to
change between splits.
This reduces sycl-post-link run time from ~45 min to ~7 min for an
example with 13k kernels.
Signed-off-by: Sarnie, Nick <[email protected]>
1 parent 32a2777 commit 84de9d6
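The idea in the commit message can be illustrated with a small, self-contained sketch. It is not the sycl-post-link code and does not use LLVM APIs: `ToyModule`, `ToyCallGraph`, `collectReachable`, `splitRebuilding`, `splitCached`, and the `NumGraphBuilds` counter are all hypothetical names invented for illustration. It assumes, as the message does, that the input does not change between splits, so one call graph can serve every per-kernel split.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-in for a module: function name -> names it calls.
using ToyModule = std::map<std::string, std::vector<std::string>>;

// Counts how many times a call graph is built (illustration only).
static std::size_t NumGraphBuilds = 0;

// Hypothetical call graph. In a real compiler, building it means walking
// every instruction in the module, which is the expensive step.
struct ToyCallGraph {
  std::map<std::string, std::vector<std::string>> Callees;
  explicit ToyCallGraph(const ToyModule &M) : Callees(M) { ++NumGraphBuilds; }
};

// Collect every function reachable from Entry using a prebuilt graph.
std::set<std::string> collectReachable(const ToyCallGraph &CG,
                                       const std::string &Entry) {
  std::set<std::string> Seen;
  std::vector<std::string> Worklist{Entry};
  while (!Worklist.empty()) {
    std::string F = Worklist.back();
    Worklist.pop_back();
    if (!Seen.insert(F).second)
      continue; // already visited
    auto It = CG.Callees.find(F);
    if (It == CG.Callees.end())
      continue; // unknown function: treat as a leaf
    for (const std::string &Callee : It->second)
      Worklist.push_back(Callee);
  }
  return Seen;
}

// Before: one call graph per kernel, so graph construction is repeated
// once for every split.
std::vector<std::set<std::string>>
splitRebuilding(const ToyModule &M, const std::vector<std::string> &Kernels) {
  std::vector<std::set<std::string>> Parts;
  for (const std::string &K : Kernels) {
    ToyCallGraph CG(M); // rebuilt on every iteration
    Parts.push_back(collectReachable(CG, K));
  }
  return Parts;
}

// After: the call graph is built once and shared by all splits.
std::vector<std::set<std::string>>
splitCached(const ToyModule &M, const std::vector<std::string> &Kernels) {
  ToyCallGraph CG(M); // built a single time
  std::vector<std::set<std::string>> Parts;
  for (const std::string &K : Kernels)
    Parts.push_back(collectReachable(CG, K));
  return Parts;
}
```

Both functions produce the same per-kernel function sets; only the number of graph constructions differs (one per kernel versus one in total). Sharing the graph is safe only while the module stays unmodified between splits, which is the assumption the commit message itself hedges on.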
1 file changed, with 9 additions and 4 deletions, in two hunks (original lines 391-399 and 414-424). The changed lines themselves are not reproduced here.