-
Notifications
You must be signed in to change notification settings - Fork 79
[0.10] Forbid divergent execution of work-group barriers #558
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Benchmark Results
Benchmark PlotsA plot of the benchmark results have been uploaded as an artifact to the workflow run for this PR. |
|
This currently doesn't lower correctly: |
5e3e1f4 to
a48a158
Compare
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #558 +/- ##
=====================================
Coverage 0.00% 0.00%
=====================================
Files 21 21
Lines 1584 1575 -9
=====================================
+ Misses 1584 1575 -9 ☔ View full report in Codecov by Sentry. |
This stack of pull requests is managed by Graphite. Learn more about stacking. |
8072f4c to
f014e38
Compare
f014e38 to
cd60145
Compare
cd60145 to
7b88e3e
Compare
|
With pocl#main only two tests are still failing.
|
7b88e3e to
31f8f5f
Compare
a5f740a to
210658c
Compare
210658c to
7e448d1
Compare
7e448d1 to
b58c830
Compare
b58c830 to
58ed8cc
Compare
58ed8cc to
1163b32
Compare

As noted by @maleadt in JuliaGPU/OpenCL.jl#283 (comment)
Several backends have a strict requirement that barriers like
@synchronizemust be executed convergent,and the automatic bounds-checking in KA violates that.
Before GPU kernels lowered like:
Now they lower to: