- 
                Notifications
    You must be signed in to change notification settings 
- Fork 79
[0.10] Forbid divergent execution of work-group barriers #558
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|  | 
| Benchmark Results
 Benchmark PlotsA plot of the benchmark results have been uploaded as an artifact to the workflow run for this PR. | 
| This currently doesn't lower correctly:  | 
5e3e1f4    to
    a48a158      
    Compare
  
    | Codecov ReportAttention: Patch coverage is  
 
 Additional details and impacted files@@          Coverage Diff          @@
##            main    #558   +/-   ##
=====================================
  Coverage   0.00%   0.00%           
=====================================
  Files         21      21           
  Lines       1584    1575    -9     
=====================================
+ Misses      1584    1575    -9     ☔ View full report in Codecov by Sentry. | 
| 
 This stack of pull requests is managed by Graphite. Learn more about stacking. | 
8072f4c    to
    f014e38      
    Compare
  
    f014e38    to
    cd60145      
    Compare
  
    cd60145    to
    7b88e3e      
    Compare
  
    | With pocl#main only two tests are still failing. 
 | 
7b88e3e    to
    31f8f5f      
    Compare
  
    a5f740a    to
    210658c      
    Compare
  
    210658c    to
    7e448d1      
    Compare
  
    7e448d1    to
    b58c830      
    Compare
  
    b58c830    to
    58ed8cc      
    Compare
  
    58ed8cc    to
    1163b32      
    Compare
  
    
As noted by @maleadt in JuliaGPU/OpenCL.jl#283 (comment)
Several backends have a strict requirement that barriers like
@synchronizemust be executed convergent,and the automatic bounds-checking in KA violates that.
Before GPU kernels lowered like:
Now they lower to: