- 
                Notifications
    
You must be signed in to change notification settings  - Fork 74
 
          [XPU][TritonGPUToLLVM] Use llvm.func attributes to express kernels ND-ranges
          #2770
        
          New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…ND-ranges Use `llvm.func` `reqd_work_group_size` and `intel_reqd_sub_group_size` to express ND-range dimensions instead of `triton_gen` attributes that are later translated. Signed-off-by: victor-eds <[email protected]>
| 
           The changes LGTM. But I am not sure about the reason of the old code. Let @whitneywhtsang to approve.  | 
    
| 
           There are some   | 
    
          
 Interesting. I'll look into that tomorrow.  | 
    
          
 Confirmed it is due to changing from   | 
    
          
 Interesting. I'll take a look.  | 
    
          
 @whitneywhtsang @etiotto Apparently not setting anything at all fixes crashes. I'd go with this for now, get this merged to get going and open an investigation ticket to tackle ASAP. The  My guess is we're modifying the number of warps or warp size at some point during this lowering process and this mismatch leads to crashes. Does this course of action sound good?  | 
    
| 
           I'll restore back   | 
    
7cb8d7d    to
    f0a2fcc      
    Compare
  
    | 
           I will keep the dense array specification for   | 
    
        
          
                third_party/intel/lib/Target/LLVMIR/Dialect/TritonGEN/TritonGENToLLVMIRTranslation.cpp
              
                Outdated
          
            Show resolved
            Hide resolved
        
      | 
           @etiotto are we OK with this now that we're keeping the   | 
    
          
 yes  | 
    
Use
llvm.funcintel_reqd_sub_group_sizeto express sub-group size instead oftriton_genattributes that are later translated.Replace
triton_gen.max_work_group_sizevalue type with dense array.