You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Change compile_kernel to use threads_per_warp specified in metadata (#4814)
Intel Triton selects different `threads_per_warp` based on the kernel,
and stores the selected `threads_per_warp` in metadata.
This PR changes `compile_kernel` to use the stored `threads_per_warp` in
metadata.
This PR fixes below error with `igc-19724`:
```
terminate called after throwing an instance of 'sycl::_V1::exception'
what(): The specified local size {1, 1, 32} doesn't match the required work-group size specified in the program source {1, 1, 16}
```
CI with `igc-19724` + this change:
https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/16662411889
0 commit comments