-
Notifications
You must be signed in to change notification settings - Fork 15.5k
[mlir][gpu] Fix crash in transform.gpu.map_nested_forall_to_threads with zero iterations #170282
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
…ith zero iterations
|
✅ With the latest revision this PR passed the C/C++ code formatter. |
🐧 Linux x64 Test Results
✅ The build succeeded and all tests passed. |
|
I observe a discrepancy between the CI results and my local environment (Ubuntu 24.04 LTS on WSL2, RelWithDebInfo build). In CI, the test fails early with requires statically sized, normalized forall op, while my local build bypasses this check and instead fails later in the pipeline with: With that in mind, I pushed a follow-up commit to this PR, “Fix: revert error message change” (82379d1), |
|
I've submitted a fix for the crash in Thanks! |
|
In the commit 799bb9b, I replaced Previously, the side effect was lost, causing the function to return a vector of zeros instead of |
|
After some testing in my local environment, I found that the problem could be solved by leaving only the commit 799bb9b and deleting the others. |
Fix crash in
checkMappingSpecfor zero iterations or block/grid dimscomputeProductnow only runs whennumParallelIterationsis nonzero and we reject non‑positive block/grid sizes early, eliminating thetransform.gpumap crash and surfacing a clear diagnostic (see added test in mlir/test/Dialect/GPU/transform-gpu-failing.mlir).Fixes: #73562