[Triton-MLIR] Fix a few minor issues #689
Merged
ptillet merged 4 commits into triton-lang:triton-mlir from Sep 22, 2022
Contributor
Author
|
This PR also updates the version of With this change, it passes tests on |
Contributor
I don't quite get this one. Doesn't the following code also fail when nullptr is passed? |
Contributor
Author
|
Thanks for taking a look at this PR @Jokeren! Sorry for the confusion. The first commit handles the last "NG" case:
// Assume constantMaskCandidate is nullptr
auto *constantMaskCandidate = mask.getDefiningOp();
// OK
if (!constantMaskCandidate)
return mlir::failure();
// OK: dynamic_cast<> can take nullptr
#if 0
// The following code does not work because LLVM does not enable RTTI.
if (!dynamic_cast<arith::ConstantOp *>(constantMaskCandidate))
return mlir::failure();
#endif
// NG: llvm::dyn_cast<> cannot take nullptr (it triggers an assert() or a SEGV) for some reason
if (!llvm::dyn_cast<arith::ConstantOp>(constantMaskCandidate))
return mlir::failure(); |
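The contrast above can be sketched without LLVM as a dependency. The block below is a minimal, self-contained illustration that assumes toy stand-ins: `ToyOp`, `ToyConstantOp`, `toyDynCast`, `toyDynCastOrNull`, and `isConstantMask` are hypothetical names, not part of LLVM, MLIR, or Triton. It mimics the documented contract that `llvm::dyn_cast<>` requires a non-null argument, while a null-tolerant variant (LLVM ships one as `llvm::dyn_cast_or_null<>`) checks for null first. It is not meant to show which form this PR finally adopted.

```cpp
#include <cassert>

// Toy stand-ins for an operation and a typed op wrapper (hypothetical names).
enum class OpKind { Constant, Other };

struct ToyOp {
  OpKind kind;
};

struct ToyConstantOp {
  ToyOp *op;
  // LLVM-style type test: is this operation a constant?
  static bool classof(const ToyOp *o) { return o->kind == OpKind::Constant; }
  explicit operator bool() const { return op != nullptr; }
};

// Mirrors the dyn_cast contract: the argument must be non-null.
template <typename To> To toyDynCast(ToyOp *op) {
  assert(op && "toyDynCast called on a null pointer");
  return To::classof(op) ? To{op} : To{nullptr};
}

// Null-tolerant variant: a null input yields a null (false) result.
template <typename To> To toyDynCastOrNull(ToyOp *op) {
  return op ? toyDynCast<To>(op) : To{nullptr};
}

// Returns true only if a defining op exists and is a constant.
// A null definingOp models a value with no defining operation.
bool isConstantMask(ToyOp *definingOp) {
  return static_cast<bool>(toyDynCastOrNull<ToyConstantOp>(definingOp));
}
```

With this shape, a caller can bail out of a pattern when `isConstantMask` returns false, without a separate null check before the cast.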
Collaborator
|
What about |
Contributor
Author
|
Thanks, @ptillet. You are completely right. I updated the first commit to use |
In Triton-MLIR's combine pass, I didn't check whether `getDefiningOp()` returns `nullptr`. Unlike `dynamic_cast`, `llvm::dyn_cast` fails if `nullptr` is passed. The first two commits fix this issue and add tests to check the behavior.
The `Pipeline` transformation in TritonGPU-MLIR uses a wrong index to access the `mask` argument of `insert_slice_async`. The last commit fixes this issue.
(Note that these issues were found while trying to compile the tutorial matmul code.)