Skip to content

Conversation

@jdenny-ornl
Copy link
Collaborator

@jdenny-ornl jdenny-ornl commented Sep 30, 2024

This patch combines the capabilities of PGO GPU support (PR #94268) and the kernel-info pass (PR #102944). In particular, it implements an estimation of the number of floating point operations executed or bytes moves for a GPU code. Example usage can be found in llvm/docs/KernelInfo.rst.

This PR formerly only supported -fprofile-instrument=clang. This commit adds support for -fprofile-instrument=llvm
Replace getPointerBitCastOrAddrSpaceCast with getAddrSpaceCast and allow no-op getAddrSpaceCast calls when types are identical
TODO: Fix tests
@github-actions
Copy link

github-actions bot commented Dec 27, 2024

✅ With the latest revision this PR passed the undef deprecator.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants