how to generate a kernel with cuda c code implementation with triton #461
Unanswered
TigerYang414
asked this question in
Q&A
Replies: 2 comments
-
Unfortunately this is not possible, but you can get the PTX -- though this would require more glue code to call it for inference |
Beta Was this translation helpful? Give feedback.
0 replies
-
Yes you can. Triton will generate cubin, then you can use CUmodule to load them in c file then executed in cuLaunchKernel. That means now you have the c source for the kernel. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
need to run with c++ code with online infer
Beta Was this translation helpful? Give feedback.
All reactions