how to generate a kernel with cuda c code implementation with triton #461

TigerYang414 · 2022-02-14T13:14:04Z

I need to run the kernel from C++ code for online inference.

ptillet (Maintainer) · 2022-02-20T03:20:20Z

Unfortunately, this is not possible. You can get the PTX, though calling it for inference would require more glue code.

yiakwy-xpu-ml-framework-team · 2024-11-26T12:30:47Z

Yes, you can. Triton generates a cubin, which you can load in a C file as a CUmodule and then execute with cuLaunchKernel. That gives you C host code that launches the kernel.

@ptillet @TigerYang414
