Skip to content
Discussion options

You must be logged in to vote

Theoretically it seems possible to weave a function pointer into a computations graph, I agree. However, in practice, the Optix/CUDA kernels we launch have certain limitations. We cannot arbitrarily insert any function into such a kernel.
The workaround is to split the kernels: have it stop right before the pytorch/tensorflow/etc. function and start again right after the function returned. This is what "wavefront mode" does and it is enabled by disabling recording features with the two lines I mentioned above.

No, that is incorrect. Recording has no influence on the result of any computation, it is simply a performance consideration. With recording features, we are able to generate a sing…

Replies: 1 comment 5 replies

Comment options

You must be logged in to vote
5 replies
@KoykL
Comment options

@KoykL
Comment options

@njroussel
Comment options

Answer selected by KoykL
@KoykL
Comment options

@wjakob
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants