Skip to content
Discussion options

You must be logged in to vote

That's exactly right. We may want to use multiple CUDA streams (I can't recall how much parallelism it will do if you don't).. but I would probably do the simple thing first (e.g. just launch kernels the default way). Maybe checkout the docs on CUDA streams and just to make sure we design in a way that's extensible to that if needed.

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@zcbenz
Comment options

zcbenz Mar 12, 2025
Maintainer Author

@awni
Comment options

awni Mar 12, 2025
Maintainer

@zcbenz
Comment options

zcbenz Mar 12, 2025
Maintainer Author

@awni
Comment options

awni Mar 13, 2025
Maintainer

Answer selected by zcbenz
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants