Skip to content
Discussion options

You must be logged in to vote

Fixed it, lightning is now as fast as my previous implementation, the problem was elsewhere but I didn't detect it using the profiler because of the asynchronous computation from GPUs which were not synchronized during profiling.

Replies: 2 comments 5 replies

Comment options

You must be logged in to vote
2 replies
@wangbingnan136
Comment options

@akihironitta
Comment options

Answer selected by juliendenize
Comment options

You must be logged in to vote
3 replies
@akihironitta
Comment options

@juliendenize
Comment options

@Bhathiya-hw
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
accelerator: cuda Compute Unified Device Architecture GPU
4 participants