Open
Description
Inner loops in the CUDA convolution code should run faster with a `#pragma unroll` directive.
- `#pragma unroll N` in CUDA
- `#pragma unroll N` in OpenCL
- Unavailable for OpenACC, but try the `-Munroll` flag with pgcc
- `#pragma unroll N` for icc
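As a rough illustration of the CUDA case, here is a minimal sketch of a 1D convolution kernel with an unrolled inner loop. It is not the project's actual convolution code: `conv1d`, `KERNEL_WIDTH`, `d_filter`, and the filter values are hypothetical names and data chosen for the example. It assumes the filter width is known at compile time, which is what lets the compiler unroll the loop completely.

```cuda
#include <cmath>
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical filter width, fixed at compile time so the loop can be fully unrolled.
constexpr int KERNEL_WIDTH = 5;

// Hypothetical filter coefficients, kept in constant memory.
__constant__ float d_filter[KERNEL_WIDTH];

// "Valid" 1D convolution in correlation form: out[i] = sum_k in[i + k] * filter[k].
__global__ void conv1d(const float* in, float* out, int n_out)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n_out) return;

    float acc = 0.0f;
    // Ask the compiler to unroll the inner loop; "#pragma unroll N" would unroll by a factor of N.
#pragma unroll
    for (int k = 0; k < KERNEL_WIDTH; ++k) {
        acc += in[i + k] * d_filter[k];
    }
    out[i] = acc;
}

int main()
{
    const int n_out = 1 << 16;
    const int n_in = n_out + KERNEL_WIDTH - 1;
    const float h_filter[KERNEL_WIDTH] = {0.1f, 0.2f, 0.4f, 0.2f, 0.1f};

    std::vector<float> h_in(n_in), h_out(n_out);
    for (int i = 0; i < n_in; ++i) h_in[i] = static_cast<float>(i % 17);

    float* d_in = nullptr;
    float* d_out = nullptr;
    cudaMalloc(&d_in, n_in * sizeof(float));
    cudaMalloc(&d_out, n_out * sizeof(float));
    cudaMemcpy(d_in, h_in.data(), n_in * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpyToSymbol(d_filter, h_filter, sizeof(h_filter));

    const int threads = 256;
    const int blocks = (n_out + threads - 1) / threads;
    conv1d<<<blocks, threads>>>(d_in, d_out, n_out);
    cudaError_t err = cudaGetLastError();
    if (err != cudaSuccess) {
        std::printf("kernel launch failed: %s\n", cudaGetErrorString(err));
        return 1;
    }
    cudaMemcpy(h_out.data(), d_out, n_out * sizeof(float), cudaMemcpyDeviceToHost);

    // Compare against a plain CPU reference, with a tolerance for float rounding.
    int mismatches = 0;
    for (int i = 0; i < n_out; ++i) {
        float ref = 0.0f;
        for (int k = 0; k < KERNEL_WIDTH; ++k) ref += h_in[i + k] * h_filter[k];
        if (std::fabs(ref - h_out[i]) > 1e-4f) ++mismatches;
    }
    std::printf("mismatches: %d\n", mismatches);

    cudaFree(d_in);
    cudaFree(d_out);
    return mismatches == 0 ? 0 : 1;
}
```

nvcc may already unroll small loops with a compile-time trip count on its own, so any speedup from the pragma should be measured rather than assumed. If the filter width is only known at run time, `#pragma unroll N` with an explicit factor is the variant to try.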