JAX CUDNN Unknown Status Error. #6332
Replies: 2 comments 3 replies
-
I've had the best success running CUDA-accelerated JAX inside a tensorflow-gpu image. There, all of the library locations "just work" and are correct. See my discussion. |
Beta Was this translation helpful? Give feedback.
-
I'm seeing a similar @AndrewCSQ I tried using your docker image, but it unfortunately the error still occurs. EDIT: Solution here: huggingface/transformers#13416 (comment) Seems like the problem is that JAX tries to pre-allocate too much memory. I ran |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
JAX error -- CUDNN Unknown status
I looked through some of the other issues but couldn't find anything particularly related to this. For reference, I installed CUDA, CUDNN, verified the install with the mnistCUDNN file. Tried
nvcc -V
which returned the following:I ran a convolutional neural network with per example gradients and the following was the error:
The final traceback is as follows:
I checked my LD_LIBRARY_PATH and my PATH and they appear to have the right location present in them. This is also with CUDA 11.2 and the latest versions of JAX/jaxlib that are present in the library.
Would appreciate any insight into this :)
Beta Was this translation helpful? Give feedback.
All reactions