Unable to train NER model using A100 GPU #11224
-
Hi all, I got an exception while training.
spaCy version: 3.4. Kindly share the solution.
-
It looks like your GPU is not found. I'm not sure whether the A100 requires special support, but to start: did you use the install widget to install spaCy with its dependencies? What's the output of
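A quick way to see whether the GPU stack is visible from Python is to check that CuPy imports and reports at least one CUDA device. The snippet below is a minimal sketch of such a check, not necessarily the exact command requested above; the helper name `gpu_report` is hypothetical, and it assumes CuPy is the GPU backend spaCy was installed with.

```python
import importlib.util


def gpu_report():
    """Summarise whether CuPy is importable and how many CUDA devices it sees."""
    report = {"cupy_importable": False, "device_count": 0, "error": None}

    # No importable `cupy` module at all: the wheel is missing or broken.
    if importlib.util.find_spec("cupy") is None:
        report["error"] = "No module named 'cupy'"
        return report

    try:
        import cupy

        report["cupy_importable"] = True
        # Asks the CUDA runtime how many devices are visible to this process.
        report["device_count"] = cupy.cuda.runtime.getDeviceCount()
    except Exception as exc:  # e.g. driver/runtime mismatch or no visible device
        report["error"] = f"{type(exc).__name__}: {exc}"
    return report


if __name__ == "__main__":
    print(gpu_report())
```

If `device_count` is 0 or `error` is set, the problem sits below spaCy (CuPy wheel, CUDA runtime, or driver). On the spaCy side, `spacy.prefer_gpu()` returns `True` only when a GPU was actually activated.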
-
`pip install cupy-wheel` gives me this error:

Collecting cupy-wheel
The conflict is caused by:
To fix this you could try to:
ERROR: ResolutionImpossible: for help visit https://pip.pypa.io/en/latest/user_guide/#fixing-conflicting-dependencies
-
Thank you very much for your kind help. Okay, I will check. I am using an Nvidia Tesla T4 GPU with 16 GB of VRAM. When I train a spaCy model on 1000 data points I get a memory error: "⚠ Aborting and saving the final best model. Encountered exception:". Should I have more RAM for training, or does it have something to do with the batch size? I tried reducing the batch size, but it still gives me the same issue. What should I do?
-
After upgrading, I had the same problem with my A6000 GPUs. I had `cupy-cuda117` installed, but the above cupy test claimed `No module named 'cupy'`. Then I went to the cupy website and installed `cupy-cuda11x`. spaCy worked again, but it complained that I had two cupy packages installed.

Then I uninstalled `cupy-cuda117`, got the `No GPU devices detected` error again, reinstalled `cupy-cuda117`, and now it works great with no errors or warnings, even though I once again have two cupy packages installed. What's going on here?
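For context, wheels such as `cupy-cuda117` and `cupy-cuda11x` both provide the same top-level `cupy` module, so installing or uninstalling one can overwrite or remove files the other relies on. That would be consistent with the behaviour described here, though it is not confirmed for this setup. A minimal sketch for listing which CuPy distributions pip has recorded, using only the standard library (the helper name `installed_cupy_packages` is hypothetical):

```python
from importlib.metadata import distributions


def installed_cupy_packages():
    """Return sorted (name, version) pairs for installed distributions named cupy*."""
    found = set()
    for dist in distributions():
        name = dist.metadata["Name"]
        # Skip distributions with broken metadata (name missing).
        if name and name.lower().startswith("cupy"):
            found.add((name, str(dist.version)))
    return sorted(found)


if __name__ == "__main__":
    for name, version in installed_cupy_packages():
        print(f"{name}=={version}")
```

If more than one entry is printed, a cautious cleanup is to uninstall every listed CuPy package and then reinstall only the one that matches the installed CUDA version.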