Replies: 1 comment 2 replies
-
This error message talks about CPU memory instead of GPU memory. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Dear developers,
I have successfully trained a DP model, and its accuracy is high based on the testing. It is able to reproduce the results of an AIMD simulations (200 atoms simulation, 10ps, is ok). However, when I use it for a large-scale MD simulation with 5000 atoms, it throws an OOM error.
It is strange that the error occurs consistently at the 90000th step each time. I am using Tesla V100 GPU with 36GB memory. Previously, I only needed one GPU card (for simulating 10000 atoms), but now even when I use two cards, this error still occurs. I have even tried the latest version of deepmd-kit, as well as the previous version 2.01 (I used before, without this error), but the issue can not be fixed. Can you provide me with some solutions to resolve this problem?
Beta Was this translation helpful? Give feedback.
All reactions