TensorFlow KMP_AFFINITY #1434
Unanswered
nisargjoshi10 asked this question in Q&A
-
This log doesn't provide useful information. Two steps can slow down the stage before training starts if you have a lot of data or many systems. One is neighbor stating, i.e. computing the neighbor statistics. We have provided an option in #1313 to skip it. The other is the statistics step that initializes the model parameters. We haven't provided an option to skip that one.
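A minimal sketch of how the skip option might be passed when launching training from Python. It assumes the option is spelled `--skip-neighbor-stat` in your installed DeePMD-kit version, which should be confirmed with `dp train -h`. The helper `build_train_command` and the file name `input.json` are hypothetical placeholders, not something taken from this thread.

```python
import shutil
import subprocess


def build_train_command(input_file, skip_neighbor_stat=True):
    """Build the argument list for `dp train` (hypothetical helper)."""
    cmd = ["dp", "train", input_file]
    if skip_neighbor_stat:
        # Assumed flag spelling; confirm it with `dp train -h`.
        cmd.append("--skip-neighbor-stat")
    return cmd


def run_training(input_file, skip_neighbor_stat=True):
    """Launch training only if the `dp` executable is on PATH."""
    if shutil.which("dp") is None:
        raise RuntimeError("`dp` executable not found on PATH")
    # check=True raises CalledProcessError if training exits with an error.
    subprocess.run(build_train_command(input_file, skip_neighbor_stat), check=True)
```

Calling `run_training("input.json")` would then start training without the neighbor statistics pass. Skipping it likely means any checks that depend on those statistics are not performed, so it is worth comparing the result against one normal run first.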
-
Providing more information from the Slurm output and also attaching my input script. I am training a water system and don't think there's a lot of data. By neighbor stating, do you refer to the
-
Hi all,
I have been using deepmd-kit to train some models. Whenever I train, the model gets stuck here (as shown below) for a couple of hours before training actually starts. I'm wondering if there's a way to bypass this step and speed up the start of training. I'm training the model on NVIDIA A40 GPUs.
Thanks for the help!