CPU-GPU collaboration problems during dp train #2004
-
Dear all,
task B is submitted next. From the bold text we see task B is first bounded to OS proc set 0, then appears to be automatically changed to proc set 1
It is expected that the two processes will not interfere each other, since they use different GPUs and CPUs. However, I found that the speed of task A was dramatically decreased (from 2.5s per 100 steps to 4s per 100 steps) immediately after task B was submitted, so did task B. I've also find a very similar issue (#1818), however there seems to be no conclusive answers. Does anyone have ideas on how to solve it? Thanks! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
You can list the CPU usage by |
Beta Was this translation helpful? Give feedback.
You can list the CPU usage by
ps -L -eo pcpu,psr,pid,lwp,user,args
.