Commit 4051123
fix: on GB200 use single-thread checkpoint save to avoid CPU OOM (NVIDIA-NeMo#1703)
Signed-off-by: Guyue Huang <[email protected]>
Signed-off-by: Parth Mannan <[email protected]>
1 parent 8c492ff commit 4051123
1 file changed, +1 -1 lines changed