Commit e8ed6b9
committed
Fix multi-GPU parallel training being disabled
- Only disable multiprocessing for single GPU or non-CUDA devices
- Enable parallel training when multiple GPUs are detected
- Add clear logging to show sequential vs parallel mode
The issue was that NO_MULTIPROCESSING=1 was always being set, forcing
sequential training even on multi-GPU systems. Now:
- Multiple GPUs: Parallel training enabled
- Single GPU: Sequential mode (avoids overhead)
- CPU/MPS: Sequential mode (required)
This fixes the issue where 8-GPU systems were only using 1 GPU.1 parent f6bfaee commit e8ed6b9
1 file changed
+15
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
72 | 72 | | |
73 | 73 | | |
74 | 74 | | |
75 | | - | |
| 75 | + | |
76 | 76 | | |
77 | | - | |
78 | | - | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
79 | 91 | | |
80 | 92 | | |
81 | 93 | | |
| |||
0 commit comments