Commit a862b0a
fix: fix resourcePerNode override not applied with Volcano scheduler (#2982)
* Volcano integration for resourcePerNode override in TrainJob
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* Fixed the issue where PodGroup resources weren't scaling with spec.trainer.resourcesPerNode when specified on a TrainJob
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* limits removed
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* maintains consistency across all MLPolicy plugins
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* Fix pre-commit formatting
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* Unit test updates for mpi and torch plugins
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* Fix framework unit test failures
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* fix: All MLPolicy plugins update SinglePodRequests from TrainJob.spec.trainer.resourcesPerNode
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* Refactor ResourcesPerNode handling to use PodRequest helper for correct scaling with init/sidecar containers
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* refactor: simplify ResourcesPerNode by modifying jobSetTemplateSpec directly
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* test: modify TrainJob resources in coscheduling test to verify node container override
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
* Refactor: remove duplicate resource logic from plugins
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
---------
Signed-off-by: sksingh2005 <shashanksgh3@gmail.com>
1 parent f921483 commit a862b0a
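The diff body is not reproduced on this page, so the following is only a minimal sketch of the behavior the commit message describes: the TrainJob's resourcesPerNode override is written into the trainer container of the pod template first, and the gang-scheduling PodGroup resources are then derived from that updated template, so they scale with the override. All types and function names here (`ResourceList`, `Container`, `PodTemplate`, `applyResourcesPerNode`, `podRequests`, `minResources`) are hypothetical stand-ins, not the project's API. The container name `node` is taken from the "node container override" wording above, and summing only regular containers is a simplifying assumption (init containers are ignored).

```go
package main

import "fmt"

// ResourceList maps a resource name to a quantity in integer base units
// (for example CPU in millicores). Hypothetical stand-in for a Kubernetes
// resource list.
type ResourceList map[string]int64

// Container is a simplified, hypothetical stand-in for a pod container.
type Container struct {
	Name     string
	Requests ResourceList
}

// PodTemplate is a simplified, hypothetical stand-in for a pod template.
type PodTemplate struct {
	Containers []Container
}

// trainerContainer is the assumed name of the container that receives
// the resourcesPerNode override.
const trainerContainer = "node"

// applyResourcesPerNode replaces the trainer container's requests with a
// copy of the override. A nil override leaves the template unchanged.
func applyResourcesPerNode(tpl *PodTemplate, override ResourceList) {
	if override == nil {
		return
	}
	for i := range tpl.Containers {
		if tpl.Containers[i].Name != trainerContainer {
			continue
		}
		copied := make(ResourceList, len(override))
		for name, qty := range override {
			copied[name] = qty
		}
		tpl.Containers[i].Requests = copied
	}
}

// podRequests sums the requests of all regular containers in one pod,
// so sidecars are counted alongside the trainer container.
func podRequests(tpl PodTemplate) ResourceList {
	total := ResourceList{}
	for _, c := range tpl.Containers {
		for name, qty := range c.Requests {
			total[name] += qty
		}
	}
	return total
}

// minResources scales the single-pod requests by the number of nodes,
// which is the quantity a gang-scheduling group would need to reserve.
func minResources(tpl PodTemplate, numNodes int64) ResourceList {
	total := ResourceList{}
	for name, qty := range podRequests(tpl) {
		total[name] = qty * numNodes
	}
	return total
}

func main() {
	tpl := PodTemplate{Containers: []Container{
		{Name: "node", Requests: ResourceList{"cpu": 1000}},
		{Name: "sidecar", Requests: ResourceList{"cpu": 500}},
	}}

	// Apply the override before computing group resources; computing them
	// from the unmodified template is the kind of mismatch the fix targets.
	applyResourcesPerNode(&tpl, ResourceList{"cpu": 4000, "nvidia.com/gpu": 1})

	fmt.Println(minResources(tpl, 2))
}
```

With two nodes, the sketch reserves 9000 millicores of CPU (4000 for the overridden trainer container plus 500 for the sidecar, times two) and 2 GPUs. Modifying the template before any plugin reads it means every consumer sees the same requests, which matches the commit's stated goal of removing duplicate resource logic from the plugins.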
File tree: 2 files changed, +38 -12 lines changed
- pkg/runtime
  - core
  - framework/core