Commit 0e51e09
Add getter APIs for TP/PP/DP ranks in DeepSpeedEngine (#7427)
Thanks again for giving opportunity for improving this Community!
This PR is from Issue #7423.
1) Motivation
To improve compatibility with low-level profiling tools (e.g., NVIDIA
CUPTI or DCGM), it can be useful to expose parallelism-specific rank
(tensor/pipeline/data) at the engine level.
2) Changes
I Added three getter methods to DeepSpeedEngine:
- get_tensor_parallel_rank()
- get_pipeline_parallel_rank()
- get_data_parallel_rank()
Thank you for reviewing this contribution!
---------
Signed-off-by: WoosungMyung <dntjd517@naver.com>
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>1 parent e1560d8 commit 0e51e09
2 files changed
+12
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
730 | 730 | | |
731 | 731 | | |
732 | 732 | | |
| 733 | + | |
| 734 | + | |
| 735 | + | |
| 736 | + | |
| 737 | + | |
| 738 | + | |
| 739 | + | |
| 740 | + | |
| 741 | + | |
733 | 742 | | |
734 | 743 | | |
735 | 744 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
535 | 535 | | |
536 | 536 | | |
537 | 537 | | |
| 538 | + | |
| 539 | + | |
| 540 | + | |
538 | 541 | | |
539 | 542 | | |
540 | 543 | | |
| |||
0 commit comments