@abhinavg4
Contributor

Init README.md

Contributor Author

@abhinavg4 left a comment

Tagging relevant people

- Corrected the link in the README for the performance summary to point to the correct file.
- Introduced a new `performance-summary.md` document detailing performance benchmarks for large language models using DFM, including nomenclature, performance metrics, and system configurations.
@pablo-garay previously approved these changes Nov 17, 2025
copy-pr-bot bot commented Nov 18, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

parthmannan and others added 3 commits November 19, 2025 11:26
Signed-off-by: Parth Mannan <[email protected]>
Signed-off-by: Parth Mannan <[email protected]>
- Removed redundant description of the framework.
- Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section.
README.md Outdated
<!-- @Huy please update the below command after you change defaults-->

```bash
uv run --group megatron-bridge python -m torch.distributed.run --nproc_per_node=2 examples/megatron/recipes/wan/pretrain_wan.py model.qkv_format=thd --mock
```
Contributor Author

uv run --group megatron-bridge python -m torch.distributed.run --nproc_per_node=2 examples/megatron/recipes/wan/pretrain_wan.py --config-file examples/megatron/recipes/wan/config/1.3B_mock.yaml

abhinavg4 and others added 4 commits November 20, 2025 18:56
…m descriptions

- Updated the Megatron Bridge Path section to include 6D parallelism details.
- Added state-of-the-art performance optimizations to the Dual Training Paths section.
- Clarified parallelism terminology in the comparison table for better understanding.
Signed-off-by: Parth Mannan <[email protected]>
Signed-off-by: linnan wang <[email protected]>
abhinavg4 and others added 6 commits December 1, 2025 04:31
Co-authored-by: Wenwen Gao <[email protected]>
Co-authored-by: Wenwen Gao <[email protected]>
Co-authored-by: Wenwen Gao <[email protected]>
Co-authored-by: Wenwen Gao <[email protected]>
…ness

- Simplified descriptions of Megatron Bridge and AutoModel paths in README.md.
- Removed outdated comparison table to streamline content.
- Updated performance-summary.md to generalize model references and improve clarity.

Co-authored-by: Wenwen Gao <[email protected]>
@abhinavg4
Contributor Author

/ok to test 31e7def

@abhinavg4 enabled auto-merge (squash) December 1, 2025 18:04
@abhinavg4
Contributor Author

/ok to test f86c51e

@pablo-garay disabled auto-merge December 3, 2025 02:29
@pablo-garay merged commit b867706 into main Dec 3, 2025
6 checks passed
lbliii pushed a commit that referenced this pull request Dec 3, 2025
* Initial README commit

* Update README and add performance summary documentation

- Corrected the link in the README for the performance summary to point to the correct file.
- Introduced a new `performance-summary.md` document detailing performance benchmarks for large language models using DFM, including nomenclature, performance metrics, and system configurations.

* add DiT megatron links.

Signed-off-by: sajadn <[email protected]>

* Performance Docs update

Signed-off-by: Parth Mannan <[email protected]>

* Performance Docs update fix

Signed-off-by: Parth Mannan <[email protected]>

* Update README to enhance clarity and accuracy

- Removed redundant description of the framework.
- Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section.

* Enhance README with detailed performance optimizations and parallelism descriptions

- Updated the Megatron Bridge Path section to include 6D parallelism details.
- Added state-of-the-art performance optimizations to the Dual Training Paths section.
- Clarified parallelism terminology in the comparison table for better understanding.

* Update perf doc

Signed-off-by: Parth Mannan <[email protected]>

* update

Signed-off-by: linnan wang <[email protected]>

* Update README with fine-tuning command

Removed TODO comment and added a command for fine-tuning a video diffusion model.

* Apply suggestion from @akoumpa

* Apply suggestion from @akoumpa

* Apply suggestion from @akoumpa

* Update README, Wan-related.

Updated command syntax and improved clarity in README.

* Apply suggestion from @akoumpa

* Fixing typo @akoumpa

* fix automodel section

Signed-off-by: Alexandros Koumparoulis <[email protected]>

* fix

Signed-off-by: Alexandros Koumparoulis <[email protected]>

* update DFM-specific readme

Signed-off-by: Pablo Garay <[email protected]>

* Update performance-summary.md

Thanks a lot @linnanwang for the bench numbers.

* Update performance-summary.md

* Update performance-summary.md

* Update README.md

Co-authored-by: Wenwen Gao <[email protected]>

* Update README.md

Co-authored-by: Wenwen Gao <[email protected]>

* Update README.md

Co-authored-by: Wenwen Gao <[email protected]>

* Update README.md

Co-authored-by: Wenwen Gao <[email protected]>

* Refactor README.md and performance-summary.md for clarity and conciseness

- Simplified descriptions of Megatron Bridge and AutoModel paths in README.md.
- Removed outdated comparison table to streamline content.
- Updated performance-summary.md to generalize model references and improve clarity.

Co-authored-by: Wenwen Gao <[email protected]>

* Fix typo in README.md: changed "Built" to "Build" in the container section header for consistency.

---------

Signed-off-by: sajadn <[email protected]>
Signed-off-by: Parth Mannan <[email protected]>
Signed-off-by: linnan wang <[email protected]>
Signed-off-by: Alexandros Koumparoulis <[email protected]>
Signed-off-by: Pablo Garay <[email protected]>
Co-authored-by: sajadn <[email protected]>
Co-authored-by: Parth Mannan <[email protected]>
Co-authored-by: linnan wang <[email protected]>
Co-authored-by: Alexandros Koumparoulis <[email protected]>
Co-authored-by: Huy Vu <[email protected]>
Co-authored-by: Alexandros Koumparoulis <[email protected]>
Co-authored-by: Pablo Garay <[email protected]>
Co-authored-by: Wenwen Gao <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>