Initial README commit #53

abhinavg4 · 2025-11-16T18:35:35Z

Init README.md

abhinavg4

Tagging relevant people

README.md

- Corrected the link in the README for the performance summary to point to the correct file. - Introduced a new `performance-summary.md` document detailing performance benchmarks for large language models using DFM, including nomenclature, performance metrics, and system configurations.

docs/performance-summary.md

README.md

Signed-off-by: sajadn <[email protected]>

copy-pr-bot · 2025-11-18T22:13:16Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

README.md

Signed-off-by: Parth Mannan <[email protected]>

- Removed redundant description of the framework. - Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section.

abhinavg4 · 2025-11-20T17:49:46Z

README.md

+<!-- @Huy please update the below command after you change defaults-->
+
+```bash
+uv run --group megatron-bridge python -m torch.distributed.run --nproc_per_node=2 examples/megatron/recipes/wan/pretrain_wan.py model.qkv_format=thd --mock


uv run --group megatron-bridge python -m torch.distributed.run --nproc_per_node=2 examples/megatron/recipes/wan/pretrain_wan.py --config-file examples/megatron/recipes/wan/config/1.3B_mock.yaml

…m descriptions - Updated the Megatron Bridge Path section to include 6D parallelism details. - Added state-of-the-art performance optimizations to the Dual Training Paths section. - Clarified parallelism terminology in the comparison table for better understanding.

Signed-off-by: Parth Mannan <[email protected]>

…init Signed-off-by: Parth Mannan <[email protected]>

Signed-off-by: linnan wang <[email protected]>

docs/performance-summary.md

README.md

Co-authored-by: Wenwen Gao <[email protected]>

…ness - Simplified descriptions of Megatron Bridge and AutoModel paths in README.md. - Removed outdated comparison table to streamline content. - Updated performance-summary.md to generalize model references and improve clarity. Co-authored-by: Wenwen Gao <[email protected]>

abhinavg4 · 2025-12-01T12:59:05Z

/ok to test 31e7def

…ction header for consistency.

abhinavg4 · 2025-12-01T18:04:39Z

/ok to test f86c51e

@akoumpa

* Initial README commit * Update README and add performance summary documentation - Corrected the link in the README for the performance summary to point to the correct file. - Introduced a new `performance-summary.md` document detailing performance benchmarks for large language models using DFM, including nomenclature, performance metrics, and system configurations. * add DiT megatron links. Signed-off-by: sajadn <[email protected]> * Performance Docs update Signed-off-by: Parth Mannan <[email protected]> * Performance Docs update fix Signed-off-by: Parth Mannan <[email protected]> * Update README to enhance clarity and accuracy - Removed redundant description of the framework. - Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section. * Enhance README with detailed performance optimizations and parallelism descriptions - Updated the Megatron Bridge Path section to include 6D parallelism details. - Added state-of-the-art performance optimizations to the Dual Training Paths section. - Clarified parallelism terminology in the comparison table for better understanding. * Update perf doc Signed-off-by: Parth Mannan <[email protected]> * update Signed-off-by: linnan wang <[email protected]> * Update README with fine-tuning command Removed TODO comment and added a command for fine-tuning a video diffusion model. * Apply suggestion from @akoumpa * Apply suggestion from @akoumpa * Apply suggestion from @akoumpa * Update README, Wan-related. Updated command syntax and improved clarity in README. * Apply suggestion from @akoumpa * Fixing typo @akoumpa * fix automodel section Signed-off-by: Alexandros Koumparoulis <[email protected]> * fix Signed-off-by: Alexandros Koumparoulis <[email protected]> * update DFM-specific readme Signed-off-by: Pablo Garay <[email protected]> * Update performance-summary.md Thanks a lot @linnanwang for the bench numbers. * Update performance-summary.md * Update performance-summary.md * Update README.md Co-authored-by: Wenwen Gao <[email protected]> * Update README.md Co-authored-by: Wenwen Gao <[email protected]> * Update README.md Co-authored-by: Wenwen Gao <[email protected]> * Update README.md Co-authored-by: Wenwen Gao <[email protected]> * Refactor README.md and performance-summary.md for clarity and conciseness - Simplified descriptions of Megatron Bridge and AutoModel paths in README.md. - Removed outdated comparison table to streamline content. - Updated performance-summary.md to generalize model references and improve clarity. Co-authored-by: Wenwen Gao <[email protected]> * Fix typo in README.md: changed "Built" to "Build" in the container section header for consistency. --------- Signed-off-by: sajadn <[email protected]> Signed-off-by: Parth Mannan <[email protected]> Signed-off-by: linnan wang <[email protected]> Signed-off-by: Alexandros Koumparoulis <[email protected]> Signed-off-by: Pablo Garay <[email protected]> Co-authored-by: sajadn <[email protected]> Co-authored-by: Parth Mannan <[email protected]> Co-authored-by: linnan wang <[email protected]> Co-authored-by: Alexandros Koumparoulis <[email protected]> Co-authored-by: Huy Vu <[email protected]> Co-authored-by: Alexandros Koumparoulis <[email protected]> Co-authored-by: Pablo Garay <[email protected]> Co-authored-by: Wenwen Gao <[email protected]> Signed-off-by: Lawrence Lane <[email protected]>

Initial README commit

3266077

copy-pr-bot bot temporarily deployed to test November 16, 2025 18:35 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 18:35 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 18:37 Inactive

abhinavg4 commented Nov 16, 2025

View reviewed changes

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 18:53 Inactive

copy-pr-bot bot temporarily deployed to test November 16, 2025 19:35 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 19:36 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 19:38 Inactive

abhinavg4 commented Nov 16, 2025

View reviewed changes

docs/performance-summary.md Show resolved Hide resolved

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 19:53 Inactive

euronymous-aithal reviewed Nov 16, 2025

View reviewed changes

README.md Outdated Show resolved Hide resolved

README.md Outdated Show resolved Hide resolved

pablo-garay previously approved these changes Nov 17, 2025

View reviewed changes

add DiT megatron links.

79f9d26

Signed-off-by: sajadn <[email protected]>

sajadn dismissed pablo-garay’s stale review via 79f9d26 November 18, 2025 22:13

abhinavg4 commented Nov 19, 2025

View reviewed changes

README.md Show resolved Hide resolved

bernardwin reviewed Nov 19, 2025

View reviewed changes

README.md Outdated Show resolved Hide resolved

parthmannan and others added 3 commits November 19, 2025 11:26

Performance Docs update

b96cf8f

Signed-off-by: Parth Mannan <[email protected]>

Performance Docs update fix

2b00158

Signed-off-by: Parth Mannan <[email protected]>

Update README to enhance clarity and accuracy

8e471a0

- Removed redundant description of the framework. - Clarified the relationship between Megatron Bridge and Megatron Core in the Dual-Path Architecture section.

abhinavg4 commented Nov 20, 2025

View reviewed changes

abhinavg4 and others added 4 commits November 20, 2025 18:56

Update perf doc

2233811

Signed-off-by: Parth Mannan <[email protected]>

Merge branch 'readme_init' of github.com:NVIDIA-NeMo/DFM into readme_…

60fae1d

…init Signed-off-by: Parth Mannan <[email protected]>

update

88ddbf1

Signed-off-by: linnan wang <[email protected]>

abhinavg4 commented Nov 21, 2025

View reviewed changes

docs/performance-summary.md Outdated Show resolved Hide resolved

snowmanwwg reviewed Dec 1, 2025

View reviewed changes

README.md Outdated Show resolved Hide resolved

snowmanwwg reviewed Dec 1, 2025

View reviewed changes

README.md Outdated Show resolved Hide resolved

abhinavg4 and others added 6 commits December 1, 2025 04:31

Update README.md

796103e

Co-authored-by: Wenwen Gao <[email protected]>

Update README.md

9ea6116

Co-authored-by: Wenwen Gao <[email protected]>

Update README.md

ebf00bf

Co-authored-by: Wenwen Gao <[email protected]>

Update README.md

7083f86

Co-authored-by: Wenwen Gao <[email protected]>

Merge branch 'main' into readme_init

31e7def

copy-pr-bot bot temporarily deployed to test December 1, 2025 12:59 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 12:59 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 13:39 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 14:38 Inactive

Fix typo in README.md: changed "Built" to "Build" in the container se…

f86c51e

…ction header for consistency.

abhinavg4 enabled auto-merge (squash) December 1, 2025 18:04

copy-pr-bot bot temporarily deployed to test December 1, 2025 18:04 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 18:05 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 19:14 Inactive

copy-pr-bot bot temporarily deployed to nemo-ci December 1, 2025 21:07 Inactive

ntajbakhsh self-requested a review December 2, 2025 01:11

ntajbakhsh approved these changes Dec 2, 2025

View reviewed changes

Merge branch 'main' into readme_init

8640f3f

pablo-garay disabled auto-merge December 3, 2025 02:29

pablo-garay merged commit b867706 into main Dec 3, 2025
6 checks passed

Initial README commit #53

Initial README commit #53

Uh oh!

Conversation

abhinavg4 commented Nov 16, 2025

Uh oh!

abhinavg4 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

copy-pr-bot bot commented Nov 18, 2025

Uh oh!

Uh oh!

Uh oh!

abhinavg4 Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

abhinavg4 commented Dec 1, 2025

Uh oh!

abhinavg4 commented Dec 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

12 participants