Skip to content

Conversation

@yaoyu-33
Copy link
Contributor

This is a ported gitlab PR 4069. Expert review is done, needs final review.

@yaoyu-33 yaoyu-33 requested review from a team as code owners October 29, 2025 18:27
@copy-pr-bot
Copy link

copy-pr-bot bot commented Oct 29, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@yaoyu-33 yaoyu-33 requested review from a team as code owners November 13, 2025 20:23
@yanring yanring added the Expert Review Apply this label to indicate that your PR is ready for expert review. label Nov 19, 2025
@yanring yanring added this to the Core 0.16 milestone Nov 19, 2025
@shifangx
Copy link
Contributor

shifangx commented Jan 6, 2026

hi, @ko3n1g, can you help to check why this pr CI test failed for Codeovers approval?
https://github.com/NVIDIA/Megatron-LM/actions/runs/20747699475/job/59568935015?pr=2031

you and yash, jared have approved, but I donot know why the Codeovers approval test still failed.

@ko3n1g
Copy link
Contributor

ko3n1g commented Jan 6, 2026

core-nemo is missing

@shifangx
Copy link
Contributor

shifangx commented Jan 7, 2026

core-nemo is missing

Thanks @ko3n1g

@shifangx
Copy link
Contributor

shifangx commented Jan 7, 2026

Hi, @pablo-garay, Could you help explain what this error message means? I am trying to fix this error, but I am not sure what is the root cause of this error.
https://github.com/NVIDIA/Megatron-LM/actions/runs/20747699360/job/59568927927?pr=2031
This pr does not change megatron.core.model_parallel_config.ModelParallelConfig.init , but it is mentioned in error message.

Change 1: megatron.core.model_parallel_config.ModelParallelConfig.init(pipeline_dtype)
Clean: megatron.core.model_parallel_config.ModelParallelConfig.init
Clean repr: 'megatron.core.model_parallel_config.ModelParallelConfig.init'
✗ NO MATCH

@shifangx
Copy link
Contributor

shifangx commented Jan 7, 2026

/ok to test 19f0ecd

@yaoyu-33
Copy link
Contributor Author

/ok to test 2fe2eeb

@shifangx
Copy link
Contributor

/ok to test 762ec65

@yaoyu-33 yaoyu-33 added this pull request to the merge queue Jan 24, 2026
Merged via the queue into NVIDIA:main with commit 485ed18 Jan 24, 2026
52 of 54 checks passed
@yaoyu-33 yaoyu-33 deleted the shifang/multi_module_comm branch January 24, 2026 04:05
ko3n1g added a commit to ko3n1g/Megatron-LM that referenced this pull request Jan 24, 2026
ko3n1g added a commit that referenced this pull request Jan 27, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants