
Conversation


@conver334 conver334 commented Jan 22, 2026

What does this PR do?

#1838 added exporting of model.layers.*.self_attn.rotary_emb.inv_freq, but its shape was not correct. The following error occurred:

model.layers.9.self_attn.rotary_emb.inv_freq        │ torch.Size([32]) != torch.Size([64])
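The mismatch above comes down to the step argument of torch.arange in the inverse-frequency calculation. A minimal sketch (not the actual bridge code; dim and base values are assumed for illustration) shows how stepping by 2 over a 64-wide rotary dimension yields 32 entries, while stepping by 1 yields the 64 entries the HF checkpoint expects:

```python
import torch

# Assumed rotary dimension and base for illustration only.
dim, base = 64, 10000.0

# Stepping by 2 produces dim/2 = 32 frequencies -- the shape the export produced.
inv_freq_step2 = 1.0 / (base ** (torch.arange(0, dim, 2, dtype=torch.float32) / dim))
print(inv_freq_step2.shape)  # torch.Size([32])

# Stepping by 1 produces dim = 64 frequencies -- the shape HF expects here.
inv_freq_step1 = 1.0 / (base ** (torch.arange(0, dim, 1, dtype=torch.float32) / dim))
print(inv_freq_step1.shape)  # torch.Size([64])
```

The exponent normalization is illustrative; the point is that the arange step alone determines the length of inv_freq, which is exactly the [32] vs [64] discrepancy in the error.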

Changelog

Modified the calculation of this layer and verified it against the imported HF parameters.

GitHub Actions CI

See the CI section in the Contributing doc for how to trigger the CI. An NVIDIA developer will need to approve and trigger the CI for external contributors.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

If you haven't finished some of the above items, you can still open a "Draft" PR.

Additional Information

  • Related to # (issue)

Summary by CodeRabbit

  • Bug Fixes

    • Enhanced weight verification with dtype normalization and shape-mismatch detection during model conversion.
    • Fixed rotary frequency calculations to ensure proper dtype consistency across devices.
  • Tests

    • Added functional test coverage for Moonlight model conversion with distributed training parallelism verification.
    • Updated rotary embedding tests to reflect corrected calculation methodology.

✏️ Tip: You can customize this high-level summary in your review settings.


copy-pr-bot bot commented Jan 22, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: conver334 <[email protected]>
yaoyu-33 previously approved these changes Jan 22, 2026
@yaoyu-33
Contributor

/ok to test a023beb

Signed-off-by: conver334 <[email protected]>
yaoyu-33 previously approved these changes Jan 23, 2026
@yaoyu-33
Contributor

/ok to test 8d4c394

auto-merge was automatically disabled January 23, 2026 05:22

Head branch was pushed to by a user without write access

@BoxiangW BoxiangW enabled auto-merge (squash) January 28, 2026 02:51
Contributor

@BoxiangW BoxiangW left a comment


LGTM

@BoxiangW
Contributor

/ok to test 13ac99d

@yaoyu-33
Contributor

/ok to test 664ed69

@coderabbitai

coderabbitai bot commented Jan 28, 2026

📝 Walkthrough

Updates weight verification logic with dtype normalization and shape-mismatch detection, corrects rotary frequency calculation in DeepSeek V3 bridge from step 2 to step 1, refactors benchmark script to use weights verification table, adds comprehensive Moonlight model conversion functional test, and adjusts corresponding unit test expectations.

Changes

  • Weight Verification Enhancements (src/megatron/bridge/models/conversion/utils.py)
    Added dtype-normalized comparison for parameters and shape-mismatch detection prior to value comparison; the "Matches Original" column now indicates shape mismatches or boolean allclose results.
  • DeepSeek Rotary Frequency Fixes (src/megatron/bridge/models/deepseek/deepseek_v3_bridge.py, tests/unit_tests/models/deepseek/test_deepseek_bridges.py)
    Changed the torch.arange step from 2 to 1 in the inverse-frequency calculation; added explicit dtype coercion to the reference tensor's dtype; updated test assertions to match the new expected values.
  • Benchmark Script Refactoring (examples/conversion/hf_megatron_roundtrip_benchmark.py)
    Replaced the per-iteration weight export loop with a single weights_verification_table(pool) call; prints the table only on rank 0; adjusted the timing measurement to encapsulate table printing.
  • Moonlight Model Conversion Test (tests/functional_tests/models/deepseek/test_moonlight_conversion.py)
    New functional test module with a TestMoonlightConversion class; adds a fixture for toy-model creation with config overrides and dynamic model-class loading; includes a GPU-only parallelism test validating conversion across multiple TP/PP/EP configurations and config preservation.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

🚥 Pre-merge checks: ✅ 2 passed | ❌ 2 failed

❌ Failed checks (2 warnings)
  • Docstring Coverage ⚠️ Warning: docstring coverage is 57.14%, below the required threshold of 80.00%. Resolution: write docstrings for the functions missing them to satisfy the coverage threshold.
  • Test Results For Major Changes ⚠️ Warning: the PR contains significant changes to the inv_freq calculation, but the PR description lacks test results, numerical validation, or confirmation that the shape-mismatch errors are resolved. Resolution: update the PR description to include test execution results, explicit numerical validation of inv_freq shapes, and console output from the test suite.

✅ Passed checks (2 passed)
  • Description Check ✅ Passed: check skipped because CodeRabbit's high-level summary is enabled.
  • Title Check ✅ Passed: the pull request title accurately describes the main fix: correcting the inv_freq shape in exported DeepSeek models' rotary embeddings.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.


Comment @coderabbitai help to get the list of available commands and usage tips.

@BoxiangW
Contributor

/ok to test 5acf621

3 participants