
Conversation

@Edwardf0t1 (Contributor) commented Sep 19, 2025

What does this PR do?

Type of change: New model support

Overview: This PR enables the Nemotron Nano VLM v1 & v2 NVFP4 PTQ flow.

  • Added support for Nemotron Nano VL v1 (.chat) and v2 (.generate) model inference with image inputs in the PTQ flow.
  • Added a utility script for VLMs in the llm_ptq example.
  • Added utility functions to handle VLMs properly in hf_ptq and export.
  • TODO:
    • Consolidate with the vlm_ptq example.
    • Enable FP8 KV cache in export.
    • Calibrate with image data.

Usage

python hf_ptq.py --pyt_ckpt_path /home/scratch.omniml_data_2/models/Nemotron-Nano-VL-12B-V2 --qformat nvfp4 --export_path /home/scratch.omniml_data_2/zhiyuc/checkpoints/Nemotron-Nano-VL-12B-V2-nvfp4-1017 --trust_remote_code --kv_cache_qformat none --attn_implementation eager

Testing

Before your PR is "Ready for review"

  • Make sure you read and follow Contributor guidelines and your commits are signed.
  • Is this change backward compatible?: Yes
  • Did you write any new necessary tests?: Yes
  • Did you add or update any necessary documentation?: No
  • Did you update Changelog?: Yes

Additional Information

Summary by CodeRabbit

  • New Features

    • Added Vision‑Language (VL) model support across quantization, export and preview generation, including image preview and text‑only preview flows for multimodal models.
    • Improved multimodal detection and explicit device placement for better inference behavior.
  • Chores

    • Export now preserves image/text processor assets for multimodal models.
    • CLI default attention implementation changed to "eager".
  • Documentation

    • Changelog updated to note multimodal (VL) model support.

@copy-pr-bot bot commented Sep 19, 2025

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@Edwardf0t1 Edwardf0t1 self-assigned this Sep 19, 2025
@coderabbitai bot (Contributor) commented Sep 19, 2025

Walkthrough

Adds multimodal (vision‑language) support across PTQ/export/runtime: multimodal config detection and device handling, Nemotron VL detection and VL‑specific quantization/export flows, VLM preview and text‑only generation helpers, language_model extraction for exports, and changelog/CLI default update.

Changes

  • Multimodal detection & device handling (examples/llm_ptq/example_utils.py):
    Added _is_multimodal_config(config); preload config in get_model() to detect VL models, disable automatic device_map for multimodal configs (set to None on load failure), warn on config load errors, and explicitly move constructed models to the target device when device_map was disabled.
  • PTQ pipeline with VLM support (examples/llm_ptq/hf_ptq.py):
    Added Nemotron VL detection (is_nemotron_vl) and VL branches: adjust quant_cfg to skip vision/visual components during calibration/quantization, run run_text_only_generation and run_vl_preview_generation (v1/v2 aware) for previews, rewire quantized language_model back into parent VL models post‑PTQ, preserve/save VL processors during export, change CLI default --attn_implementation to "eager", and modify output/printing for VL text responses and diagnostics.
  • VLM utility functions, new module (examples/llm_ptq/vlm_utils.py):
    New module adding run_vl_preview_generation(model, tokenizer, model_path, stage_name) and run_text_only_generation(model, tokenizer, question, generation_config, model_path) supporting both v1 (chat) and v2 (generate) APIs, loading processors with trust_remote_code, moving tensors to model.device, handling errors, and returning generated text or None.
  • Export optimization for VLMs (modelopt/torch/export/unified_export_hf.py):
    Added VL detection (is_multimodal_model) and a branch that attempts to locate and run a language_model subcomponent for multimodal (non‑encoder‑decoder) models during export optimization; runs the subcomponent with fake inputs, logs success/failure, and continues on exceptions.
  • Language model extraction utility (modelopt/torch/export/model_utils.py):
    Added public get_language_model_from_vl(model) to extract a VL model's language_model and its parent; updated __all__ to include the new function.
  • Changelog (CHANGELOG.rst):
    Documented support for Nemotron Nano VL v1 & v2 models in FP8/NVFP4 PTQ workflow.

Sequence Diagram(s)

sequenceDiagram
    participant CLI
    participant hf_ptq
    participant vlm_utils
    participant Model

    Note over CLI,hf_ptq: PTQ start — detect multimodal / Nemotron VL
    CLI->>hf_ptq: parse args, load config
    hf_ptq->>hf_ptq: detect _is_multimodal_config / is_nemotron_vl
    alt Nemotron VL
        hf_ptq->>hf_ptq: adjust quant_cfg (disable vision/image components)
        hf_ptq->>vlm_utils: run_text_only_generation(...)
        vlm_utils->>Model: chat() or generate() (text-only)
        vlm_utils-->>hf_ptq: text preview (or None)
        hf_ptq->>vlm_utils: run_vl_preview_generation(...)
        vlm_utils->>Model: generate() with image
        vlm_utils-->>hf_ptq: vl preview (or None)
        hf_ptq->>hf_ptq: quantize language_model only
        hf_ptq->>hf_ptq: rewire quantized language_model into parent VL model
    else Non-VL
        hf_ptq->>hf_ptq: standard PTQ quantize whole model
    end
    hf_ptq->>CLI: export model + processors (save image processor if VL)
sequenceDiagram
    participant vlm_utils
    participant Processor
    participant Model

    Note over vlm_utils,Model: run_text_only_generation (v1 vs v2)
    vlm_utils->>vlm_utils: check if model has chat
    alt v1 (chat)
        vlm_utils->>Model: model.chat(question, gen_config)
        Model-->>vlm_utils: text response
    else v2 (generate)
        vlm_utils->>Processor: AutoProcessor.from_pretrained(model_path)
        Processor-->>vlm_utils: processed inputs (text tensors)
        vlm_utils->>Model: model.generate(inputs, gen_config)
        Model-->>vlm_utils: token ids
        vlm_utils->>Processor: processor.decode(ids)
        Processor-->>vlm_utils: text response
    end
    vlm_utils-->>caller: return text or None

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Poem

🐰 I found a scene of sight and speech,

Words and pixels in one small reach.
I nudged the language, gave it a spin,
Quantized the heart and rewired the grin.
Nemotron VL — hop, let's begin!

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)
  • Docstring Coverage: ⚠️ Warning. Docstring coverage is 66.67%, which is insufficient. The required threshold is 80.00%. You can run @coderabbitai generate docstrings to improve docstring coverage.
✅ Passed checks (2 passed)
  • Title Check: ✅ Passed. The PR title "Enable Nemotron nano vlm v1&v2 nvfp4 PTQ workflow" directly and accurately reflects the main objective of the changeset. According to the PR objectives and changelog, the primary change is to add support for Nemotron Nano Vision-Language Models (v1 and v2) with NVFP4 quantization in the PTQ workflow. The title is concise (8 words, 49 characters), avoids vague language or noise, and clearly communicates the specific models, quantization format, and workflow being enabled. A teammate reviewing the repository history would immediately understand that this PR introduces support for quantizing specific Nemotron Nano VLM variants using NVFP4.
  • Description Check: ✅ Passed. Check skipped - CodeRabbit’s high-level summary is enabled.

@codecov bot commented Sep 19, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 73.40%. Comparing base (b8dbfc0) to head (4352ab6).
⚠️ Report is 4 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #347      +/-   ##
==========================================
- Coverage   73.41%   73.40%   -0.01%     
==========================================
  Files         180      180              
  Lines       18053    18047       -6     
==========================================
- Hits        13253    13248       -5     
+ Misses       4800     4799       -1     


@Edwardf0t1 Edwardf0t1 changed the title Support nemotron nano vlm v1 nvfp4 quantize + export Enable Nemotron nano vlm v1&v2 nvfp4 quantization and export Oct 23, 2025
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/support-nemotron-nano-vlm-v1-nvfp4 branch from d5597e2 to 16b9f8f on October 23, 2025 00:43
@Edwardf0t1 Edwardf0t1 changed the title Enable Nemotron nano vlm v1&v2 nvfp4 quantization and export Enable Nemotron nano vlm v1&v2 nvfp4 quantization and export workflow Oct 23, 2025
@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/support-nemotron-nano-vlm-v1-nvfp4 branch from 4b6b388 to 1f31103 on October 23, 2025 01:51
@Edwardf0t1 Edwardf0t1 marked this pull request as ready for review October 23, 2025 01:51
@Edwardf0t1 Edwardf0t1 requested review from a team as code owners October 23, 2025 01:51
@coderabbitai bot (Contributor) left a comment

Actionable comments posted: 2

🧹 Nitpick comments (10)
examples/llm_ptq/example_utils.py (3)

42-53: Avoid drift: centralize multimodal detection criteria

This config-only detector partially duplicates logic in modelopt.torch.export.model_utils.is_multimodal_model. Consider defining a single source of truth (shared helper or shared predicates) to prevent divergence across files. Also consider covering common flags like vision_tower/mm_projector seen in LLaVA-like configs.
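
A minimal sketch of one such shared predicate (the helper name and exact flag list are illustrative, not part of this PR):

def config_looks_multimodal(config) -> bool:
    # Config-only heuristic shared by example_utils and the export path (sketch).
    candidate_attrs = ("vision_config", "vision_tower", "mm_projector")
    return any(getattr(config, attr, None) is not None for attr in candidate_attrs)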


348-417: Hugging Face cache resolution: use HF Hub constants and harden fallbacks

TRANSFORMERS_CACHE may be deprecated or not set; prefer huggingface_hub constants and broaden cache discovery.

Apply this diff to select cache root more robustly:

-        from transformers.utils import TRANSFORMERS_CACHE
-
-        # Look for the model in the cache directory
-        cache_pattern = os.path.join(TRANSFORMERS_CACHE, "models--*")
+        try:
+            from huggingface_hub.constants import HF_HUB_CACHE as _CACHE_ROOT
+        except Exception:
+            from transformers.utils import TRANSFORMERS_CACHE as _CACHE_ROOT
+
+        # Look for the model in the cache directory
+        cache_pattern = os.path.join(_CACHE_ROOT, "models--*")

Optionally, prefer snapshot_download() when available to avoid brittle cache walking (already attempted above).

Test with both HF Hub env var and default cache to ensure path resolution is correct.


419-483: Copy custom files recursively; avoid missing subpackages

Current glob() only matches the top level. Some repos place custom modeling code in subfolders.

Apply this diff:

-    for pattern in custom_file_patterns:
-        for file_path in source_dir.glob(pattern):
+    for pattern in custom_file_patterns:
+        for file_path in source_dir.rglob(pattern):
             if file_path.is_file():
                 # Skip config.json and model.safetensors.index.json as they're handled separately
                 if file_path.name in ["config.json", "model.safetensors.index.json"]:
                     continue
                 dest_path = export_dir / file_path.name

If name collisions are possible across subdirs, consider mirroring relative paths under export_dir instead of flattening.

Run a quick dry-run listing on a VL repo with nested modeling_*.py to confirm coverage.

modelopt/torch/export/unified_export_hf.py (2)

138-144: Use shared multimodal detector; avoid string heuristics

This re-implements VL detection and special-cases "nemotron" by name. Prefer modelopt.torch.export.model_utils.is_multimodal_model(model) to keep behavior consistent and reduce false positives/negatives.

Apply this diff:

-        is_vl_model = (
-            hasattr(model.config, "vision_config")
-            or hasattr(model, "vision_model")
-            or "nemotron" in getattr(model, "name_or_path", "").lower()
-        )
+        from modelopt.torch.export.model_utils import is_multimodal_model
+        is_vl_model = is_multimodal_model(model)

160-189: Call language_model with explicit kwargs and basic mask

Passing fake_input positionally may break on some architectures. Use explicit input_ids and a minimal attention_mask.

Apply this diff:

-                    try:
-                        language_model(fake_input)
+                    try:
+                        attn_mask = torch.ones_like(fake_input, dtype=torch.long, device=fake_input.device)
+                        language_model(input_ids=fake_input, attention_mask=attn_mask)

Keeps hooks effective while reducing accidental signature mismatches.

Spot-check with a Nemotron VL and a LLaVA-style model.

examples/llm_ptq/hf_ptq.py (3)

289-290: Nemotron VL detection looks reasonable; add model.name_or_path fallback

If users pass a resolved local path, args.pyt_ckpt_path may not contain "nemotron". You can also check getattr(full_model, "name_or_path", "") to improve detection.


481-523: Preview path: robust, but guard missing tokenizer earlier

The path already raises if tokenizer is None. All good. Consider truncating decoded question to a shorter seed to reduce prompt length variability across datasets.


682-710: Image processor export: add return_tensors safety and avoid re-imports

Minor: os/shutil are available globally in this module; import once at top for style. No functional issue.

examples/llm_ptq/vlm_utils.py (2)

75-83: Ensure tensors from image processor

Some AutoImageProcessor variants return PIL or lists if return_tensors is omitted. Add return_tensors="pt" for consistency.

Apply this diff:

-            image_features = image_processor([image])  # Pass as list with single image
+            image_features = image_processor(images=[image], return_tensors="pt")

204-211: Processor text-only path: prefer images=[] over None

Some processors validate field presence; using images=[] improves compatibility.

Apply this diff:

-            inputs = processor(
-                text=[prompt],
-                images=None,  # No images for text-only
-                return_tensors="pt",
-            )
+            inputs = processor(text=[prompt], images=[], return_tensors="pt")

Validate against both v1/v2 Nemotron pipelines.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2fd67cc and 1f31103.

📒 Files selected for processing (4)
  • examples/llm_ptq/example_utils.py (3 hunks)
  • examples/llm_ptq/hf_ptq.py (6 hunks)
  • examples/llm_ptq/vlm_utils.py (1 hunks)
  • modelopt/torch/export/unified_export_hf.py (2 hunks)
🧰 Additional context used
🧬 Code graph analysis (3)
examples/llm_ptq/vlm_utils.py (2)
modelopt/torch/quantization/qtensor/base_qtensor.py (1)
  • to (114-122)
modelopt/torch/speculative/utils.py (1)
  • tokenize (213-230)
examples/llm_ptq/example_utils.py (1)
modelopt/torch/quantization/qtensor/base_qtensor.py (1)
  • to (114-122)
examples/llm_ptq/hf_ptq.py (3)
examples/llm_ptq/vlm_utils.py (2)
  • run_text_only_generation (161-233)
  • run_vl_preview_generation (24-158)
modelopt/torch/export/model_utils.py (1)
  • is_multimodal_model (74-111)
modelopt/torch/export/unified_export_megatron.py (1)
  • save_pretrained (298-462)
🔇 Additional comments (7)
examples/llm_ptq/example_utils.py (2)

313-317: Manual model.to(device) after device_map=None looks OK

This is compatible with QTensorWrapper.to and custom quant tensors. No change needed.

If any submodules remain on CPU after .to(), consider logging a per-parameter summary in debug mode to catch stragglers during calibration.
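
A minimal sketch of such a debug pass (the logger object is an assumption; any module-level logger would do):

cpu_stragglers = [name for name, param in model.named_parameters() if param.device.type == "cpu"]
if cpu_stragglers:
    logger.debug("Parameters still on CPU after .to(): %s", cpu_stragglers)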


204-216: Respect sequential device mapping for vision-language models to reduce OOM risk

The review identifies a valid issue: unconditionally setting device_map=None for vision-language models prevents the use_seq_device_map flag from being respected. The current code sets device_map=None at line 212 for all VL models, but the check for use_seq_device_map (line 242) only applies in the non-VILA branch, leaving VILA+VL models without the optimization.

The proposed fix correctly moves the logic inside the VL detection block to conditionally apply "sequential" device mapping when requested, which would benefit both VILA and non-VILA vision-language models.

Apply the suggested diff at lines 204-212.
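
The referenced diff is not reproduced here; as a rough sketch of the intent (is_multimodal, use_seq_device_map, and device_map follow the surrounding discussion, and the exact branch placement is an assumption):

if is_multimodal:
    # Honor the --use_seq_device_map flag for VL models instead of always disabling device_map.
    device_map = "sequential" if use_seq_device_map else None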

examples/llm_ptq/hf_ptq.py (4)

661-666: Saving AutoConfig with attn_implementation: good addition

This captures runtime-attention choice in export.


669-680: Skip AutoProcessor for Nemotron VL: acceptable

Nemotron uses separate image processor. Logs are helpful.


897-901: Default --attn_implementation='eager' is a behavior change

Good for stability, but it changes defaults; document in changelog and confirm it doesn’t regress throughput on common models.

Please benchmark a small LLM before/after to ensure acceptable perf, and mention this change in the PR notes.


540-544: Review comment is unnecessary and based on unfounded assumptions

The proposed property detection fix appears to address a non-existent issue. language_model is not a read-only property in HuggingFace Transformers — it's a regular module attribute. Nemotron VL models on HuggingFace expose the underlying text LM as the language_model attribute you can inspect or replace.

The current code pattern with hasattr(full_model, "language_model") already safely handles cases where the attribute doesn't exist. The theoretical pattern of checking type().__dict__.get("language_model") for property detection introduces unnecessary complexity for a scenario not documented in the Nemotron VL model implementations.

If the assignment fails, the root cause would more likely be that full_model lacks the attribute entirely (caught by hasattr) rather than it being a read-only property.

Likely an incorrect or invalid review comment.

examples/llm_ptq/vlm_utils.py (1)

155-159: Graceful failure path is good

Clear error helps during preview smoke tests.

Comment on lines 466 to 467
        # For Nemotron VL models, disable quantization of vision components
        if is_nemotron_vl:
            print("Disabling quantization for vision components in Nemotron VL model")
            quant_cfg["quant_cfg"]["*vision*"] = {"enable": False}
            quant_cfg["quant_cfg"]["*image*"] = {"enable": False}
            # Also disable radio model components specifically
            quant_cfg["quant_cfg"]["*radio*"] = {"enable": False}
            quant_cfg["quant_cfg"]["*visual*"] = {"enable": False}


⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

Bug: mutating quant_cfg when auto_quantize_bits is set (quant_cfg={})

build_quant_cfg returns {} for auto-quant mode, so writing quant_cfg["quant_cfg"][...] raises KeyError and vision modules aren’t disabled. In auto-quant, disable modules via disabled_layers.

Apply this diff to cover both code paths:

-        if is_nemotron_vl:
-            print("Disabling quantization for vision components in Nemotron VL model")
-            quant_cfg["quant_cfg"]["*vision*"] = {"enable": False}
-            quant_cfg["quant_cfg"]["*image*"] = {"enable": False}
-            # Also disable radio model components specifically
-            quant_cfg["quant_cfg"]["*radio*"] = {"enable": False}
-            quant_cfg["quant_cfg"]["*visual*"] = {"enable": False}
+        if is_nemotron_vl:
+            print("Disabling quantization for vision components in Nemotron VL model")
+            nemotron_disabled = {
+                "*vision*": {"enable": False},
+                "*image*": {"enable": False},
+                "*radio*": {"enable": False},
+                "*visual*": {"enable": False},
+            }
+            if not args.auto_quantize_bits:
+                quant_cfg.setdefault("quant_cfg", {}).update(nemotron_disabled)
+            else:
+                # Pass patterns to auto_quant to skip these modules during search
+                extra_disabled_patterns = list(nemotron_disabled.keys())

And extend auto_quantize() to accept these patterns:
(addition outside this hunk)

 def auto_quantize(
-    model, qformat, auto_quantize_bits, calib_dataloader, calibrate_loop, batch_size=1
+    model, qformat, auto_quantize_bits, calib_dataloader, calibrate_loop, batch_size=1,
+    extra_disabled_layers=None,
 ):
@@
-        disabled_layers=["*lm_head*"],
+        disabled_layers=(["*lm_head*"] + (extra_disabled_layers or [])),

Finally, pass extra_disabled_patterns when calling auto_quantize():
(addition near the call site)

-        model = auto_quantize(
+        model = auto_quantize(
             model,
             args.qformat,
             args.auto_quantize_bits,
             calib_dataloader,
             calibrate_loop,
-            args.batch_size,
+            args.batch_size,
+            extra_disabled_layers=(extra_disabled_patterns if is_nemotron_vl else None),
         )

Please run a short auto-quant session on a Nemotron VL to confirm vision blocks remain unquantized.


I'll verify the concerns in this review comment by examining the codebase structure and the specific functions involved.

I'll search the codebase to verify the claims about the build_quant_cfg return value and the auto_quantize function signature.


🏁 Script executed:

#!/bin/bash
# Find and examine the relevant function definitions and code
echo "=== Checking file structure and examining hf_ptq.py ==="
fd "hf_ptq.py" --type f

echo -e "\n=== Examining lines 460-480 to see context ==="
sed -n '460,480p' examples/llm_ptq/hf_ptq.py

echo -e "\n=== Finding build_quant_cfg function ==="
rg -n "def build_quant_cfg" --type py

echo -e "\n=== Finding auto_quantize function signature ==="
rg -n "def auto_quantize" --type py -A 5

echo -e "\n=== Finding calls to auto_quantize ==="
rg -n "auto_quantize\(" --type py -B 2 -A 5

echo -e "\n=== Searching for args.auto_quantize_bits usage ==="
rg -n "auto_quantize_bits" examples/llm_ptq/hf_ptq.py --type py -B 2 -A 2

Length of output: 19066


🏁 Script executed:

#!/bin/bash
# Examine the key functions and usage patterns

echo "=== Lines 460-480 in hf_ptq.py ==="
sed -n '460,480p' examples/llm_ptq/hf_ptq.py

echo -e "\n=== Finding build_quant_cfg function definition ==="
rg -n "def build_quant_cfg" --type py -A 20

echo -e "\n=== Finding auto_quantize function definition ==="
rg -n "def auto_quantize" --type py -A 10

echo -e "\n=== Finding calls to auto_quantize ==="
rg -n "model = auto_quantize" --type py -B 3 -A 8

echo -e "\n=== Checking quant_cfg initialization and conditional branches ==="
rg -n "if.*auto_quantize_bits" examples/llm_ptq/hf_ptq.py --type py -B 3 -A 8

Length of output: 10706



🏁 Script executed:

#!/bin/bash
# Get complete build_quant_cfg function
echo "=== Full build_quant_cfg function ==="
sed -n '55,100p' examples/llm_ptq/example_utils.py

echo -e "\n=== Check control flow around Nemotron VL code (lines 450-480) ==="
sed -n '450,480p' examples/llm_ptq/hf_ptq.py

echo -e "\n=== Full auto_quantize function signature in hf_ptq.py ==="
sed -n '96,120p' examples/llm_ptq/hf_ptq.py

echo -e "\n=== Check disabled_layers usage in auto_quantize ==="
rg -n "disabled_layers" examples/llm_ptq/hf_ptq.py -B 2 -A 2

Length of output: 4324


Bug confirmed: KeyError when accessing quant_cfg["quant_cfg"] in auto-quantize mode

The issue is real. When args.auto_quantize_bits is set, build_quant_cfg() returns an empty dict {} (examples/llm_ptq/example_utils.py:64-65). The Nemotron VL code at lines 466-474 unconditionally accesses quant_cfg["quant_cfg"][...] without checking if the key exists, causing a KeyError. Additionally, the auto_quantize() function (line 96) has no mechanism to accept and pass through extra disabled layer patterns—disabled_layers is hardcoded to ["*lm_head*"] at line 135.

The proposed diff correctly addresses this by:

  1. Adding conditional logic to handle both auto-quant and non-auto-quant paths
  2. Extending auto_quantize() to accept extra_disabled_layers parameter
  3. Passing the Nemotron VL patterns when calling auto_quantize()
🤖 Prompt for AI Agents
In examples/llm_ptq/hf_ptq.py around lines 466 to 474, the code unconditionally
indexes quant_cfg["quant_cfg"] which raises KeyError in auto-quantize mode
(build_quant_cfg returns {}); update the Nemotron VL branch to first detect if
args.auto_quantize_bits is set and, if so, call auto_quantize(model, bits,
extra_disabled_layers=patterns) passing the vision/image/radio/visual patterns;
otherwise ensure quant_cfg has a "quant_cfg" dict (create one if missing) before
assigning the pattern keys and values. Also modify the auto_quantize(...)
function signature to accept an extra_disabled_layers param that merges with the
existing ["*lm_head*"] list and uses it when building disabled_layers so the
Nemotron VL disables those extra patterns during auto quantization.

@Edwardf0t1 Edwardf0t1 changed the title Enable Nemotron nano vlm v1&v2 nvfp4 quantization and export workflow Enable Nemotron nano vlm v1&v2 nvfp4 PTQ workflow Oct 23, 2025
@coderabbitai bot (Contributor) left a comment

Actionable comments posted: 1

♻️ Duplicate comments (3)
examples/llm_ptq/vlm_utils.py (1)

24-35: Do not hardcode trust_remote_code=True; plumb from caller

Both helpers bypass the user’s CLI choice by forcing trust_remote_code=True. Accept a flag and pass it through. Also fix v1 image processing and image resource handling.

-def run_vl_preview_generation(model, tokenizer, model_path, stage_name):
+def run_vl_preview_generation(model, tokenizer, model_path, stage_name, trust_remote_code: bool = False):
@@
-        image = Image.open(img_path)
+        # Open and normalize image safely
+        with Image.open(img_path) as img:
+            image = img.convert("RGB")
@@
-            image_processor = AutoImageProcessor.from_pretrained(model_path, trust_remote_code=True)
+            image_processor = AutoImageProcessor.from_pretrained(
+                model_path, trust_remote_code=trust_remote_code
+            )
@@
-            image_features = image_processor([image])  # Pass as list with single image
+            image_features = image_processor(
+                images=[image], return_tensors="pt"
+            )  # single image as list
@@
-            processor = AutoProcessor.from_pretrained(model_path, trust_remote_code=True)
+            processor = AutoProcessor.from_pretrained(
+                model_path, trust_remote_code=trust_remote_code
+            )
-def run_text_only_generation(model, tokenizer, question, generation_config, model_path):
+def run_text_only_generation(
+    model, tokenizer, question, generation_config, model_path, trust_remote_code: bool = False
+):
@@
-            processor = AutoProcessor.from_pretrained(model_path, trust_remote_code=True)
+            processor = AutoProcessor.from_pretrained(
+                model_path, trust_remote_code=trust_remote_code
+            )

Please update call sites in examples/llm_ptq/hf_ptq.py to pass args.trust_remote_code. I can provide those diffs below.

Also applies to: 73-84, 94-125, 183-209

examples/llm_ptq/hf_ptq.py (2)

490-515: Plumb trust_remote_code to VL helpers (security/compliance)

Forward the CLI flag to avoid forcing trust in remote code.

-                    text_response = run_text_only_generation(
-                        full_model, tokenizer, question, generation_config, args.pyt_ckpt_path
-                    )
+                    text_response = run_text_only_generation(
+                        full_model, tokenizer, question, generation_config, args.pyt_ckpt_path, args.trust_remote_code
+                    )
@@
-                run_vl_preview_generation(
-                    full_model, tokenizer, args.pyt_ckpt_path, "before quantization (VL test)"
-                )
+                run_vl_preview_generation(
+                    full_model, tokenizer, args.pyt_ckpt_path, "before quantization (VL test)", args.trust_remote_code
+                )
@@
-                    text_response = run_text_only_generation(
-                        full_model, tokenizer, question, generation_config, args.pyt_ckpt_path
-                    )
+                    text_response = run_text_only_generation(
+                        full_model, tokenizer, question, generation_config, args.pyt_ckpt_path, args.trust_remote_code
+                    )
@@
-                run_vl_preview_generation(
-                    full_model, tokenizer, args.pyt_ckpt_path, "after quantization (VL test)"
-                )
+                run_vl_preview_generation(
+                    full_model, tokenizer, args.pyt_ckpt_path, "after quantization (VL test)", args.trust_remote_code
+                )
#!/bin/bash
# Ensure no hardcoded trust_remote_code=True remains in vl utilities
rg -n "trust_remote_code\s*=\s*True" examples/llm_ptq/vlm_utils.py

Also applies to: 563-582


459-467: Fix KeyError in auto-quant mode and ensure vision blocks are disabled during search

When args.auto_quantize_bits is set, build_quant_cfg returns {}, so quant_cfg["quant_cfg"][...] raises KeyError and vision isn’t excluded. Handle both code paths and plumb disabled patterns to auto_quantize().

@@
-        if is_nemotron_vl:
-            print("Disabling quantization for vision components in Nemotron VL model")
-            quant_cfg["quant_cfg"]["*vision*"] = {"enable": False}
-            quant_cfg["quant_cfg"]["*image*"] = {"enable": False}
-            # Also disable radio model components specifically
-            quant_cfg["quant_cfg"]["*radio*"] = {"enable": False}
-            quant_cfg["quant_cfg"]["*visual*"] = {"enable": False}
+        if is_nemotron_vl:
+            print("Disabling quantization for vision components in Nemotron VL model")
+            nemotron_disabled = {
+                "*vision*": {"enable": False},
+                "*image*": {"enable": False},
+                "*radio*": {"enable": False},
+                "*visual*": {"enable": False},
+            }
+            if not args.auto_quantize_bits:
+                quant_cfg.setdefault("quant_cfg", {}).update(nemotron_disabled)
+            else:
+                extra_disabled_patterns = list(nemotron_disabled.keys())

Extend auto_quantize to accept these patterns and merge with existing disabled layers:

-def auto_quantize(
-    model, qformat, auto_quantize_bits, calib_dataloader, calibrate_loop, batch_size=1
-):
+def auto_quantize(
+    model, qformat, auto_quantize_bits, calib_dataloader, calibrate_loop, batch_size=1,
+    extra_disabled_layers=None,
+):
@@
-        disabled_layers=["*lm_head*"],
+        disabled_layers=(["*lm_head*"] + (extra_disabled_layers or [])),

Pass the patterns at the call site:

-        model = auto_quantize(
+        model = auto_quantize(
             model,
             args.qformat,
             args.auto_quantize_bits,
             calib_dataloader,
             calibrate_loop,
-            args.batch_size,
+            args.batch_size,
+            extra_disabled_layers=(extra_disabled_patterns if is_nemotron_vl else None),
         )
🧹 Nitpick comments (4)
modelopt/torch/export/model_utils.py (1)

114-157: Setter-less property edge case when reattaching the quantized language model

Function correctly locates language_model across common layouts. Downstream code may fail if parent.language_model is a @property without a setter. Recommend either:

  • Document that callers should handle setter-less properties, or
  • Return an additional hint (e.g., the attribute path to set), or
  • Provide a small helper set_language_model(parent, lm) that falls back to parent.model.language_model if needed.

If preferred, I can add the helper and update callers.
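
A minimal sketch of that helper, mirroring the fallback path suggested elsewhere in this review (the name set_language_model is hypothetical):

def set_language_model(parent, language_model) -> None:
    # Reattach a (quantized) language model onto its VL parent (sketch).
    prop = type(parent).__dict__.get("language_model")
    if isinstance(prop, property) and prop.fset is None:
        # Getter-only property: fall back to the nested layout common in VL models.
        if hasattr(parent, "model") and hasattr(parent.model, "language_model"):
            parent.model.language_model = language_model
            return
        raise AttributeError("language_model is read-only and no nested fallback was found")
    parent.language_model = language_model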

modelopt/torch/export/unified_export_hf.py (1)

139-180: Use logging instead of prints in export path

Replace prints with a module logger to avoid noisy stdout in libraries and to enable level control.

+import logging
+logger = logging.getLogger(__name__)
@@
-        is_vl_model = is_multimodal_model(model)
+        is_vl_model = is_multimodal_model(model)
@@
-                    print(
-                        "Found language_model component - running optimization on language model only"
-                    )
+                    logger.info("Found language_model – optimizing language model only")
@@
-                    print(
-                        f"Running optimization on language model with fake_input shape: {fake_input.shape}"
-                    )
+                    logger.debug("Optimizing language model with fake_input shape: %s", tuple(fake_input.shape))
@@
-                        language_model(fake_input)
-                        print("✅ Language model optimization completed successfully")
+                        language_model(fake_input)
+                        logger.info("Language model optimization completed")
@@
-                        print(f"Language model optimization failed: {e}")
-                        print("Continuing with export...")
+                        logger.warning("Language model optimization failed: %s. Continuing export…", e)
@@
-                    print("Warning: No language_model found in VL model - skipping optimization")
-                    print("This is unexpected for most VL models")
+                    logger.warning("No language_model found in VL model – skipping optimization (unexpected for most VLs)")

If desired, I can propagate this pattern to nearby messages as well.

examples/llm_ptq/hf_ptq.py (2)

533-539: Reattaching language_model: guard for getter-only properties

If parent_model.language_model is a @property without a setter, assignment will fail. Add a fallback to parent_model.model.language_model when present, else raise a clear error.

-                if parent_model is not None:
-                    print("Updating full_model with quantized language_model...")
-                    parent_model.language_model = model
+                if parent_model is not None:
+                    print("Updating full_model with quantized language_model...")
+                    prop = type(parent_model).__dict__.get("language_model")
+                    if isinstance(prop, property) and prop.fset is None:
+                        # common VLM layout: nested language_model under .model
+                        if hasattr(parent_model, "model") and hasattr(parent_model.model, "language_model"):
+                            parent_model.model.language_model = model
+                        else:
+                            raise AttributeError("language_model is read-only on parent; no fallback path found")
+                    else:
+                        parent_model.language_model = model

96-137: Avoid hidden dependency on global args inside auto_quantize()

auto_quantize() reads args.kv_cache_qformat from outer scope. Prefer passing needed values explicitly to keep the helper reusable and testable.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 1f31103 and f87a026.

📒 Files selected for processing (5)
  • CHANGELOG.rst (1 hunks)
  • examples/llm_ptq/hf_ptq.py (9 hunks)
  • examples/llm_ptq/vlm_utils.py (1 hunks)
  • modelopt/torch/export/model_utils.py (2 hunks)
  • modelopt/torch/export/unified_export_hf.py (3 hunks)
🧰 Additional context used
🧬 Code graph analysis (3)
modelopt/torch/export/unified_export_hf.py (1)
modelopt/torch/export/model_utils.py (2)
  • get_language_model_from_vl (114-157)
  • is_multimodal_model (74-111)
examples/llm_ptq/vlm_utils.py (2)
modelopt/torch/quantization/qtensor/base_qtensor.py (1)
  • to (114-122)
modelopt/torch/speculative/utils.py (1)
  • tokenize (213-230)
examples/llm_ptq/hf_ptq.py (2)
examples/llm_ptq/vlm_utils.py (2)
  • run_text_only_generation (161-233)
  • run_vl_preview_generation (24-158)
modelopt/torch/export/model_utils.py (2)
  • get_language_model_from_vl (114-157)
  • is_multimodal_model (74-111)
🔇 Additional comments (1)
modelopt/torch/export/model_utils.py (1)

63-63: Public API surface — looks good

Adding get_language_model_from_vl to __all__ is appropriate and consistent.

- Allow specifying ``calib_seq`` in ``examples/llm_ptq`` to set the maximum sequence length for calibration.
- Add support for MCore MoE PTQ/QAT/QAD.
- Add support for multi-node PTQ and export with FSDP2 in ``examples/llm_ptq/multinode_ptq.py``. See `examples/llm_ptq/README.md <https://github.com/NVIDIA/TensorRT-Model-Optimizer/tree/main/examples/llm_ptq#multi-node-post-training-quantization-with-fsdp2>`_ for more details.
- Add support for Nemotron Nano VL v1 & v2 models in FP8/NVFP4 PTQ workflow.

⚠️ Potential issue | 🟡 Minor

Forward-dated release entry

0.39 (2025-11-07) is in the future (today is 2025-10-23). Please mark this as Unreleased/TBD to avoid confusion until the release is cut.

-0.39 (2025-11-07)
+0.39 (Unreleased)

Committable suggestion skipped: line range outside the PR's diff.

🤖 Prompt for AI Agents
In CHANGELOG.rst around line 16, the release entry "0.39 (2025-11-07)" is
forward-dated; change the header to indicate it is not yet released (e.g., "0.39
(Unreleased)" or "0.39 (TBD)") and leave the content line "Add support for
Nemotron Nano VL v1 & v2 models in FP8/NVFP4 PTQ workflow." under that
Unreleased/TBD heading so the changelog does not show a future date.

@coderabbitai bot (Contributor) left a comment

Actionable comments posted: 1

♻️ Duplicate comments (2)
examples/llm_ptq/vlm_utils.py (2)

73-73: Security issue remains unaddressed: hardcoded trust_remote_code=True bypasses user's CLI choice.

This critical issue was flagged in a previous review but has not been resolved. Both AutoImageProcessor.from_pretrained (line 73) and AutoProcessor.from_pretrained (line 94) hardcode trust_remote_code=True, forcing this security-sensitive setting regardless of the user's args.trust_remote_code CLI flag.

Apply the diff from the previous review to add a trust_remote_code parameter to the function signature and pass it through to these calls. Callers in hf_ptq.py must also be updated to pass args.trust_remote_code.

Also applies to: 94-94


183-183: Security issue remains unaddressed: hardcoded trust_remote_code=True bypasses user's CLI choice.

This critical issue was flagged in a previous review but has not been resolved. AutoProcessor.from_pretrained hardcodes trust_remote_code=True, forcing this security-sensitive setting regardless of the user's args.trust_remote_code CLI flag.

Apply the diff from the previous review to add a trust_remote_code parameter to the function signature and pass it through to this call. Callers in hf_ptq.py must also be updated to pass args.trust_remote_code.

🧹 Nitpick comments (2)
modelopt/torch/export/unified_export_hf.py (1)

161-179: Consider structured logging for production environments.

The VL optimization path uses multiple print() statements for status updates. While consistent with the current file style, structured logging would provide better observability in production.

Additionally, the emoji (✅) on line 173 may not render correctly in all terminal environments or log aggregation systems.

examples/llm_ptq/vlm_utils.py (1)

180-229: Consider refactoring: v2 path logic is duplicated across both functions.

The v2 generation flow (lines 180-229) shares substantial logic with run_vl_preview_generation lines 92-146: creating messages, applying chat template, processing inputs, moving to device, generating, and decoding. This duplication increases maintenance burden and the risk of inconsistencies.

Consider extracting a helper function like _generate_v2(model, processor, tokenizer, messages, generation_config, images=None) to consolidate the v2 flow and reduce code duplication. However, this can be deferred if the current structure is sufficient for immediate needs.
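
One possible shape for such a helper, assuming the processor exposes apply_chat_template and that generation_config is a plain kwargs dict (a sketch, not this PR's code; tokenizer is kept only to match the suggested signature):

def _generate_v2(model, processor, tokenizer, messages, generation_config, images=None):
    # Shared v2 (generate-API) path for text-only and image prompts (sketch).
    prompt = processor.apply_chat_template(messages, add_generation_prompt=True, tokenize=False)
    inputs = processor(text=[prompt], images=images, return_tensors="pt")
    inputs = {k: (v.to(model.device) if hasattr(v, "to") else v) for k, v in inputs.items()}
    output_ids = model.generate(**inputs, **generation_config)
    return processor.decode(output_ids[0], skip_special_tokens=True)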

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between f87a026 and f25a2eb.

📒 Files selected for processing (3)
  • examples/llm_ptq/vlm_utils.py (1 hunks)
  • modelopt/torch/export/model_utils.py (2 hunks)
  • modelopt/torch/export/unified_export_hf.py (3 hunks)
🧰 Additional context used
🧬 Code graph analysis (2)
modelopt/torch/export/unified_export_hf.py (1)
modelopt/torch/export/model_utils.py (2)
  • get_language_model_from_vl (114-157)
  • is_multimodal_model (74-111)
examples/llm_ptq/vlm_utils.py (2)
modelopt/torch/quantization/qtensor/base_qtensor.py (1)
  • to (114-122)
modelopt/torch/speculative/utils.py (1)
  • tokenize (213-230)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: linux
  • GitHub Check: wait-checks / wait
  • GitHub Check: wait-checks / wait
🔇 Additional comments (5)
modelopt/torch/export/model_utils.py (2)

63-63: LGTM: Public API updated correctly.

The addition of "get_language_model_from_vl" to __all__ properly exposes the new public function.


139-150: Code properly handles property delegation with appropriate fallbacks; documentation would clarify intent.

The property detection logic in lines 141-147 uses valid Python introspection to check for properties and handles the case defensively. The assumption that a language_model property points to model.model.language_model is speculative but reasonable for models with property-based delegation patterns. The code correctly falls back to direct attribute access if the nested structure doesn't exist, and Pattern 2 (lines 153-154) properly handles remaining cases where no property exists.

The three-pattern design ensures complete coverage: Pattern 1 handles properties with potential delegation, Pattern 2 catches direct nested access, and Pattern 3 returns None when not found. No changes needed—the implementation is defensive and correct.

modelopt/torch/export/unified_export_hf.py (2)

58-58: LGTM: Import statement is correct.

The added imports for VL model detection and language model extraction are properly sourced from the model_utils module.


138-141: LGTM: VL model detection is well-placed.

The multimodal detection is appropriately positioned after input setup and before model-specific handling branches.

examples/llm_ptq/vlm_utils.py (1)

85-90: Confirm whether the different parameter passing for history is intentional or should be harmonized for consistency.

The review comment correctly identifies a real inconsistency in the codebase:

  • Line 85-90 (VL generation mode): model.chat() is called with keyword arguments and omits the history parameter entirely
  • Line 178 (text-only generation mode): model.chat() is called with positional arguments and explicitly passes history=None

Both calls target the same model.chat() method but use different parameter passing conventions. The question of whether omitting history is equivalent to passing history=None depends on the method's signature and default parameter values, which are defined in the externally loaded model code (via trust_remote_code=True) and cannot be verified locally in this repository.

The developer should confirm:

  1. Whether this difference is intentional (e.g., different modes require different defaults)
  2. If equivalent, whether both calls should use the same explicit style for consistency

Comment on lines 157 to 179
        elif is_vl_model:
            # For VL models, try to run optimization on just the language model part
            language_model, _ = get_language_model_from_vl(model)
            if language_model is not None:
                print(
                    "Found language_model component - running optimization on language model only"
                )

            if language_model is not None:
                # Run optimization on just the language model with the same input format as regular LLMs
                # Use the same fake_input tensor that regular LLMs use
                print(
                    f"Running optimization on language model with fake_input shape: {fake_input.shape}"
                )
                try:
                    language_model(fake_input)
                    print("✅ Language model optimization completed successfully")
                except Exception as e:
                    print(f"Language model optimization failed: {e}")
                    print("Continuing with export...")
            else:
                print("Warning: No language_model found in VL model - skipping optimization")
                print("This is unexpected for most VL models")

🛠️ Refactor suggestion | 🟠 Major

Consolidate redundant if statements.

Lines 160-163 and 165-176 contain consecutive if language_model is not None: checks. The first block only prints a message before the second block performs the actual work, making the first check redundant.

Apply this diff to consolidate the logic:

 elif is_vl_model:
     # For VL models, try to run optimization on just the language model part
     language_model, _ = get_language_model_from_vl(model)
     if language_model is not None:
         print(
             "Found language_model component - running optimization on language model only"
         )
-
-    if language_model is not None:
         # Run optimization on just the language model with the same input format as regular LLMs
         # Use the same fake_input tensor that regular LLMs use
         print(
             f"Running optimization on language model with fake_input shape: {fake_input.shape}"
         )
         try:
             language_model(fake_input)
             print("✅ Language model optimization completed successfully")
         except Exception as e:
             print(f"Language model optimization failed: {e}")
             print("Continuing with export...")
     else:
         print("Warning: No language_model found in VL model - skipping optimization")
         print("This is unexpected for most VL models")
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
Original:

        elif is_vl_model:
            # For VL models, try to run optimization on just the language model part
            language_model, _ = get_language_model_from_vl(model)
            if language_model is not None:
                print(
                    "Found language_model component - running optimization on language model only"
                )
            if language_model is not None:
                # Run optimization on just the language model with the same input format as regular LLMs
                # Use the same fake_input tensor that regular LLMs use
                print(
                    f"Running optimization on language model with fake_input shape: {fake_input.shape}"
                )
                try:
                    language_model(fake_input)
                    print("✅ Language model optimization completed successfully")
                except Exception as e:
                    print(f"Language model optimization failed: {e}")
                    print("Continuing with export...")
            else:
                print("Warning: No language_model found in VL model - skipping optimization")
                print("This is unexpected for most VL models")

Suggested:

        elif is_vl_model:
            # For VL models, try to run optimization on just the language model part
            language_model, _ = get_language_model_from_vl(model)
            if language_model is not None:
                print(
                    "Found language_model component - running optimization on language model only"
                )
                # Run optimization on just the language model with the same input format as regular LLMs
                # Use the same fake_input tensor that regular LLMs use
                print(
                    f"Running optimization on language model with fake_input shape: {fake_input.shape}"
                )
                try:
                    language_model(fake_input)
                    print("✅ Language model optimization completed successfully")
                except Exception as e:
                    print(f"Language model optimization failed: {e}")
                    print("Continuing with export...")
            else:
                print("Warning: No language_model found in VL model - skipping optimization")
                print("This is unexpected for most VL models")
🤖 Prompt for AI Agents
In modelopt/torch/export/unified_export_hf.py around lines 157 to 179, there are
two consecutive checks for "if language_model is not None" where the first only
prints a message and the second does the work; consolidate them into a single
check: perform the initial print about finding the language model, then
immediately print the fake_input shape and run the try/except block inside the
same "if language_model is not None" branch; remove the redundant earlier check
and its duplicated None test so behavior remains identical but with a single
conditional guarding the prints and optimization attempt (keep the warning
branch unchanged when language_model is None).

@coderabbitai bot (Contributor) left a comment

Actionable comments posted: 0

♻️ Duplicate comments (4)
examples/llm_ptq/hf_ptq.py (2)

458-466: Auto‑quant crash + fragile globals: guard quant_cfg in VL path, pass disabled patterns into auto_quantize, and stop using global args inside auto_quantize.

  • In auto‑quant mode, build_quant_cfg() returns {}, so quant_cfg["quant_cfg"][...] raises KeyError (Lines 458-466).
  • auto_quantize() accesses global args (Lines 137-143), which is brittle and breaks when imported.

Fix by:

  • Guarding the quant_cfg updates when not auto‑quant.
  • Passing Nemotron VL disable patterns to auto_quantize via a new extra_disabled_layers param.
  • Passing kv_cache_qformat into auto_quantize instead of reading global args.

Apply these diffs:

  1. Guard VL quant_cfg edits and collect patterns for auto‑quant:
@@
-        # For Nemotron VL models, disable quantization of vision components
-        if is_nemotron_vl:
-            print("Disabling quantization for vision components in Nemotron VL model")
-            quant_cfg["quant_cfg"]["*vision*"] = {"enable": False}
-            quant_cfg["quant_cfg"]["*image*"] = {"enable": False}
-            # Also disable radio model components specifically
-            quant_cfg["quant_cfg"]["*radio*"] = {"enable": False}
-            quant_cfg["quant_cfg"]["*visual*"] = {"enable": False}
+        # For Nemotron VL models, disable quantization of vision components
+        extra_disabled_patterns = None
+        if is_nemotron_vl:
+            print("Disabling quantization for vision components in Nemotron VL model")
+            nemotron_disabled = {
+                "*vision*": {"enable": False},
+                "*image*": {"enable": False},
+                "*radio*": {"enable": False},
+                "*visual*": {"enable": False},
+            }
+            if not args.auto_quantize_bits:
+                # Non auto-quant: mutate quant_cfg safely
+                quant_cfg.setdefault("quant_cfg", {}).update(nemotron_disabled)
+            else:
+                # Auto-quant: pass patterns down to auto_quantize()
+                extra_disabled_patterns = list(nemotron_disabled.keys())
  2. Plumb patterns and kv_cache_qformat into auto_quantize():
@@ def auto_quantize(
-def auto_quantize(
-    model, qformat, auto_quantize_bits, calib_dataloader, calibrate_loop, batch_size=1
-):
+def auto_quantize(
+    model,
+    qformat,
+    auto_quantize_bits,
+    calib_dataloader,
+    calibrate_loop,
+    batch_size=1,
+    *,
+    extra_disabled_layers=None,
+    kv_cache_qformat="none",
+):
@@
-    model, _ = mtq.auto_quantize(
+    model, _ = mtq.auto_quantize(
         model,
         constraints={"effective_bits": auto_quantize_bits},
         data_loader=calib_dataloader,
         forward_step=lambda model, batch: model(**batch),
         loss_func=loss_func,
@@
-        disabled_layers=["*lm_head*"],
+        disabled_layers=(["*lm_head*"] + (extra_disabled_layers or [])),
     )
@@
-    enable_quant_kv_cache = args.kv_cache_qformat != "none"
+    enable_quant_kv_cache = kv_cache_qformat != "none"
@@
-        kv_cache_quant_cfg = getattr(mtq, KV_QUANT_CFG_CHOICES[args.kv_cache_qformat])["quant_cfg"]
+        kv_cache_quant_cfg = getattr(mtq, KV_QUANT_CFG_CHOICES[kv_cache_qformat])["quant_cfg"]
  3. Pass the new params at the call site:
@@
-        model = auto_quantize(
+        model = auto_quantize(
             model,
             args.qformat,
             args.auto_quantize_bits,
             calib_dataloader,
             calibrate_loop,
             args.batch_size,
+            extra_disabled_layers=(extra_disabled_patterns if is_nemotron_vl else None),
+            kv_cache_qformat=args.kv_cache_qformat,
         )

Also applies to: 95-136, 137-153, 186-197


488-491: Plumb trust_remote_code flag to VLM helpers (don’t hardcode True).

Pass args.trust_remote_code through to helper calls to honor CLI and avoid unexpected code execution.

@@
-                    text_response = run_text_only_generation(
-                        full_model, tokenizer, question, generation_config, args.pyt_ckpt_path
-                    )
+                    text_response = run_text_only_generation(
+                        full_model, tokenizer, question, generation_config, args.pyt_ckpt_path, args.trust_remote_code
+                    )
@@
-                run_vl_preview_generation(
-                    full_model, tokenizer, args.pyt_ckpt_path, "before quantization (VL test)"
-                )
+                run_vl_preview_generation(
+                    full_model, tokenizer, args.pyt_ckpt_path, "before quantization (VL test)", args.trust_remote_code
+                )
@@
-                    text_response = run_text_only_generation(
-                        full_model, tokenizer, question, generation_config, args.pyt_ckpt_path
-                    )
+                    text_response = run_text_only_generation(
+                        full_model, tokenizer, question, generation_config, args.pyt_ckpt_path, args.trust_remote_code
+                    )
@@
-                run_vl_preview_generation(
-                    full_model, tokenizer, args.pyt_ckpt_path, "after quantization (VL test)"
-                )
+                run_vl_preview_generation(
+                    full_model, tokenizer, args.pyt_ckpt_path, "after quantization (VL test)", args.trust_remote_code
+                )
#!/bin/bash
# Verify no hardcoded trust_remote_code=True remains in helpers or call sites
rg -n "trust_remote_code=True|run_vl_preview_generation\(|run_text_only_generation\(" examples/llm_ptq -n -C1

Also applies to: 512-514, 563-565, 579-581

examples/llm_ptq/vlm_utils.py (2)

86-107: Do not force trust_remote_code=True; accept a flag.

Plumb a trust_remote_code: bool parameter and forward it to from_pretrained calls.

-def run_vl_preview_generation(model, tokenizer, model_path, stage_name):
+def run_vl_preview_generation(model, tokenizer, model_path, stage_name, trust_remote_code: bool = False):
@@
-            image_processor = AutoImageProcessor.from_pretrained(model_path, trust_remote_code=True)
+            image_processor = AutoImageProcessor.from_pretrained(
+                model_path, trust_remote_code=trust_remote_code
+            )
@@
-            processor = AutoProcessor.from_pretrained(model_path, trust_remote_code=True)
+            processor = AutoProcessor.from_pretrained(model_path, trust_remote_code=trust_remote_code)
@@
-def run_text_only_generation(model, tokenizer, question, generation_config, model_path):
+def run_text_only_generation(
+    model, tokenizer, question, generation_config, model_path, trust_remote_code: bool = False
+):
@@
-            processor = AutoProcessor.from_pretrained(model_path, trust_remote_code=True)
+            processor = AutoProcessor.from_pretrained(model_path, trust_remote_code=trust_remote_code)
#!/bin/bash
# After applying, confirm no hardcoded True remains here
rg -n "trust_remote_code=True" examples/llm_ptq/vlm_utils.py

Also applies to: 191-191


86-96: Ensure tensors are created and moved to device in v1 path.

AutoImageProcessor(...) without return_tensors produces non‑tensor data; your .to() loop won’t execute, and model.chat may error. Request tensors and move in one step.

-            image_processor = AutoImageProcessor.from_pretrained(model_path, trust_remote_code=True)
-
-            image_features = image_processor([image])  # Pass as list with single image
-
-            # Move image features to the same device as the model
-            model_device = model.device
-            for key, value in image_features.items():
-                if hasattr(value, "to"):  # Check if it's a tensor
-                    image_features[key] = value.to(model_device)
-                    print(f"    Moved {key} to {model_device}")
+            image_processor = AutoImageProcessor.from_pretrained(
+                model_path, trust_remote_code=trust_remote_code
+            )
+            # Ensure PT tensors are returned and move everything in one go
+            image_features = image_processor([image], return_tensors="pt")
+            model_device = model.device
+            image_features = {
+                k: (v.to(model_device) if hasattr(v, "to") else v)
+                for k, v in image_features.items()
+            }
+            for k, v in image_features.items():
+                if hasattr(v, "device"):
+                    print(f"    Moved {k} to {model_device}")
🧹 Nitpick comments (2)
examples/llm_ptq/hf_ptq.py (2)

287-289: VLM detection: consider model metadata in addition to path.

Path substring checks can misfire. Optionally include config- or arch‑based checks (e.g., getattr(model.config, "architectures", []) or model.config.model_type) for robustness.
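A minimal sketch of the suggested metadata-based check (the heuristics and attribute names are assumptions; tune them to the models you actually target):

def looks_like_nemotron_vl(model, model_path: str) -> bool:
    """Combine path hints with config/architecture metadata instead of the path alone."""
    arch_names = " ".join(getattr(model.config, "architectures", None) or []).lower()
    model_type = str(getattr(model.config, "model_type", "")).lower()
    path_hint = "nemotron" in model_path.lower() and "vl" in model_path.lower()
    config_hint = ("nemotron" in arch_names or "nemotron" in model_type) and (
        "vl" in arch_names or hasattr(model.config, "vision_config")
    )
    return path_hint or config_hint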


319-333: Quantizing only language_model: verify export expectations; consider pattern-based disable.

You quantize children with a disabled config to exclude non‑LM blocks. Confirm unified export relies on quantizer flags, not module identity. Alternatively, disable by pattern to avoid touching modules:

# Option: Use set_quantizer_by_cfg_context around export or quant to disable non-LM modules by pattern.
# e.g., with mtq.set_quantizer_by_cfg_context(model, {"*": {"enable": False}, "language_model*": {"enable": True}}): ...
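Spelled out, that pattern-based option might look like the sketch below, built around the context manager named in the comment (the wildcard keys are assumptions; verify the exact pattern syntax against the ModelOpt docs):

import modelopt.torch.quantization as mtq

# Disable every quantizer, then re-enable only the language tower by name pattern,
# for the duration of calibration (or export), instead of rewiring module configs.
with mtq.set_quantizer_by_cfg_context(
    model, {"*": {"enable": False}, "*language_model*": {"enable": True}}
):
    calibrate_loop(model)  # same calibration loop used elsewhere in hf_ptq.py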
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between f25a2eb and 613ee6c.

📒 Files selected for processing (2)
  • examples/llm_ptq/hf_ptq.py (9 hunks)
  • examples/llm_ptq/vlm_utils.py (1 hunks)
🧰 Additional context used
🧬 Code graph analysis (2)
examples/llm_ptq/hf_ptq.py (2)
examples/llm_ptq/vlm_utils.py (2)
  • run_text_only_generation (172-241)
  • run_vl_preview_generation (24-169)
modelopt/torch/export/model_utils.py (2)
  • get_language_model_from_vl (114-157)
  • is_multimodal_model (74-111)
examples/llm_ptq/vlm_utils.py (2)
modelopt/torch/quantization/qtensor/base_qtensor.py (1)
  • to (114-122)
modelopt/torch/speculative/utils.py (1)
  • tokenize (213-230)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (5)
  • GitHub Check: linux
  • GitHub Check: wait-checks / wait
  • GitHub Check: wait-checks / wait
  • GitHub Check: code-quality
  • GitHub Check: build-docs
🔇 Additional comments (2)
examples/llm_ptq/hf_ptq.py (2)

655-660: Saving AutoConfig with attn_implementation: confirm Transformers accepts this kwarg.

Some configs may ignore/warn on unknown kwargs. Validate that passing attn_implementation through from_pretrained is supported for your target models.
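One quick way to validate this, sketched below (the checkpoint path is a placeholder and the internal attribute name may differ across Transformers versions):

from transformers import AutoConfig

cfg = AutoConfig.from_pretrained(
    "path/to/Nemotron-Nano-VL-12B-V2",  # placeholder path
    attn_implementation="eager",
    trust_remote_code=True,
)
# If the kwarg is honored, one of these should report "eager" rather than None.
print(getattr(cfg, "_attn_implementation", None), getattr(cfg, "attn_implementation", None))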


856-863: Behavior change: default --attn_implementation to "eager".

Previously None; now always "eager". Ensure this is intentional for all supported models and update docs accordingly.

@kevalmorabia97
Collaborator

@Edwardf0t1 please make sure the internal CI/CD in GitLab passes for llm_ptq and vlm_ptq before merging.

@kevalmorabia97 kevalmorabia97 requested a review from a team as a code owner October 23, 2025 17:26
Collaborator

@kevalmorabia97 kevalmorabia97 left a comment

Adding my codeowner approval. Didn't review example changes.

if getattr(model.config, "is_encoder_decoder", False):
    # For encoder-decoder models, we need to pass both the encoder and decoder input ids
    model(fake_input, decoder_input_ids=decoder_fake_input)
elif is_vl_model:
Collaborator

@cjluo-nv cjluo-nv Oct 23, 2025

What do we want to achieve with this code change?

Contributor Author

Handle VLMs properly when feeding fake inputs to the model, so we can run the forward pass without issues.

Collaborator

Why didn't we need this change before the Nemotron Nano VLM?

Contributor Author

This is related to the model forward pass signatures. Nemotron VLM has a tightly integrated multimodal architecture where the forward pass signature requires both vision and text inputs.

Collaborator

Then is this Nemotron VL specific? How about we just make it explicit instead of applying it to all multimodal models?
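For context, one hypothetical shape of the branch under discussion (not necessarily the PR's exact code): route the text-only fake input through the language tower so the warm-up forward pass does not require image inputs.

if getattr(model.config, "is_encoder_decoder", False):
    # Encoder-decoder models need both encoder and decoder input ids.
    model(fake_input, decoder_input_ids=decoder_fake_input)
elif is_vl_model:
    # Hypothetical handling: VL models whose forward signature expects vision inputs
    # are exercised through their language submodule with text-only fake inputs.
    getattr(model, "language_model", model)(fake_input)
else:
    model(fake_input)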

@Edwardf0t1 Edwardf0t1 force-pushed the zhiyu/support-nemotron-nano-vlm-v1-nvfp4 branch from f8d5bc8 to 57d388e on October 24, 2025 03:09
Signed-off-by: Zhiyu Cheng <[email protected]>
model.eval()

# If device_map was disabled (None), manually move model to target device
if device_map is None and device != "cpu":
Collaborator

what if device == "cpu"

Contributor Author

That was handled by HF's device_map="cpu" in L210.
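Putting both cases together, the placement logic being discussed amounts to roughly the following sketch (the helper name is hypothetical):

def finalize_device_placement(model, device_map, device):
    # Multimodal configs force device_map=None; plain CPU runs are already covered by
    # loading with device_map="cpu", so only the remaining case needs an explicit move.
    model.eval()
    if device_map is None and device != "cpu":
        model = model.to(device)
    return model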


# Generate preview before quantization
if is_nemotron_vl and tokenizer is not None:
    print("Running text-only preview generation for Nemotron VL model...")
Collaborator

Can you abstract lines 476-499 into a helper function? It can be re-used at 527-559.
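A hypothetical shape for that helper (names, prompt, and generation settings are illustrative, not the PR's final code):

from vlm_utils import run_text_only_generation, run_vl_preview_generation

def run_nemotron_vl_previews(full_model, tokenizer, model_path, stage_name):
    """Run a text-only preview followed by an image preview at the given stage."""
    print(f"Running text-only preview generation for Nemotron VL model ({stage_name})...")
    question = "What is the capital of France?"  # illustrative prompt
    generation_config = {"max_new_tokens": 64, "do_sample": False}  # assumed format
    text_response = run_text_only_generation(
        full_model, tokenizer, question, generation_config, model_path
    )
    if text_response:
        print(f"Text-only response: {text_response}")
    run_vl_preview_generation(full_model, tokenizer, model_path, stage_name)

The before/after call sites would then each reduce to a single call, e.g. run_nemotron_vl_previews(full_model, tokenizer, args.pyt_ckpt_path, "before quantization").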

Signed-off-by: Zhiyu Cheng <[email protected]>
Signed-off-by: Zhiyu Cheng <[email protected]>
Signed-off-by: Zhiyu Cheng <[email protected]>
Signed-off-by: Zhiyu Cheng <[email protected]>
Signed-off-by: Zhiyu Cheng <[email protected]>
Collaborator

@cjluo-nv cjluo-nv left a comment

Thanks for addressing the comments!

@kevalmorabia97 kevalmorabia97 merged commit 8745a3c into main Oct 24, 2025
26 checks passed
@kevalmorabia97 kevalmorabia97 deleted the zhiyu/support-nemotron-nano-vlm-v1-nvfp4 branch October 24, 2025 09:23