Skip to content

Conversation

@jstjohn
Copy link
Collaborator

@jstjohn jstjohn commented Jan 9, 2026

Description

Example scripts demonstrating fine-tuning from a starting checkpoint, and a checkpoint conversion script for migrating nemo2 checkpoints to megatron bridge.

Usage

See README.md in evo2_megatron recipe in this PR for usage.

Type of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Refactor
  • Documentation update
  • Other (please describe):

CI Pipeline Configuration

Configure CI behavior by applying the relevant labels. By default, only basic unit tests are run.

  • ciflow:skip - Skip all CI tests for this PR
  • ciflow:notebooks - Run Jupyter notebooks execution tests for bionemo2
  • ciflow:slow - Run slow single GPU integration tests marked as @pytest.mark.slow for bionemo2
  • ciflow:all - Run all tests (unit tests, slow tests, and notebooks) for bionemo2. This label can be used to enforce running tests for all bionemo2.
  • ciflow:all-recipes - Run tests for all recipes (under bionemo-recipes). This label can be used to enforce running tests for all recipes.

Unit tests marked as @pytest.mark.multi_gpu or @pytest.mark.distributed are not run in the PR pipeline.

For more details, see CONTRIBUTING

Note

By default, only basic unit tests are run. Add appropriate labels to enable an additional test coverage.

Authorizing CI Runs

We use copy-pr-bot to manage authorization of CI
runs on NVIDIA's compute resources.

  • If a pull request is opened by a trusted user and contains only trusted changes, the pull request's code will
    automatically be copied to a pull-request/ prefixed branch in the source repository (e.g. pull-request/123)
  • If a pull request is opened by an untrusted user or contains untrusted changes, an NVIDIA org member must leave an
    /ok to test comment on the pull request to trigger CI. This will need to be done for each new commit.

Pre-submit Checklist

  • I have tested these changes locally
  • I have updated the documentation accordingly
  • I have added/updated tests as needed
  • All existing tests pass successfully

@copy-pr-bot
Copy link

copy-pr-bot bot commented Jan 9, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@jstjohn
Copy link
Collaborator Author

jstjohn commented Jan 9, 2026

Will mark ready once #1403 merges. Until then the new tests will not work.

@jstjohn
Copy link
Collaborator Author

jstjohn commented Jan 9, 2026

Pull in dataset bugfix from #1393

@jstjohn jstjohn marked this pull request as ready for review January 9, 2026 23:42
@jstjohn
Copy link
Collaborator Author

jstjohn commented Jan 9, 2026

/ready to test 7567299

@jstjohn
Copy link
Collaborator Author

jstjohn commented Jan 9, 2026

/ok to test 7567299

@jstjohn
Copy link
Collaborator Author

jstjohn commented Jan 10, 2026

/ok to test cf65fd7

@jstjohn jstjohn enabled auto-merge January 14, 2026 15:31
@jstjohn
Copy link
Collaborator Author

jstjohn commented Jan 14, 2026

/ok to test 86c2d06

@jstjohn jstjohn added this pull request to the merge queue Jan 14, 2026
Merged via the queue into main with commit 2214c45 Jan 14, 2026
18 checks passed
@jstjohn jstjohn deleted the jsjtohn/nemo2_to_mbridge_ckpts branch January 14, 2026 17:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants