feat: Add MFT (Minifinetuning) loss support for knowledge distillation #14298
base: main
Conversation
- Add MFTLoss class implementing Minifinetuning distillation loss from https://arxiv.org/abs/2506.15702
- Implement corrected distribution preparation with threshold-based teacher probability adjustment
- Add support for both incorrect argmax and separation threshold corrections
- Update DistillationConfig to support MFT mode with configurable threshold parameter
- Integrate MFT loss option in distillation pipeline with automatic selection based on config
- Add validation for MFT threshold parameter (must be between 0 and 1)
- Note: MFT loss currently does not support tensor model parallelism

The MFT loss provides an alternative to standard KL divergence by correcting teacher distributions based on ground truth labels and a configurable threshold, potentially improving distillation quality for language models.

Signed-off-by: pbelcak <[email protected]>
Signed-off-by: pbelcak <[email protected]>
…commit. Signed-off-by: pbelcak <[email protected]>
(forgot to update CHANGELOG.md, made no other changes)

@sharathts all good?

@pbelcak LGTM, approved!

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.
@pbelcak could you cherry-pick the commit that fixes pylint? I will then try again and see if we can resolve the lint failure.
I see that the changes needed here are largely cosmetic; I plan to have a look at it this week.
Signed-off-by: Zhiyu Li <[email protected]>
Hi @ZhiyuLi-Nvidia, thanks for the fix, it's done. Can we merge?
Signed-off-by: Zhiyu Li <[email protected]>
Hi @pbelcak, all tests passed except for the low test coverage check. Could you add a unit test and rebase the branch?
What does this PR do?
Adds MFT (Minifinetuning; NVR paper) loss support for knowledge distillation with configurable threshold-based teacher probability correction.
The MFT loss provides an alternative to standard KL divergence by correcting teacher distributions based on ground truth labels and a configurable threshold, potentially improving distillation quality for language models.
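To make the correction idea concrete, below is a minimal illustrative sketch in PyTorch. It is not the formulation from the paper or from this PR's MFTLoss / _prepare_corrected_distributions; the function name, tensor shapes, and the specific rule (raise the ground-truth token's probability to at least the threshold, then renormalize the rest) are assumptions chosen only to illustrate threshold-based correction.

```python
# Illustrative sketch only: NOT the exact MFT correction from the paper or this PR.
# The rule here (boost the ground-truth token's probability to at least `threshold`,
# then rescale the remaining mass) is an assumption for illustration.
import torch


def correct_teacher_distribution(
    teacher_probs: torch.Tensor,  # (batch, seq, vocab) teacher probabilities
    labels: torch.Tensor,         # (batch, seq) ground-truth token ids
    threshold: float,             # corresponds to mft_threshold; must be in (0, 1)
) -> torch.Tensor:
    assert 0.0 < threshold < 1.0, "threshold must be between 0 and 1"

    # Probability the teacher assigns to the ground-truth token.
    p_true = teacher_probs.gather(-1, labels.unsqueeze(-1))      # (batch, seq, 1)
    p_true_new = torch.clamp(p_true, min=threshold)              # boost if below threshold

    # Rescale all other tokens so each distribution still sums to 1.
    scale = (1.0 - p_true_new) / (1.0 - p_true).clamp_min(1e-12)
    corrected = teacher_probs * scale
    corrected = corrected.scatter(-1, labels.unsqueeze(-1), p_true_new)
    return corrected
```

In the actual implementation, such a corrected distribution would presumably serve as the distillation target in place of the raw teacher distribution.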
Collection: llm/modelopt

Changelog
- `MFTLoss` class implementing Minifinetuning distillation loss based on https://arxiv.org/abs/2506.15702
- `_prepare_corrected_distributions` method for threshold-based teacher probability correction
- `DistillationConfig` dataclass extended with `use_mft` and `mft_threshold` parameters
- Validation of `mft_threshold` in the `__post_init__` method
- `load_distillation_config` function in `utils.py` updated for proper integration

Usage
You can enable MFT loss in your distillation configuration:
In your distillation YAML config file:
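For example, a hedged sketch (only the `use_mft` and `mft_threshold` keys are taken from this PR's changelog; the file layout and any surrounding keys are assumptions):

```yaml
# Hypothetical snippet: key names from the changelog above; surrounding structure assumed.
use_mft: true        # switch from the standard KL-divergence loss to the MFT loss
mft_threshold: 0.5   # teacher-probability correction threshold, must be between 0 and 1
```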
Or, programmatically:
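For example (a sketch: `DistillationConfig`, `use_mft`, and `mft_threshold` are named in the changelog above, but the import path is an assumption and may differ from the actual module layout):

```python
# Hypothetical usage; the import path below is assumed and may not match NeMo's layout.
from nemo.collections.llm.modelopt import DistillationConfig  # assumed path

cfg = DistillationConfig(
    use_mft=True,        # select the MFT loss instead of standard KL divergence
    mft_threshold=0.5,   # validated in __post_init__ to lie between 0 and 1
)
```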
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI, remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
merge_requests/2785. No ModelOpt tests (except for one quantization check) are currently present in the NeMo Framework tests, so I did not bring the tests here.

PR Type: