Performance: Optimize .nemo tar extraction & model config processing #15245

paulirish · 2026-01-01T00:25:56Z

What does this PR do?

Optimizes model loading performance for ASR models, specifically reducing Canary's setup time by ~44% (from 41.1s to 23.1s) through optimized config processing and eliminating redundant archive extractions.

Collection: [ASR]

Changelog

Prevented EncDecMultiTaskModel from re-extracting tarballs that nemo.utils.model_utils has already handled.
Optimized nemo.utils.model_utils.maybe_update_config_version and convert_model_config_to_dict_config to support in-place updates via a make_copy parameter, avoiding expensive OmegaConf deep copies.
Updated Serialization and FileIO mixins in nemo.core.classes.common to use non-copying config conversions where safe.
Enabled in-place config resolution in EncDecMultiTaskModel, EncDecCTCModelBPE, and EncDecHybridRNNTCTCBPEModel.
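
As a rough illustration of the make_copy idea (not the actual NeMo code), the sketch below uses a plain dict in place of an OmegaConf config. The function name `update_config_version` and the keys `legacy_name` / `name` are hypothetical and exist only for this example.

```python
import copy


def update_config_version(cfg: dict, make_copy: bool = True) -> dict:
    """Hypothetical config-upgrade helper; a stand-in, not NeMo's implementation."""
    # Deep-copy only when the caller still needs the original left untouched.
    target = copy.deepcopy(cfg) if make_copy else cfg
    # Illustrative upgrade step: rename a legacy key if it is present.
    if "legacy_name" in target and "name" not in target:
        target["name"] = target.pop("legacy_name")
    return target


original = {"legacy_name": "demo", "encoder": {"layers": 24}}

# Default path: the original dict is preserved, at the cost of a deep copy.
copied = update_config_version(original, make_copy=True)

# In-place path: no copy is made and the caller's dict is mutated.
same_object = update_config_version(original, make_copy=False)
```

The in-place path is only safe when no other code holds a reference to the config and expects the old contents, which is why the copying behaviour stays the default in this sketch.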

Measured performance gains on Canary model load:
- Baseline: 41.1s
- After config optimizations: 32.9s (~20% improvement)
- After only the tar extraction fix: 25.6s (~38% improvement)
- Combined: 23.1s (~44% improvement)

These two commits are separate and I'm happy to drop one.

Usage

No changes to public APIs. Models will load significantly faster.

from nemo.collections.asr.models import EncDecMultiTaskModel
# This call is now ~18s faster for Canary
model = EncDecMultiTaskModel.from_pretrained("nvidia/canary-1b-v2")

See #15240 for a more complete repro script
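
For a quick local check, a minimal timing harness along these lines should be enough. It is not the repro script from #15240; `time_load` is a hypothetical helper, and the commented-out call assumes NeMo is installed and the model can be downloaded.

```python
import time
from typing import Callable, List


def time_load(load_fn: Callable[[], object], repeats: int = 3) -> List[float]:
    """Return the wall-clock seconds taken by each call to load_fn."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        load_fn()
        timings.append(time.perf_counter() - start)
    return timings


# Example with the model from the snippet above (requires NeMo):
# from nemo.collections.asr.models import EncDecMultiTaskModel
# print(time_load(lambda: EncDecMultiTaskModel.from_pretrained("nvidia/canary-1b-v2"), repeats=1))
```

The first call usually includes download and cold-cache costs, so comparing later calls gives numbers closer to the warm-start figures quoted above.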

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
- No, as the existing tests cover loading
Did you add or update any necessary documentation?
- Nope.
Does the PR affect components that are optional to install?
- No.

PR Type:

New Feature
Bugfix
Documentation

--

fixes #15240 cc @nithinraok

nemo/utils/model_utils.py

nithinraok

@paulirish Thanks for the great PR, really appreciate it.

Made some comments.

nemo/utils/model_utils.py

nemo/collections/asr/models/aed_multitask_models.py

nithinraok · 2026-01-02T02:02:46Z

To fix the format checker for the CI/CD tests, please run:

python setup.py style --scope=<filepath> --fix

(we are working on improving contributing guide)

Reduces load time by ~8.1s (41.1s -> 32.96s) on warm start. Avoids unnecessary deep copies in `nemo.utils.model_utils` and `nemo.core.classes.modelPT`. Enables in-place config updates for `EncDecMultiTaskModel`, `EncDecCTCModelBPE`, and `EncDecHybridRNNTCTCBPEModel`. Updates Serialization and FileIO mixins to use optimized config conversion.

Signed-off-by: Paul Irish <[email protected]>

Reduces load time by ~9.8s when combined with config optimizations (32.96s -> 23.14s). On its own, reduces load time by ~15.5s (41.1s -> 25.62s). Prevents `EncDecMultiTaskModel` from re-extracting tarballs when they are already handled by `nemo.utils.model_utils`.

Signed-off-by: Paul Irish <[email protected]>

Signed-off-by: nithinraok <[email protected]>

paulirish · 2026-01-08T20:55:56Z

@nithinraok thank you.
I did much of the same over the weekend but my simplification wasn't as effective as yours, so I appreciate you stepping in.

nithinraok · 2026-01-08T21:36:07Z

> @nithinraok thank you. I did much of the same over the weekend but my simplification wasn't as effective as yours, so I appreciate you stepping in.

Thanks @paulirish. Appreciate it. CI is delayed because of the many open PRs; I will take care of this PR. :)

github-actions bot added the core and ASR labels Jan 1, 2026

paulirish mentioned this pull request Jan 1, 2026

Performance: EncDecMultiTaskModel (Canary) initialization triggers double .nemo extraction and recursive heavy reload (~52s cold start) #15240

Open

github-advanced-security bot found potential problems Jan 1, 2026

nemo/utils/model_utils.py: Fixed

nithinraok requested changes Jan 2, 2026

nithinraok added the skip-linting label Jan 2, 2026

github-actions bot added the community-request label Jan 2, 2026

nithinraok previously approved these changes Jan 7, 2026

chtruong814 added the Run CICD label Jan 7, 2026

chtruong814 temporarily deployed to test January 7, 2026 20:35 — with GitHub Actions Inactive

nithinraok dismissed their stale review via 94a633c January 8, 2026 15:35

chtruong814 added Run CICD and removed Run CICD labels Jan 8, 2026

chtruong814 had a problem deploying to test January 8, 2026 15:37 — with GitHub Actions Error

paulirish and others added 6 commits January 8, 2026 09:53

simplify changes

21fa0e4

Signed-off-by: nithinraok <[email protected]>

formatter fixes

38bdbda

Signed-off-by: nithinraok <[email protected]>

remove markers as its causing issues with safe instantiations

479fe16

Signed-off-by: nithinraok <[email protected]>

merge conflicts

a2a3eb4

Signed-off-by: nithinraok <[email protected]>

nithinraok force-pushed the faster-canary-setup branch from 94a633c to a2a3eb4 Compare January 8, 2026 18:00

chtruong814 added Run CICD and removed Run CICD labels Jan 8, 2026

chtruong814 deployed to test January 8, 2026 18:02 — with GitHub Actions Active
