Transformation stage 0: NeMo docs revision #15363
Merged
…text_processing) Remove all doc references to collections that are no longer part of NeMo: nlp, multimodal, vision, vlm, avlm, diffusion, llm, multimodal_autoregressive, speechlm, and text_processing. Update landing page, collections index, tutorials, conf.py bibtex entries, and cross-reference links. Delete the remaining docs/source/multimodal/ directory. Clean up false_positives.json and links_needing_review.json to remove entries for deleted doc pages. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
… AI focus Remove obsolete docs (performance benchmarks, MoE, Megatron optimizations, distributed checkpoints, Megatron-LM conversion). Rewrite landing page, parallelisms, mixed precision, Quick Start, and best-practices pages to reflect NeMo's Speech AI focus. Update collection references throughout to include ASR, TTS, Audio, and SpeechLM2. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Mixed precision: document mixed vs true half precision modes, explain HalfPrecisionForAudio plugin that preserves audio tensor precision. Parallelisms: document DDP (all collections) and ModelParallelStrategy (SpeechLM2) with FSDP2/TP/SP concepts, configuration examples, and requirement for configure_model() implementation. Checkpoints: document .nemo as tar archive (unpack/repack), .safetensors for SpeechLM2 via HuggingFace Hub, and distributed checkpoints with ModelParallelStrategy. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Rename "NeMo Framework" to "NeMo Toolkit" across docs. Rewrite landing page with three-tier capability overview (Models, Inference & Deployment, Voice Agent) and HuggingFace model links. Flatten collections toctree into a single index. Update intro: Python 3.12 / PyTorch 2.7+ prerequisites, add PyPI and source install instructions, remove broken User Guide link. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Replace broken NVIDIA docs URLs: use cross-reference to local ASR datasets doc in speechlm2, inline pip install command in g2p. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
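The mixed-precision commit above mentions a HalfPrecisionForAudio plugin that preserves audio tensor precision. The sketch below does not reproduce that plugin. It only illustrates, with the Python standard library, why half precision can be lossy for waveform samples. The helper names (`to_half`, `count_exact_pcm16_samples`) and the PCM16 normalization by 32768 are assumptions made for this illustration.

```python
import struct


def to_half(x: float) -> float:
    """Round x to the nearest IEEE 754 half-precision value and convert back."""
    # The "e" format code packs a Python float as a 16-bit half float.
    return struct.unpack("<e", struct.pack("<e", x))[0]


def count_exact_pcm16_samples() -> int:
    """Count normalized 16-bit PCM values that survive an fp16 round trip."""
    # Assumption: samples are normalized to [-1, 1) by dividing by 32768.
    return sum(
        1 for s in range(-32768, 32768) if to_half(s / 32768) == s / 32768
    )


if __name__ == "__main__":
    sample = 12345 / 32768  # one arbitrary PCM16 sample value
    print("float value:", sample)
    print("after fp16 :", to_half(sample))
    print("exact PCM16 values in fp16:", count_exact_pcm16_samples(), "of 65536")
```

Half precision keeps an 11-bit significand, so neighbouring 16-bit sample values can collapse onto the same number. bfloat16 keeps even fewer significand bits, which is one plausible motivation for leaving audio tensors in full precision while the rest of the model runs in half precision.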
nithinraok
previously approved these changes
Feb 9, 2026
Member
nithinraok
left a comment
LGTM!
Getting a few issues while compiling; please check with make -C docs clean html
Signed-off-by: Piotr Żelasko <petezor@gmail.com>
Collaborator
Author
@nithinraok fixed all errors and warnings in the docs build; this triggered some linter errors, which I also fixed.
Contributor
[🤖]: Hi @pzelasko 👋, We wanted to let you know that a CICD pipeline for this PR just finished successfully. So it might be time to merge this PR or get some approvals.
nemoramo
pushed a commit
to nemoramo/MoNeMo
that referenced
this pull request
Feb 13, 2026
* Remove deprecated collection documentation (nlp, multimodal, vision, text_processing)
  Remove all doc references to collections that are no longer part of NeMo: nlp, multimodal, vision, vlm, avlm, diffusion, llm, multimodal_autoregressive, speechlm, and text_processing. Update landing page, collections index, tutorials, conf.py bibtex entries, and cross-reference links. Delete the remaining docs/source/multimodal/ directory. Clean up false_positives.json and links_needing_review.json to remove entries for deleted doc pages.
  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Deep cleanup of docs: remove Megatron/LLM content, rewrite for Speech AI focus
  Remove obsolete docs (performance benchmarks, MoE, Megatron optimizations, distributed checkpoints, Megatron-LM conversion). Rewrite landing page, parallelisms, mixed precision, Quick Start, and best-practices pages to reflect NeMo's Speech AI focus. Update collection references throughout to include ASR, TTS, Audio, and SpeechLM2.
  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Expand docs for mixed precision, parallelisms, and checkpoints
  Mixed precision: document mixed vs true half precision modes, explain HalfPrecisionForAudio plugin that preserves audio tensor precision. Parallelisms: document DDP (all collections) and ModelParallelStrategy (SpeechLM2) with FSDP2/TP/SP concepts, configuration examples, and requirement for configure_model() implementation. Checkpoints: document .nemo as tar archive (unpack/repack), .safetensors for SpeechLM2 via HuggingFace Hub, and distributed checkpoints with ModelParallelStrategy.
  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Rebrand to NeMo Toolkit, rewrite landing page, update intro
  Rename "NeMo Framework" to "NeMo Toolkit" across docs. Rewrite landing page with three-tier capability overview (Models, Inference & Deployment, Voice Agent) and HuggingFace model links. Flatten collections toctree into a single index. Update intro: Python 3.12 / PyTorch 2.7+ prerequisites, add PyPI and source install instructions, remove broken User Guide link.
  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Fix broken external links in speechlm2/datasets and tts/g2p docs
  Replace broken NVIDIA docs URLs: use cross-reference to local ASR datasets doc in speechlm2, inline pip install command in g2p.
  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Fix some warnings
  Signed-off-by: Piotr Żelasko <petezor@gmail.com>
* All warnings and errors fixed
  Signed-off-by: Piotr Żelasko <petezor@gmail.com>
* Fix linter
  Signed-off-by: Piotr Żelasko <petezor@gmail.com>
---------
Signed-off-by: Piotr Żelasko <petezor@gmail.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
What does this PR do?
Overhauled NeMo developer docs to reflect the project's refocus on Speech AI.
Collection: all
Changelog
processing) and Megatron-specific content (MoE, distributed checkpoints, Megatron-LM conversion,
LLM optimizations, performance benchmarks)
and HuggingFace model links
true half, HalfPrecisionForAudio), checkpoints (.nemo, .safetensors, distributed)
Sortformer, Canary-Qwen), actual install instructions, updated prerequisites
Usage
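One possible usage sketch for the checkpoint documentation: the commit notes describe .nemo as a tar archive that can be unpacked and repacked. The example below does this with Python's standard `tarfile` module only. The helper names (`unpack_nemo`, `repack_nemo`, `demo`) and the member file names are placeholders chosen for illustration, and the stand-in archive is built locally rather than taken from NeMo, so the layout of a real checkpoint may differ.

```python
import tarfile
import tempfile
from pathlib import Path


def unpack_nemo(archive: Path, dest: Path) -> list[str]:
    """Extract a tar archive into dest and return its member names."""
    dest.mkdir(parents=True, exist_ok=True)
    # "r:*" lets tarfile detect whether the archive is compressed.
    with tarfile.open(archive, "r:*") as tar:
        names = tar.getnames()
        # Use the safer "data" extraction filter where this Python provides it.
        kwargs = {"filter": "data"} if hasattr(tarfile, "data_filter") else {}
        tar.extractall(dest, **kwargs)
    return names


def repack_nemo(src: Path, archive: Path) -> None:
    """Pack the files in src into an uncompressed tar archive."""
    with tarfile.open(archive, "w") as tar:
        for path in sorted(src.iterdir()):
            # arcname stores each member at the archive root (an assumption).
            tar.add(path, arcname=path.name)


def demo() -> str:
    """Build a stand-in archive, edit one file, repack, and read it back."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        staging = root / "staging"
        staging.mkdir()
        # Hypothetical contents; a real .nemo file is produced by NeMo itself.
        (staging / "model_config.yaml").write_text("sample_rate: 16000\n")
        (staging / "model_weights.ckpt").write_bytes(b"\x00" * 16)
        original = root / "example.nemo"
        repack_nemo(staging, original)

        # Unpack, change the config, and repack under a new name.
        work = root / "unpacked"
        unpack_nemo(original, work)
        cfg = work / "model_config.yaml"
        cfg.write_text(cfg.read_text().replace("16000", "22050"))
        edited = root / "example_edited.nemo"
        repack_nemo(work, edited)

        with tarfile.open(edited, "r:*") as tar:
            return tar.extractfile("model_config.yaml").read().decode()


if __name__ == "__main__":
    print(demo())
```

Only unpack archives from sources you trust, since tar members can carry unexpected paths on Python versions without extraction filters.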
# Add a code snippet demonstrating how to use this
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI, remove the label and add it again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items, you can still open a "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
The Contributor guidelines list specific people who can review PRs in various areas.
Additional Information