Adds TRT-LLM Docker run of Nemotron Super 3 in DGX Spark #111

Open
dscain wants to merge 2 commits into NVIDIA-NeMo:main from dscain:feature/y87-trtllm-nemotron-3s-spark

dscain commented Mar 12, 2026

Summary

Modifies usage-cookbook/Nemotron-3-Super/AdvancedDeploymentGuide/README.md to demonstrate inference with Nemotron 3 Super NVFP4 on DGX Spark via the TensorRT-LLM Docker container.

What's covered:

  • How to build TRT-LLM, including which commit to use
  • How to create y.yaml for DGX Spark
  • How to start the container and load the model

Research & Context

No AI assistant was used in the development of this PR.

dscain added 2 commits March 12, 2026 11:57
Signed-off-by: Daniel Scain <d.scain.farenzena@gmail.com>
Signed-off-by: Daniel Scain <d.scain.farenzena@gmail.com>