Examples of how to get started with Nemotron models
This directory contains cookbook-style guides showing how to deploy and use the models directly:
- TensorRT-LLM Launch Guide - Running Nemotron models efficiently with TensorRT-LLM
- vLLM Integration - Fast inference and scalable serving of Nemotron models with vLLM
- SGLang Deployment - Serving and interacting with Nemotron models via SGLang
- NIM Microservice - Deploying Nemotron as scalable, production-ready endpoints with NVIDIA Inference Microservices (NIM)
- Hugging Face Transformers - Loading and running Nemotron models directly with Hugging Face Transformers
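A convenient property of the serving options above is that vLLM, SGLang, and NIM (as well as TensorRT-LLM's serve mode) all expose an OpenAI-compatible `/v1/chat/completions` endpoint, so a single client works against any of them. The sketch below is a minimal, stdlib-only example; the base URL (vLLM's default local port) and the model identifier are assumptions you should replace with the values from your own deployment:

```python
import json
import urllib.request

# Assumptions: adjust both to match your deployment.
BASE_URL = "http://localhost:8000/v1"  # default vLLM port
MODEL = "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF"  # example model id


def build_chat_request(prompt: str, model: str = MODEL) -> dict:
    """Build an OpenAI-style chat completion payload."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 256,
        "temperature": 0.2,
    }


def send_chat_request(prompt: str) -> dict:
    """POST the payload to the running server and return the parsed JSON response."""
    req = urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(build_chat_request(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Print the request payload; call send_chat_request() once a server is up.
    print(json.dumps(build_chat_request("Summarize Nemotron in one sentence."), indent=2))
```

Because the payload shape is the same across backends, switching from a local vLLM server to SGLang or a NIM endpoint typically only requires changing `BASE_URL` (and an API key header, where the deployment enforces one).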