    Repositories list

    • TensorRT-LLM

      Public
      TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inferen…
      Python
      2.1k13k540550 Updated Feb 25, 2026
    • NeMo-Agent-Toolkit

      Public
      The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
      Python
      5281.8k1819 Updated Feb 25, 2026
    • cccl

      Public
      CUDA Core Compute Libraries
      C++
      3462.2k1.3k201 Updated Feb 25, 2026
    • cuopt

      Public
      GPU accelerated decision optimization
      Cuda
      1277198414 Updated Feb 25, 2026
    • cloudai

      Public
      CloudAI Benchmark Framework
      Python
      438455 Updated Feb 25, 2026
    • nvidia-container-toolkit

      Public
      Build and run containers leveraging NVIDIA GPUs
      Go
      4804.1k8717 Updated Feb 25, 2026
    • KAI-Scheduler

      Public
      KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
      Go
      1541.1k2575 Updated Feb 25, 2026
    • cuda-quantum

      Public
      C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
      C++
      342940430108 Updated Feb 25, 2026
    • topograph

      Public
      A toolkit for discovering cluster network topology.
      Go
      1410021 Updated Feb 25, 2026
    • Megatron-LM

      Public
      Ongoing research training transformer models at scale
      Python
      3.6k15k305314 Updated Feb 25, 2026
    • Fuser

      Public
      A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
      C++
      78379212214 Updated Feb 25, 2026
    • bare-metal-manager-core

      Public
      NVIDIA Bare Metal Manager - Hardware Lifecycle Management and multitenant networking
      Rust
      42635516 Updated Feb 25, 2026
    • NVSentinel

      Public
      NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
      Go
      491844421 Updated Feb 25, 2026
    • nsight-python

      Public
      Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools
      Python
      1014341 Updated Feb 25, 2026
    • bare-metal-manager-rest

      Public
      NVIDIA Bare Metal Management - Hardware Lifecycle management (REST API)
      Go
      1521514 Updated Feb 25, 2026
    • DALI

      Public
      A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and infer…
      C++
      6595.6k22530 Updated Feb 25, 2026
    • edk2-platforms

      Public
      NVIDIA fork of tianocore/edk2-platforms
      C
      41300 Updated Feb 25, 2026
    • edk2

      Public
      NVIDIA fork of tianocore/edk2
      C
      1627015 Updated Feb 25, 2026
    • k8s-nim-operator

      Public
      An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
      Go
      40150720 Updated Feb 25, 2026
    • gpu-driver-container

      Public
      The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.
      Shell
      761592536 Updated Feb 25, 2026
    • TileGym

      Public
      Helpful kernel tutorials and examples for tile-based GPU programming
      Python
      4965224 Updated Feb 25, 2026
    • doca-platform

      Public
      DOCA Platform manages provisioning and service orchestration for Bluefield DPUs
      Go
      207701 Updated Feb 25, 2026
    • bionemo-framework

      Public
      BioNeMo Framework: For building and adapting AI models in drug discovery at scale
      Jupyter Notebook
      12366562126 Updated Feb 25, 2026
    • Model-Optimizer

      Public
      A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models …
      Python
      2832k6696 Updated Feb 25, 2026
    • cudaqx

      Public
      Accelerated libraries for quantum-classical computing built on CUDA-Q.
      C++
      52842816 Updated Feb 25, 2026
    • nv-redfish

      Public
      NVIDIA's next-generation Redfish crate
      Rust
      21411 Updated Feb 25, 2026
    • gpu-operator

      Public
      NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
      Go
      4562.6k6442 Updated Feb 25, 2026
    • holodeck

      Public
      Holodeck is a project to create test environments optimised for GPU projects.
      Go
      132626 Updated Feb 25, 2026
    • OSMO

      Public
      The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge device…
      TypeScript
      18975920 Updated Feb 25, 2026
    • nv-ingest

      Public
      NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized …
      Python
      2992.8k10257 Updated Feb 25, 2026