Skip to content
Change the repository type filter

All

    Repositories list

    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      13k808Updated Feb 13, 2026Feb 13, 2026
    • text-generation-inference

      Public
      Development fork of https://github.com/huggingface/text-generation-inference
      Python
      360011Updated Feb 13, 2026Feb 13, 2026
    • kserve

      Public
      Standardized Serverless ML Inference Platform on Kubernetes
      Go
      1.4k20591Updated Feb 13, 2026Feb 13, 2026
    • Python
      1000304Updated Feb 13, 2026Feb 13, 2026
    • Tuning scripts using Hugging Face `SFTTrainer`
      Python
      65201Updated Feb 13, 2026Feb 13, 2026
    • 27005Updated Feb 13, 2026Feb 13, 2026
    • 1002Updated Feb 13, 2026Feb 13, 2026
    • ods-ci

      Public
      odh qe tier tests
      RobotFramework
      115162655Updated Feb 13, 2026Feb 13, 2026
    • konflux-central

      Public
      Central repository for managing Konflux resource files. This streamlines maintenance by consolidating configurations and leveraging GitHub Actions for automated…
      Shell
      282214Updated Feb 13, 2026Feb 13, 2026
    • llama-stack

      Public
      Composable building blocks to build Llama Apps
      Python
      1.3k0058Updated Feb 13, 2026Feb 13, 2026
    • llama-stack-provider-trustyai-garak

      Public
      Out-Of-Tree Llama Stack Eval Provider for Red Teaming LLM Systems with Garak
      Python
      600149Updated Feb 13, 2026Feb 13, 2026
    • Scripts and instructions to support RHOAI migrations and upgrades
      Shell
      4103Updated Feb 13, 2026Feb 13, 2026
    • rhods-operator

      Public
      RHODS operator implementation, based on Kubeflow Operator
      Go
      22911037Updated Feb 13, 2026Feb 13, 2026
    • notebooks

      Public
      Notebook images for ODH
      Python
      12712030Updated Feb 13, 2026Feb 13, 2026
    • NeMo-Guardrails

      Public
      NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
      Python
      5970085Updated Feb 13, 2026Feb 13, 2026
    • A scalable inference server for models optimized with OpenVINO™
      C++
      23710224Updated Feb 13, 2026Feb 13, 2026
    • Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
      Go
      1.5k0046Updated Feb 13, 2026Feb 13, 2026
    • Shell
      350096Updated Feb 13, 2026Feb 13, 2026
    • MLServer

      Public
      An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more
      Python
      2240051Updated Feb 13, 2026Feb 13, 2026
    • Dockerfile
      27311Updated Feb 13, 2026Feb 13, 2026
    • A framework for few-shot evaluation of language models.
      Python
      3k10374Updated Feb 13, 2026Feb 13, 2026
    • llama-stack-provider-ragas

      Public
      TrustyAI's RAGAS provider for Llama Stack
      Python
      110071Updated Feb 13, 2026Feb 13, 2026
    • TypeScript
      2766026Updated Feb 13, 2026Feb 13, 2026
    • Rust
      150041Updated Feb 13, 2026Feb 13, 2026
    • 🚀 Guardrails orchestration server for application of various detections on text generation input and output.
      Rust
      3900114Updated Feb 13, 2026Feb 13, 2026
    • Rust
      100014Updated Feb 13, 2026Feb 13, 2026
    • vllm-cpu

      Public
      Python
      1201176Updated Feb 13, 2026Feb 13, 2026
    • Python
      0002Updated Feb 13, 2026Feb 13, 2026
    • vllm-rocm

      Public
      40099Updated Feb 13, 2026Feb 13, 2026
    • mlflow

      Public
      The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and eval…
      Python
      5.3k0054Updated Feb 13, 2026Feb 13, 2026