Skip to content

neutree-ai/neutree

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

258 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Neutree

Neutree

Neutree is an open-source Large Language Model (LLM) infrastructure management platform.

Features

  • Multi-cluster Management: Deploy and manage inference workloads across Kubernetes clusters and static node clusters (Ray + Docker)
  • OpenAI-compatible API: Unified inference gateway with API key authentication and usage tracking
  • Multi-tenancy: Workspace-based resource isolation with fine-grained RBAC
  • Production-ready Observability: Integrated metrics collection and Grafana dashboards
  • Flexible Model Storage: Support for HuggingFace Hub and file-based model registries

Documentation

Visit docs.neutree.ai for installation guides, tutorials, and API references.

Development

Design Documents

Technical design documents for contributors are available in the docs/ directory:

Contributing

Prerequisites

  • Go 1.23+
  • Docker
  • Make

Common workflows

# Build all binaries
make build

# Run unit tests
make test

# Run linter
make lint

# Run database tests
make db-test

# Quick iteration: rebuild and restart local containers
make docker-test-api
make docker-test-core

Roadmap

  • More accelerator support (e.g., Intel XPU)
  • Inference endpoint auto-scaling
  • External KV cache integration
  • Quota and usage limits
  • GPU memory hard isolation
  • More inference engine adapters
  • External endpoint support for unified management of local and external model services

Community

License

Neutree is licensed under the Apache License 2.0.

About

Enterprise-grade Private Model-as-a-Service Platform

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors