A cross-platform, high-performance inference engine for Large Language Models (LLMs) with an OpenAI-compatible API.
- 🚀 Universal Platform Support: Runs on Apple Silicon, x86_64, ARM, RISC-V, and more
- 🔥 High Performance: Optimized with platform-specific SIMD instructions
- 📦 Memory Efficient: Advanced quantization (Q8_0, Q4_0) and PagedAttention
- 🌐 OpenAI Compatible: Drop-in replacement for the OpenAI API (see the example request after this list)
- 🛠️ Production Ready: Docker, Kubernetes, and auto-scaling support
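Because the server speaks the OpenAI wire format, an OpenAI-style HTTP request can be sent to it directly. A minimal sketch, assuming the server started in the steps below is listening on localhost:8000; the model identifier `qwen2.5-0.5b` is a guess based on the converted file name, so check `GET /v1/models` for the name your server actually reports:

```bash
# Minimal chat completion request against the OpenAI-compatible endpoint.
# Assumes the server is listening on localhost:8000; the model name below is
# an assumption based on the converted file name in the steps that follow.
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "qwen2.5-0.5b",
        "messages": [{"role": "user", "content": "Hello!"}],
        "max_tokens": 64
      }'
```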
```bash
# Clone the repository
git clone https://github.com/swchoi1994/venus.git
cd venus

# Build the engine
./build.sh

# Download and convert a model
python scripts/download_model.py --model Qwen/Qwen2.5-0.5B
python scripts/convert_hf_model.py --input models/Qwen2.5-0.5B --output models/qwen2.5-0.5b.bin

# Start the API server
./target/release/venus --model-dir ./models --port 8000
```

System requirements:

- CPU: 8+ cores (Apple M1 or equivalent)
- RAM: 32GB minimum
- Storage: 100GB for models
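Because the API is a drop-in replacement, existing OpenAI SDK tooling can be pointed at a local venus instance by overriding the base URL. A sketch under the assumption that your client honors the standard `OPENAI_BASE_URL` / `OPENAI_API_KEY` environment variables (recent versions of the official `openai` Python SDK do); the dummy key is a placeholder, so adjust it if your deployment enforces authentication:

```bash
# Redirect OpenAI clients to the local venus server instead of api.openai.com.
# OPENAI_BASE_URL is read by recent versions of the official openai Python SDK,
# among other clients; the dummy key is a placeholder assumption.
export OPENAI_BASE_URL="http://localhost:8000/v1"
export OPENAI_API_KEY="dummy-key"

# Sanity check: list the models the server reports (standard OpenAI endpoint)
curl "$OPENAI_BASE_URL/models"
```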
See docs/ for detailed documentation.
MIT License - see the LICENSE file for details.