
Shenma Deployment Tool (for docker-compose)

Introduction

Overall Architecture

Shenma adopts a microservice architecture.

The backend system is divided into three layers and four components. The three layers are: Gateway Layer, Service Layer, and Storage Layer. These three layers, plus the unified Operation & Maintenance Center that spans all layers, form the four major components.

Gateway Layer

The Gateway Layer is responsible for application distribution, load balancing, traffic control, and API authorization control; a small APISIX route sketch follows the component list.

  • Traffic forwarding and SSL offloading: Sangfor AD is recommended, but any other load balancer with SSL offloading capability can be used
  • Application distribution, traffic control, etc.: APISIX
  • Login authentication, authorization control: Keycloak, Trampoline, Kaptcha
    • User management component: Keycloak
    • Login trampoline: Trampoline
    • CAPTCHA service used in the login process: Kaptcha
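
As a concrete illustration of the gateway's role, the sketch below registers a route with the APISIX Admin API. The admin port (9180), API key, and upstream node are placeholder assumptions, not values taken from this deployment; consult the generated APISIX configuration for the real ones.

# Sketch: route /v1/completions traffic to the completion-server upstream.
# Port, admin key, and node address are assumptions; adjust to your deployment.
curl -X PUT http://127.0.0.1:9180/apisix/admin/routes/1 \
  -H "X-API-KEY: <your-admin-key>" \
  -d '{
    "uri": "/v1/completions",
    "upstream": {
      "type": "roundrobin",
      "nodes": { "completion-server:8080": 1 }
    }
  }'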

Service Layer

The Service Layer consists of several core services, currently including:

  • Proxy backend responsible for code completion: completion-server
  • Proxy backend responsible for chat services: chat-server

For both chat and completion, the proxy backends exist to shield clients from the differences between model APIs and to provide additional context-processing capabilities.
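
Assuming the proxies expose an OpenAI-compatible interface through the gateway (consistent with the /v1 base URL configured in step 4 below), a chat request might look like the following sketch; the host, API key, and model name are placeholders:

# Hypothetical chat request routed through the gateway to chat-server.
curl http://<machine-ip>:8090/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer <your-api-key>" \
  -d '{"model": "<model-name>", "messages": [{"role": "user", "content": "hello"}]}'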

Storage Layer

The Storage Layer consists of three stores (a quick health-check sketch follows the list):

  • Relational database: PostgreSQL
  • Key-value database: etcd
  • Cache: Redis
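
Once the stack is up, each store can be probed with standard client commands; the container names below are assumptions and should be adjusted to those in docker-compose.yml:

# Health checks for the Storage Layer; container names are placeholders.
docker exec <postgres-container> pg_isready -U postgres
docker exec <redis-container> redis-cli ping            # expects PONG
docker exec <etcd-container> etcdctl endpoint health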

Operation & Maintenance Center

The Operation & Maintenance Center covers monitoring and log aggregation (an Elasticsearch health check follows the list):

  • Grafana (optional)
  • Prometheus (optional)
  • Kibana (optional)
  • Elasticsearch (required)
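
Since Elasticsearch is the only required component here, it is worth verifying after deployment. 9200 is Elasticsearch's default HTTP port and may differ in this setup:

# Check cluster health; a "green" or "yellow" status means it is serving.
curl "http://localhost:9200/_cluster/health?pretty"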

Deployment Steps

0. Prerequisites

Using self-deployed model instances

  1. An x64 machine with at least 16 CPU cores, 32 GB of RAM, and 512 GB of storage, equipped with GPUs capable of serving model inference (at least 2× RTX 4090 or 1× A800)
  2. CentOS 7, or Ubuntu under WSL, with the necessary components installed, such as nvidia-docker and docker-compose

Using third-party API services (instead of self-deployed model instances)

  1. An x64 machine with at least 16 CPU cores, 32 GB of RAM, and 512 GB of storage
  2. CentOS 7 with docker, docker-compose, and other necessary components installed (a quick sanity check follows)
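
A minimal sanity check to run on the target machine before deploying (the GPU lines apply only to self-hosted model setups):

# Verify the container tooling is present.
docker --version
docker-compose --version
# GPU hosts only: confirm the NVIDIA driver and container runtime are visible.
nvidia-smi
docker info | grep -i nvidia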

1. Modify the configuration according to your requirements

vim deploy.sh
vim configure.sh
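
The configurable values live in these two scripts; a quick way to survey them before editing is to list their variable assignments:

# List shell variable assignments to see what can be tuned.
grep -nE '^[A-Za-z_]+=' deploy.sh configure.sh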

2. Execute the deployment script

bash deploy.sh
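
After the script finishes, it is worth confirming that every container came up cleanly; the service name in the logs command is a placeholder:

# List container status; everything should be "Up" (or "healthy").
docker-compose ps
# Inspect any service that is restarting or has exited.
docker-compose logs --tail=50 <service-name>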

3. Configure LLM API keys in the One-API backend

The default address is http://localhost:30000; the default account is root and the password is 123456.
Click "Channels", add a new channel, select the LLM provider, and fill in the name and key; the other fields can be left empty.

4. Configure APISIX address in Shenma plugin

Shenma baseurl: the default is http://{local machine IP}:8090/v1. (Note: using localhost may cause issues; it is recommended to look up the machine's actual IP address, e.g. with ip addr on Linux or ipconfig on Windows.)
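
To find the machine's IP on the Linux host and confirm the gateway responds at that base URL (any HTTP response, even an error status, shows the route is reachable):

# Print the host's LAN IP address (Linux).
hostname -I
# Probe the gateway; substitute the IP printed above.
curl -i http://<machine-ip>:8090/v1/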
