Skip to content

Commit 6b3199a

Browse files
committed
chore: update vllm and xinference version (#2278)
(cherry picked from commit 327db27)
1 parent b1bcf16 commit 6b3199a

File tree

12 files changed

+66
-16
lines changed

12 files changed

+66
-16
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ KubeBlocks add-ons.
1717
| hive | hive-3.1.2<br>hive-3.1.3<br>hive-4.0.1 | A Helm chart for Kubernetes | |
1818
| influxdb | influxdb-2.7.11<br>influxdb-data-1.8.10<br>influxdb-meta-1.8.10 | InfluxDB is an open source time-series database. It is a core component of the TICK (Telegraf, InfluxDB(TM), Chronograf, Kapacitor) stack. | |
1919
| kafka | kafka-broker-2.7.0<br>kafka-broker-2.8.2<br>kafka-broker-3.3.2<br>kafka-broker-3.7.1<br>kafka-broker-3.8.1<br>kafka-broker-3.9.0<br>kafka-combine-3.3.2<br>kafka-combine-3.7.1<br>kafka-combine-3.8.1<br>kafka-combine-3.9.0<br>kafka-controller-3.3.2<br>kafka-controller-3.7.1<br>kafka-controller-3.8.1<br>kafka-controller-3.9.0<br>kafka-exporter-1.6.0 | Apache Kafka is a distributed streaming platform designed to build real-time pipelines and can be used as a message broker or as a replacement for a log aggregation solution for big data applications. | caiq1nyu vipshop |
20-
| llm | ggml-0.1.0<br>llm-0.2.7 | Large language models. | ApeCloud |
20+
| llm | llm-ggml-20241115-latest<br>llm-vllm-0.11.2 | Large language models. | ApeCloud |
2121
| loki | loki-1.0.0 | Loki is a horizontally-scalable, highly-available, multi-tenant log aggregation system inspired by Prometheus. It is designed to be very cost effective and easy to operate. | Chen-speculation |
2222
| mariadb | mariadb-10.6.15 | MariaDB is a high performance open source relational database management system that is widely used for web and application servers | yinmin |
2323
| milvus | milvus-2.5.13<br>milvus-v2.3.2 | A cloud-native vector database, storage for next generation AI applications. | leon-inf |
@@ -46,6 +46,6 @@ KubeBlocks add-ons.
4646
| vanilla-postgresql | supabase-15.6.1-138<br>anilla-postgresql-12.15.0<br>anilla-postgresql-14.7.0<br>anilla-postgresql-15.7.0 | Vanilla-PostgreSQL is compatible with the native PostgreSQL kernel, enabling it to quickly provide HA solutions for various variants based on the native PostgreSQL kernel. | kizuna-lek |
4747
| victoria-metrics | ictoria-metrics-1.0.0 | VictoriaMetrics is a fast, cost-effective and scalable monitoring solution and time series database. | sophon-zt ButterBright |
4848
| weaviate | weaviate-1.19.6 | Weaviate is an open-source vector database. It allows you to store data objects and vector embeddings from your favorite ML-models, and scale seamlessly into billions of data objects. | iziang |
49-
| xinference | xinference-0.11.0<br>xinference-0.11.0-cpu | Xorbits Inference(Xinference) is a powerful and versatile library designed to serve language, speech recognition, and multimodal models. | ApeCloud |
49+
| xinference | xinference-0.11.0<br>xinference-0.11.0-cpu<br>xinference-1.13.0<br>xinference-1.13.0-cpu | Xorbits Inference(Xinference) is a powerful and versatile library designed to serve language, speech recognition, and multimodal models. | ApeCloud |
5050
| yashandb | yashandb-23.1.1-100 | YashanDB is a new database system completely independently designed and developed by SICS. Based on classical database theories, it incorporates original Bounded Evaluation theory, Approximation theory, Parallel Scalability theory and Cross-Modal Fusion Computation theory, supports multiple deployment methods such as stand-alone/primary-standby, shared cluster, and distributed ones, covers OLTP/HTAP/OLAP transactions and analyzes mixed load scenarios, and is fully compatible with privatization and cloud infrastructure, providing clients with one-stop enterprise-level converged data management solutions to meet the needs of key industries such as finance, government, telecommunications and energy for high performance, concurrency and security. | JesseAtSZ shanshanying |
5151
| zookeeper | zookeeper-3.4.14<br>zookeeper-3.6.4<br>zookeeper-3.7.2<br>zookeeper-3.8.4<br>zookeeper-3.9.2<br>zookeeper-3.9.4 | Apache ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. | ApeCloud kissycn |

addons-cluster/xinference/templates/cluster.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ spec:
2626
cpu: {{ .Values.cpu | quote }}
2727
memory: {{ print .Values.memory "Gi" | quote }}
2828
{{ if .Values.cpuMode }}
29-
serviceVersion: 0.11.0-cpu
29+
serviceVersion: 1.13.0-cpu
3030
{{ else }}
31-
serviceVersion: 0.11.0
31+
serviceVersion: 1.13.0
3232
{{ end }}

addons/llm/templates/cmpd-ggml.yaml

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,11 +5,11 @@ metadata:
55
labels:
66
{{- include "llm.labels" . | nindent 4}}
77
annotations:
8+
apps.kubeblocks.io/skip-immutable-check: "true"
89
{{- include "llm.annotations" . | nindent 4 }}
910
spec:
1011
provider: kubeblocks
1112
serviceKind: ggml
12-
serviceVersion: 0.1.0
1313
updateStrategy: BestEffortParallel
1414
scripts:
1515
- name: scripts
@@ -30,7 +30,6 @@ spec:
3030
emptyDir: {}
3131
containers:
3232
- name: ggml
33-
image: {{ .Values.imageDev.registry | default ( .Values.image.registry | default "docker.io" ) }}/{{ .Values.imageDev.repository}}:{{ default .Chart.AppVersion .Values.imageDev.tag}}
3433
imagePullPolicy: {{default .Values.imageDev.pullPolicy "IfNotPresent"}}
3534
securityContext:
3635
runAsUser: 0

addons/llm/templates/cmpd-vllm.yaml

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,11 +5,11 @@ metadata:
55
labels:
66
{{- include "llm.labels" . | nindent 4}}
77
annotations:
8+
apps.kubeblocks.io/skip-immutable-check: "true"
89
{{- include "llm.annotations" . | nindent 4 }}
910
spec:
1011
provider: kubeblocks
1112
serviceKind: vllm
12-
serviceVersion: 0.2.7
1313
updateStrategy: BestEffortParallel
1414
scripts:
1515
- name: scripts
@@ -42,7 +42,6 @@ spec:
4242
emptyDir: {}
4343
containers:
4444
- name: vllm
45-
image: {{ .Values.image.registry | default "docker.io" }}/{{ .Values.image.repository}}:{{ default .Chart.AppVersion .Values.image.tag }}
4645
imagePullPolicy: {{ default "IfNotPresent" .Values.image.pullPolicy }}
4746
securityContext:
4847
runAsUser: 0
Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,20 @@
1+
apiVersion: apps.kubeblocks.io/v1
2+
kind: ComponentVersion
3+
metadata:
4+
name: llm-ggml
5+
labels:
6+
{{- include "llm.labels" . | nindent 4 }}
7+
annotations:
8+
{{- include "llm.annotations" . | nindent 4 }}
9+
spec:
10+
compatibilityRules:
11+
- compDefs:
12+
- {{ include "llm.cmpdNameGGML" . }}
13+
releases:
14+
- 20241115-latest
15+
releases:
16+
- name: 20241115-latest
17+
serviceVersion: 20241115-latest
18+
images:
19+
ggml: {{ .Values.imageDev.registry | default ( .Values.image.registry | default "docker.io" ) }}/{{ .Values.imageDev.repository }}:20241115-latest
20+
Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,20 @@
1+
apiVersion: apps.kubeblocks.io/v1
2+
kind: ComponentVersion
3+
metadata:
4+
name: llm-vllm
5+
labels:
6+
{{- include "llm.labels" . | nindent 4 }}
7+
annotations:
8+
{{- include "llm.annotations" . | nindent 4 }}
9+
spec:
10+
compatibilityRules:
11+
- compDefs:
12+
- {{ include "llm.cmpdNameVLLM" . }}
13+
releases:
14+
- 0.11.2
15+
releases:
16+
- name: 0.11.2
17+
serviceVersion: 0.11.2
18+
images:
19+
vllm: {{ .Values.image.registry | default "docker.io" }}/{{ .Values.image.repository }}:v0.11.2
20+

addons/llm/templates/scripts.yaml

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -19,16 +19,19 @@ data:
1919
if [[ "${domain,,}" == "baichuan-inc" ]]; then
2020
pip install transformers==4.33.1 tokenizers==0.13.3
2121
fi
22+
apt-get update
23+
apt install git-lfs
24+
CLONE_MODEL_SCRIPT="git lfs install; git clone https://huggingface.co/${MODEL_NAME}"
2225
country=`curl https://ifconfig.io/country_code`
2326
if [ "$country" == "CN" ]; then
2427
CLONE_MODEL_SCRIPT="git lfs install; git clone https://www.modelscope.cn/${MODEL_NAME}.git"
25-
export MODEL_NAME="../${MODEL_NAME##*/}"
28+
export MODEL_NAME="./${MODEL_NAME##*/}"
2629
else
2730
curl --max-time 10 https://huggingface.co/${MODEL_NAME} > /dev/null 2>&1
2831
code=$?
2932
if [ "$code" -ne 0 ]; then
3033
CLONE_MODEL_SCRIPT="git lfs install; git clone https://www.modelscope.cn/${MODEL_NAME}.git"
31-
export MODEL_NAME="../${MODEL_NAME##*/}"
34+
export MODEL_NAME="./${MODEL_NAME##*/}"
3235
fi
3336
fi
3437
echo "model=${MODEL_NAME}"

addons/llm/values.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ image:
33
pullPolicy: IfNotPresent
44
registry: docker.io
55
# Overrides the image tag whose default is the chart appVersion.
6-
tag: v0.7.2
6+
tag: v0.11.2
77
imageDev:
88
# refer: ghcr.io/abetlen/llama-cpp-python:latest
99
repository: apecloud/llama-cpp-python

addons/xinference/Chart.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,7 @@ version: 1.0.1
2121
# incremented each time you make changes to the application. Versions are not expected to
2222
# follow Semantic Versioning. They should reflect the version the application is using.
2323
# It is recommended to use it with quotes.
24-
appVersion: "0.11.0"
24+
appVersion: "1.13.0"
2525

2626
maintainers:
2727
- name: ApeCloud

addons/xinference/templates/cmpd.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,6 @@ spec:
1010
provider: kubeblocks
1111
description: Xinference is a powerful and versatile library designed to serve language, speech recognition, and multimodal models.
1212
serviceKind: xinference
13-
serviceVersion: 0.11.0
1413
updateStrategy: BestEffortParallel
1514
scripts:
1615
- name: scripts

0 commit comments

Comments
 (0)