Commit 26eb207

Add more comments on embedding deployment (#4929)
1 parent ca7cf03 commit 26eb207

2 files changed: +3 -1 lines changed

examples/deploy/embedding/client.py

Lines changed: 1 addition & 1 deletion
@@ -49,7 +49,7 @@ def run_client(host: str = '127.0.0.1', port: int = 8000):
     from swift.llm import run_deploy, DeployArguments
     with run_deploy(
             DeployArguments(
-                model='Qwen/Qwen3-Embedding-0.6B',
+                model='Qwen/Qwen3-Embedding-0.6B',  # GME/GTE models or your checkpoints are also supported
                 task_type='embedding',
                 infer_backend='vllm',
                 verbose=False,

examples/deploy/embedding/server.sh

Lines changed: 2 additions & 0 deletions
@@ -1,3 +1,5 @@
+# GME/GTE models or your checkpoints are also supported
+# pt/vllm/sglang supported
 CUDA_VISIBLE_DEVICES=0 swift deploy \
     --host 0.0.0.0 \
     --port 8000 \
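
For anyone trying out the deployment touched by this commit, here is a minimal sketch of how a client might query the server once it is running. It assumes the server exposes an OpenAI-style `/v1/embeddings` route on the host and port shown in the diff and returns the usual `data[i].embedding` schema. The route, the payload shape, the served model name, and the helper names `build_embedding_request`, `fetch_embeddings`, and `cosine_similarity` are illustrative assumptions and are not taken from this commit.

```python
import json
import math
import urllib.request


def build_embedding_request(texts, model='Qwen/Qwen3-Embedding-0.6B',
                            host='127.0.0.1', port=8000):
    """Build a POST request for an assumed OpenAI-style /v1/embeddings route."""
    # Assumption: the route and the served model name are not confirmed by the diff.
    url = f'http://{host}:{port}/v1/embeddings'
    body = json.dumps({'model': model, 'input': list(texts)}).encode('utf-8')
    return urllib.request.Request(
        url, data=body, headers={'Content-Type': 'application/json'}, method='POST')


def fetch_embeddings(texts, **kwargs):
    """Send the request; assumes the response follows the OpenAI embeddings schema."""
    with urllib.request.urlopen(build_embedding_request(texts, **kwargs)) as resp:
        payload = json.load(resp)
    return [item['embedding'] for item in payload['data']]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0
```

With the server up, `fetch_embeddings(['query text', 'document text'])` would return one vector per input under these assumptions, and `cosine_similarity` can then compare them. Only the standard library is used so the sketch does not depend on any particular client package.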
