Skip to content

Commit dc4c795

Browse files
committed
update readme
1 parent 9706afc commit dc4c795

File tree

2 files changed

+25
-20
lines changed

2 files changed

+25
-20
lines changed

README.md

Lines changed: 19 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -293,26 +293,27 @@ Chat UI界面:
293293

294294
以下模型经过测试可放心使用:
295295

296-
| Models / BackEnd | sentence_transformers | infinity | vllm|
297-
| ---------------------------------------------------- | --- | -------- |-------- |
298-
| bge-m3 |||× |
299-
| bge-vl || × |× |
300-
| jina-reranker-m0 || × |× |
296+
| Models / BackEnd | sentence_transformers | infinity | vllm|
297+
| ----------------------------------------------------------------------------------- | --------------- | -------------- |----------- |
298+
| bge-m3 ||||
299+
| bge-embedding ||||
300+
| bce-embedding ||||
301+
| puff ||||
302+
| piccolo-base-zh-embedding ||||
303+
| acge_text_embedding ||||
304+
| Yinka ||||
305+
| zpoint_large_embedding_zh ||||
306+
| xiaobu-embedding ||||
307+
| Conan-embedding-v1 ||||
308+
| qwen3-embedding ||||
309+
| ritrieve_zh_v1 ||||
310+
| jina-embeddings-v3 ||||
311+
| KoalaAI/Text-Moderation(文本审核/多分类,审核文本是否存在暴力、色情等) | × ||× |
312+
| protectai/deberta-v3-base-prompt-injection-v2(提示注入/2分类,审核文本为提示注入) | × ||× |
313+
| bge-vl || × |× |
314+
| jina-reranker-m0 || × |× |
301315
| bge-reranker |||× |
302-
| bge-embedding |||× |
303316
| bce-reranker |||× |
304-
| bce-embedding |||× |
305-
| puff |||× |
306-
| piccolo-base-zh-embedding |||× |
307-
| acge_text_embedding |||× |
308-
| Yinka |||× |
309-
| zpoint_large_embedding_zh |||× |
310-
| xiaobu-embedding |||× |
311-
| Conan-embedding-v1 |||× |
312-
| qwen3-embedding ||||
313-
| ritrieve_zh_v1 |||× |
314-
| KoalaAI/Text-Moderation(文本审核/多分类,审核文本是否存在暴力、色情等) | × ||× |
315-
| protectai/deberta-v3-base-prompt-injection-v2(提示注入/2分类,审核文本为提示注入) | × ||× |
316317

317318
目前 **ritrieve_zh_v1** C-MTEB榜单排行第一(MTEB: https://huggingface.co/spaces/mteb/leaderboard)
318319

gpt_server/model_worker/embedding_vllm.py

Lines changed: 6 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,8 +7,7 @@
77
from infinity_emb.inference.select_model import get_engine_type_from_config
88
from gpt_server.model_worker.base.model_worker_base import ModelWorkerBase
99
from gpt_server.model_worker.utils import get_embedding_mode
10-
import torch
11-
import vllm
10+
import numpy as np
1211
from vllm import LLM
1312

1413
label_to_category = {
@@ -71,6 +70,11 @@ async def get_embeddings(self, params):
7170
# ----------
7271
outputs = self.engine.embed(texts)
7372
embedding = [o.outputs.embedding for o in outputs]
73+
embeddings_np = np.array(embedding)
74+
# ------ L2归一化(沿axis=1,即对每一行进行归一化)-------
75+
norm = np.linalg.norm(embeddings_np, ord=2, axis=1, keepdims=True)
76+
normalized_embeddings_np = embeddings_np / norm
77+
embedding = normalized_embeddings_np.tolist()
7478

7579
ret["embedding"] = embedding
7680
return ret

0 commit comments

Comments
 (0)