Skip to content

Commit 600cebc

Browse files
committed
server : update readme [no ci]
1 parent 2a5510e commit 600cebc

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

examples/server/README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -763,7 +763,7 @@ curl http://localhost:8080/v1/chat/completions \
763763

764764
### POST `/v1/embeddings`: OpenAI-compatible embeddings API
765765

766-
This endpoint requires that the model uses a pooling different than type `none`.
766+
This endpoint requires that the model uses a pooling different than type `none`. The embeddings are normalized using the Eucledian norm.
767767

768768
*Options:*
769769

@@ -799,9 +799,9 @@ See [OpenAI Embeddings API documentation](https://platform.openai.com/docs/api-r
799799

800800
### POST `/embeddings`: non-OpenAI-compatible embeddings API
801801

802-
This endpoint supports `--pooling none`. When used, the responses will contain the embeddings for all input tokens.
803-
Note that the response format is slightly different than `/v1/embeddings` - it does not have the `"data"` sub-tree and the
804-
embeddings are always returned as vector of vectors.
802+
This endpoint supports all poolings, including `--pooling none`. When the pooling is `none`, the responses will contain the *unnormalized* embeddings for *all* input tokens. For all other pooling types, only the pooled embeddings are returned, normalized using Euclidian norm.
803+
804+
Note that the response format of this endpoint is different from `/v1/embeddings`.
805805

806806
*Options:*
807807

0 commit comments

Comments
 (0)