Skip to content

Commit c643de8

Browse files
ngxsonggerganov
andauthored
Update examples/server/tests/features/embeddings.feature
Co-authored-by: Georgi Gerganov <[email protected]>
1 parent 3a64932 commit c643de8

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/server/tests/features/embeddings.feature

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ Feature: llama.cpp server
1010
And 42 as server seed
1111
And 2 slots
1212
# the bert-bge-small model has context size of 512
13-
# since the generated prompts are as big as the batch size, we need to set the batch size to 512
13+
# since the generated prompts are as big as the batch size, we need to set the batch size to <= 512
1414
# ref: https://huggingface.co/BAAI/bge-small-en-v1.5/blob/5c38ec7c405ec4b44b94cc5a9bb96e735b38267a/config.json#L20
1515
And 128 as batch size
1616
And 128 as ubatch size

0 commit comments

Comments
 (0)