Commit 772ecdc

mayabar authored and smarunich committed
Use same version of tokenizer in both Dockerfile and Makefile (#132)
* - Use same version of tokenizer in both Dockerfile and Makefile
  - Fixes in readme file

  Signed-off-by: Maya Barnea <[email protected]>

* updates according PR's review

  Signed-off-by: Maya Barnea <[email protected]>

---------

Signed-off-by: Maya Barnea <[email protected]>
Signed-off-by: Sergey Marunich <[email protected]>
1 parent e2a7c97 commit 772ecdc

2 files changed: +8 -4 lines changed

Makefile

Lines changed: 4 additions & 2 deletions
@@ -39,14 +39,16 @@ help: ## Print help
 LDFLAGS ?= -extldflags '-L$(shell pwd)/lib'
 CGO_ENABLED=1
 TOKENIZER_LIB = lib/libtokenizers.a
+# Extract TOKENIZER_VERSION from Dockerfile
+TOKENIZER_VERSION := $(shell grep '^ARG TOKENIZER_VERSION=' Dockerfile | cut -d'=' -f2)
 
 .PHONY: download-tokenizer
 download-tokenizer: $(TOKENIZER_LIB)
 $(TOKENIZER_LIB):
 	## Download the HuggingFace tokenizer bindings.
-	@echo "Downloading HuggingFace tokenizer bindings..."
+	@echo "Downloading HuggingFace tokenizer bindings for version $(TOKENIZER_VERSION)..."
 	mkdir -p lib
-	curl -L https://github.com/daulet/tokenizers/releases/download/v1.22.1/libtokenizers.$(TARGETOS)-$(TARGETARCH).tar.gz | tar -xz -C lib
+	curl -L https://github.com/daulet/tokenizers/releases/download/$(TOKENIZER_VERSION)/libtokenizers.$(TARGETOS)-$(TARGETARCH).tar.gz | tar -xz -C lib
 	ranlib lib/*.a
 
 ##@ Development
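
The new `TOKENIZER_VERSION` assignment can be tried outside of `make`. The following is a minimal sketch, not part of the commit: it writes a hypothetical Dockerfile fragment to a scratch directory (the `ARG` value `v1.22.1` is an assumption based on the previously hard-coded URL) and runs the same `grep | cut` pipeline. The empty-value check is an illustrative addition that the Makefile itself does not perform.

```shell
#!/bin/sh
set -eu

# Work in a scratch directory so the real Dockerfile is untouched.
workdir=$(mktemp -d)
trap 'rm -rf "$workdir"' EXIT

# Hypothetical Dockerfile fragment; the value in the repository may differ.
cat > "$workdir/Dockerfile" <<'EOF'
FROM scratch
ARG TOKENIZER_VERSION=v1.22.1
EOF

# Same pipeline as the Makefile's $(shell ...) call:
# keep the matching ARG line, then take everything after the first '='.
version=$(grep '^ARG TOKENIZER_VERSION=' "$workdir/Dockerfile" | cut -d'=' -f2)

# Illustrative guard: an empty value would otherwise yield a malformed download URL.
if [ -z "$version" ]; then
    echo "TOKENIZER_VERSION not found in Dockerfile" >&2
    exit 1
fi

echo "tokenizer version: $version"
```

Because `cut` returns the second `=`-separated field verbatim, a quoted default or a trailing comment on the `ARG` line would end up in the value, so the pipeline relies on the Dockerfile keeping that line in the plain `ARG TOKENIZER_VERSION=<tag>` form.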

README.md

Lines changed: 4 additions & 2 deletions
@@ -116,13 +116,15 @@ For more details see the <a href="https://docs.vllm.ai/en/stable/getting_started
 - `min-tool-call-array-param-length`: the minimum possible length of array parameters in a tool call, optional, defaults to 1
 - `tool-call-not-required-param-probability`: the probability to add a parameter, that is not required, in a tool call, optional, defaults to 50
 - `object-tool-call-not-required-field-probability`: the probability to add a field, that is not required, in an object in a tool call, optional, defaults to 50
-- `enable-kvcache`: if true, the KV cache support will be enabled in the simulator. In this case, the KV cache will be simulated, and ZQM events will be published when a KV cache block is added or evicted.
+<!--
+- `enable-kvcache`: if true, the KV cache support will be enabled in the simulator. In this case, the KV cache will be simulated, and ZQM events will be published when a KV cache block is added or evicted.
 - `kv-cache-size`: the maximum number of token blocks in kv cache
 - `block-size`: token block size for contiguous chunks of tokens, possible values: 8,16,32,64,128
 - `tokenizers-cache-dir`: the directory for caching tokenizers
 - `hash-seed`: seed for hash generation (if not set, is read from PYTHONHASHSEED environment variable)
 - `zmq-endpoint`: ZMQ address to publish events
-
+- `event-batch-size`: the maximum number of kv-cache events to be sent together, defaults to 16
+-->
 In addition, as we are using klog, the following parameters are available:
 - `add_dir_header`: if true, adds the file directory to the header of the log messages
 - `alsologtostderr`: log to standard error as well as files (no effect when -logtostderr=true)
