You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+2-1Lines changed: 2 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -118,15 +118,16 @@ For more details see the <a href="https://docs.vllm.ai/en/stable/getting_started
118
118
-`min-tool-call-array-param-length`: the minimum possible length of array parameters in a tool call, optional, defaults to 1
119
119
-`tool-call-not-required-param-probability`: the probability to add a parameter, that is not required, in a tool call, optional, defaults to 50
120
120
-`object-tool-call-not-required-field-probability`: the probability to add a field, that is not required, in an object in a tool call, optional, defaults to 50
121
+
<!--
121
122
- `enable-kvcache`: if true, the KV cache support will be enabled in the simulator. In this case, the KV cache will be simulated, and ZQM events will be published when a KV cache block is added or evicted.
122
123
- `kv-cache-size`: the maximum number of token blocks in kv cache
123
124
- `block-size`: token block size for contiguous chunks of tokens, possible values: 8,16,32,64,128
124
125
- `tokenizers-cache-dir`: the directory for caching tokenizers
125
126
- `hash-seed`: seed for hash generation (if not set, is read from PYTHONHASHSEED environment variable)
126
127
- `zmq-endpoint`: ZMQ address to publish events
128
+
- `event-batch-size`: the maximum number of kv-cache events to be sent together, defaults to 16
127
129
- `failure-injection-rate`: probability (0-100) of injecting failures, optional, default is 0
128
130
- `failure-types`: list of specific failure types to inject (rate_limit, invalid_api_key, context_length, server_error, invalid_request, model_not_found), optional, if empty all types are used
129
-
-`event-batch-size`: the maximum number of kv-cache events to be sent together, defaults to 16
130
131
-->
131
132
In addition, as we are using klog, the following parameters are available:
132
133
-`add_dir_header`: if true, adds the file directory to the header of the log messages
0 commit comments