We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent bede3df commit a9653c8Copy full SHA for a9653c8
articles/gpt-oss/run-vllm.md
@@ -200,7 +200,7 @@ convo = Conversation.from_messages(
200
prefill_ids = encoding.render_conversation_for_completion(convo, Role.ASSISTANT)
201
202
# Harmony stop tokens (pass to sampler so they won't be included in output)
203
-stop_token_ids = encoding.stop_tokens_for_assistant_action()
+stop_token_ids = encoding.stop_tokens_for_assistant_actions()
204
205
# --- 2) Run vLLM with prefill ---
206
llm = LLM(
0 commit comments