Skip to content

Commit e6307ed

Browse files
SkylarKeltyclaude
andcommitted
feat: send include_reasoning: false to suppress think tags at source
Backends like sglang and vllm honour this flag and won't emit <think> blocks in the response content. We still strip them client-side as a fallback for backends that ignore the flag. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent fab2172 commit e6307ed

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

artemis/llm.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -327,6 +327,9 @@ async def chat_completion(
327327
body["tool_choice"] = "none"
328328
if response_format is not None:
329329
body["response_format"] = response_format
330+
# Ask the backend to suppress <think> reasoning blocks if supported
331+
# (works with sglang, vllm, and other OpenAI-compatible backends)
332+
body["include_reasoning"] = False
330333

331334
try:
332335
response = await client.post(

0 commit comments

Comments
 (0)