fix: add security_risk and summary to tool examples for non-native function calling by ixchio · Pull Request #2251 · OpenHands/software-agent-sdk

ixchio · 2026-03-01T14:05:34Z

Summary

This PR fixes issue #2175 where models using prompt-based function calling (native_tool_calling=False) would fail to provide the security_risk parameter because the examples in the system prompt didn't demonstrate it.

Problem

When using weaker models (like qwen2.5-coder:7b) that don't support native function calling, the SDK falls back to prompt-based function calling. This injects tool definitions and examples directly into the system prompt. However, these examples were missing the security_risk and summary parameters, causing validation errors:

Error validating args {"command": "pwd && ls"} for tool 'terminal': 
Failed to provide security_risk field in tool 'terminal'

The model didn't know to include these fields because the examples didn't show them.

Solution

Added security_risk and summary parameters to all tool examples in:

openhands-sdk/openhands/sdk/llm/mixins/fn_call_converter.py (TOOL_EXAMPLES dict)
openhands-sdk/openhands/sdk/agent/prompts/in_context_learning_example.j2 (template)

Now the examples teach models to include these parameters:

<function=terminal>
<parameter=command>pwd && ls</parameter>
<parameter=security_risk>LOW</parameter>
<parameter=summary>Check current directory and list files</parameter>
</function>

Details

security_risk is required when LLMSecurityAnalyzer is configured
summary is optional but recommended for action transparency
Both parameters are always added to tool schemas via add_security_risk_prediction=True

Testing

All existing tests pass (27 fn_call_converter tests, 21 security_risk extraction tests)
Pre-commit hooks pass

Fixes #2175

ixchio · 2026-03-01T14:10:59Z

Note on Failing API Breakage Checks

The failing API breakage checks are not related to this PR's changes.

The failure is caused by PR #2243 (fix: override server_image default to None in DockerDevWorkspace) which was intentionally merged to main on Feb 28. This PR changed DockerDevWorkspace.server_image default from a specific image to None, which Griffe detects as a breaking change.

My changes only modify:

String constants in TOOL_EXAMPLES dictionary (internal prompt examples)
The in_context_learning_example.j2 template (prompt content)

Neither of these affect the public SDK API surface.

For openhands-sdk: "No breaking changes detected" ✓

…nction calling This fixes issue OpenHands#2175 where models using prompt-based function calling (native_tool_calling=False) would fail to provide the security_risk parameter because the examples in the system prompt didn't show it. Changes: - Add security_risk parameter to all tool examples in TOOL_EXAMPLES dict - Add summary parameter to all tool examples for transparency - Update in_context_learning_example.j2 template with the same parameters The security_risk parameter is required when LLMSecurityAnalyzer is active, and the summary parameter is always recommended for action transparency. Fixes: OpenHands#2175 Co-authored-by: openhands <openhands@all-hands.dev>

ixchio · 2026-03-03T05:43:25Z

@all-hands-bot roast it

enyst

Hey, thank you for the contribution! Yes… yes I think missing security risk from prompts was causing models to miss it in responses.

I’m curious though, is security risk always supposed to be present? It’s worth making sure what happens when the security analyzer is disabled.

On another note, about the summary, I wonder if including it affects performance, it might be worth to run a mini-evaluation. Maybe it’s too small and obvious 🤷‍♂️ but I don’t know

enyst · 2026-03-03T06:49:51Z

@OpenHands Do a /codereview-roasted on this PR, after you read all comments, and try to understand deeply how the code works. Post your review directly on github as a comment.

openhands-ai · 2026-03-03T06:50:24Z

I'm on it! enyst can track my progress at all-hands.dev

enyst · 2026-03-03T06:56:45Z

Taste rating: 🟡 Acceptable (but sloppy around the exact bug you’re claiming to fix)

You’re fixing a real problem (prompt-mocked tool calling models missing security_risk), and the general direction is fine. But the change is incomplete in a way that will keep the same class of failure alive for at least one of the default tools.

[CRITICAL ISSUES]

[openhands-sdk/openhands/sdk/llm/mixins/fn_call_converter.py, Lines 351–380] Missing security_risk in task_tracker examples
- TaskTrackerTool is explicitly readOnlyHint=False (see openhands-tools/openhands/tools/task_tracker/definition.py), and make_llm_completion() always exposes security_risk in tool schemas.
- When LLMSecurityAnalyzer is enabled, Agent._extract_security_risk() raises if the field is missing.
- Your examples for task_tracker still don’t show security_risk, so prompt-based FC models will keep “learning” to omit it, and you’ll get the same validation error — just for a different tool.
- Actionable fix: add <parameter=security_risk>…</parameter> to both task_tracker examples (view and plan).
[openhands-sdk/openhands/sdk/agent/prompts/in_context_learning_example.j2, Lines 166–168] finish example is invalid (missing required message)
- FinishAction requires message: str (openhands-sdk/openhands/sdk/tool/builtins/finish.py).
- This template shows <function=finish> without a message parameter, which teaches models the wrong call shape.
- Even if this file isn’t currently wired into the non-native FC path, it’s shipped in the SDK and is now being edited, so it should not contain known-invalid examples.
- Actionable fix: include <parameter=message>…</parameter> in that example (and keep summary if you want).

[IMPROVEMENT OPPORTUNITIES]

[openhands-sdk/openhands/sdk/llm/mixins/fn_call_converter.py, Lines 88–90] Misleading comment about “always added” fields
- summary is always added to the schema.
- security_risk is conditionally added (suppressed for read-only tools via ToolDefinition._get_tool_schema).
- Not a huge deal, but comments that state “always” while the code does “sometimes” rot fast.
Token/perf tradeoff for summary
- You’re now explicitly demonstrating summary everywhere, which will nudge weaker models to always emit it. That’s extra tokens every tool call.
- It’s not a correctness problem (agent will auto-generate if missing), but it is a throughput/cost knob. Worth a quick sanity check on any evals that are sensitive to token budget.

[TESTING GAPS]

No test proves the thing you’re fixing for all default tools.
- If this PR is meant to prevent “missing security_risk” regressions, add a focused test that exercises the non-native FC path with task_tracker present and asserts the model output is accepted when LLMSecurityAnalyzer is enabled.

Verdict: ✅ likely worth merging after the missing task_tracker.security_risk and the invalid finish example are fixed.

Key insight: If you’re going to “teach the model” via examples, leaving one default tool with the old broken example defeats the entire point.

…ples' into fix/pr-2251-review-feedback

… example Addresses review feedback: - Add security_risk parameter to task_tracker 'view' and 'plan' examples (task_tracker has readOnlyHint=False so requires security_risk) - Add required 'message' parameter to finish example in in_context_learning_example.j2 (FinishAction.message is a required field) Co-authored-by: openhands <openhands@all-hands.dev>

ixchio · 2026-03-03T07:13:49Z

Addressed the review feedback:

Fixes

Added security_risk to task_tracker examples (fn_call_converter.py)
- task_tracker has readOnlyHint=False, so it requires security_risk when LLMSecurityAnalyzer is enabled
- Added <parameter=security_risk>LOW</parameter> to both view and plan examples
Added required message parameter to finish example (in_context_learning_example.j2)
- FinishAction.message is a required field
- The example was teaching models the wrong call shape

Note on `security_risk` for read-only tools

To clarify @enyst's question: security_risk is NOT always required. It's only added to tool schemas when:

add_security_risk_prediction=True (default)
AND the tool is NOT read-only (readOnlyHint != True)

finish has readOnlyHint=True, so it doesn't need security_risk in examples. The same applies to other read-only tools like browser_get_state, browser_get_content, etc.

Regarding @enyst's question about summary affecting performance - it's a reasonable concern. However, summary is optional and auto-generated if missing, so models can skip it when they want. The examples just demonstrate the full format.

enyst · 2026-03-08T07:06:21Z

Please don't worry about updating, nor about the checks (fixing those separately, they're only unable to comment at the moment)

xingyaoww

Thank you!

ixchio marked this pull request as ready for review March 1, 2026 14:09

ixchio force-pushed the fix/add-security-risk-to-tool-examples branch from 2b9eb62 to 975265e Compare March 1, 2026 14:13

ixchio mentioned this pull request Mar 1, 2026

CI: API breakage checks failing on all PRs due to DockerDevWorkspace.server_image change #2252

Closed

ixchio added 2 commits March 2, 2026 21:56

Merge branch 'main' into fix/add-security-risk-to-tool-examples

c653bd5

Merge branch 'main' into fix/add-security-risk-to-tool-examples

26b42c0

enyst reviewed Mar 3, 2026

View reviewed changes

This comment was marked as duplicate.

Sign in to view

openhands-agent added 2 commits March 3, 2026 07:11

Merge remote-tracking branch 'fork/fix/add-security-risk-to-tool-exam…

4263811

…ples' into fix/pr-2251-review-feedback

ixchio added 3 commits March 5, 2026 11:53

Merge branch 'main' into fix/add-security-risk-to-tool-examples

f707da6

Merge branch 'main' into fix/add-security-risk-to-tool-examples

e22c88e

Merge branch 'main' into fix/add-security-risk-to-tool-examples

fc05839

ixchio requested a review from enyst March 6, 2026 13:09

enyst requested a review from VascoSch92 March 8, 2026 04:58

enyst mentioned this pull request Mar 8, 2026

refactor(sdk/llm/mixim): separate data from logic #2354

Merged

5 tasks

Merge branch 'main' into fix/add-security-risk-to-tool-examples

438fe15

Merge branch 'main' into fix/add-security-risk-to-tool-examples

a740aa9

xingyaoww approved these changes Mar 10, 2026

View reviewed changes

xingyaoww enabled auto-merge (squash) March 10, 2026 16:03

xingyaoww merged commit b411762 into OpenHands:main Mar 10, 2026
19 of 21 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: add security_risk and summary to tool examples for non-native function calling#2251

fix: add security_risk and summary to tool examples for non-native function calling#2251
xingyaoww merged 10 commits intoOpenHands:mainfrom
ixchio:fix/add-security-risk-to-tool-examples

ixchio commented Mar 1, 2026

Uh oh!

ixchio commented Mar 1, 2026

Uh oh!

ixchio commented Mar 3, 2026

Uh oh!

enyst left a comment

Uh oh!

enyst commented Mar 3, 2026

Uh oh!

openhands-ai bot commented Mar 3, 2026

Uh oh!

enyst commented Mar 3, 2026

Uh oh!

This comment was marked as duplicate.

ixchio commented Mar 3, 2026

Uh oh!

enyst commented Mar 8, 2026

Uh oh!

xingyaoww left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ixchio commented Mar 1, 2026

Summary

Problem

Solution

Details

Testing

Uh oh!

ixchio commented Mar 1, 2026

Note on Failing API Breakage Checks

Uh oh!

ixchio commented Mar 3, 2026

Uh oh!

enyst left a comment

Choose a reason for hiding this comment

Uh oh!

enyst commented Mar 3, 2026

Uh oh!

openhands-ai bot commented Mar 3, 2026

Uh oh!

enyst commented Mar 3, 2026

[CRITICAL ISSUES]

[IMPROVEMENT OPPORTUNITIES]

[TESTING GAPS]

Uh oh!

This comment was marked as duplicate.

ixchio commented Mar 3, 2026

Fixes

Note on security_risk for read-only tools

Uh oh!

enyst commented Mar 8, 2026

Uh oh!

xingyaoww left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Note on `security_risk` for read-only tools