fix: deserialize tool call args #4176

ryan-lempka · 2025-11-07T05:00:14Z

Overview

Parse tool call arguments JSON strings into structured JSON before Jinja rendering to fix double-encoding and enable iteration.

Details

Normalize:
- messages[*].tool_calls[*].function.arguments
- messages[*].function_call.arguments
Normalization isolated from messages() and only applied immediately before Jinja rendering

Where to start

oai.rs: normalize_tool_arguments_in_messages()

Tests

|tojson no longer double-encodes.
|items iteration works.
Legacy path parsed.
Malformed JSON passes through unchanged.

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Fixes GitHub issue: #4161

Summary by CodeRabbit

Improvements
- Improved consistency and reliability in processing AI tool function arguments within chat completion requests. Enhanced handling ensures proper normalization of tool arguments across both current and legacy formats, leading to more predictable behavior.
Tests
- Added comprehensive test coverage for tool argument normalization across multiple scenarios, including edge cases.

coderabbitai · 2025-11-07T05:03:07Z

Walkthrough

Adds a new helper function normalize_tool_arguments_in_messages to deserialize tool argument strings into JSON objects within message processing. The function is integrated into NvCreateChatCompletionRequest::messages and includes comprehensive tests for normalization behavior, legacy function calls, and edge cases.

Changes

Cohort / File(s)	Summary
Tool argument normalization `lib/llm/src/preprocessor/prompt/template/oai.rs`	New helper function `normalize_tool_arguments_in_messages` that traverses messages and deserializes tool argument strings into JSON objects/arrays for both `tool_calls.function.arguments` and `function_call.arguments`. Integrated into request message processing with test coverage for deserialization, iteration, legacy handling, and malformed JSON passthrough.

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Deserialize logic correctness: Verify string-to-JSON conversion handles all supported formats and edge cases
Error handling: Confirm malformed JSON is handled gracefully per design
Integration point: Review how normalization fits into existing message processing pipeline
Test coverage: Validate test cases cover tool_calls, function_call, and error scenarios

Poem

🐰 Hops through messages with delight,
Arguments once tangled, now shining bright!
From strings to JSON, a transformation so neat,
Tool calls unwrapped—the preprocessing's complete! ✨

Pre-merge checks

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 66.67% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title 'fix: deserialize tool call args' clearly and concisely describes the main change - deserializing tool call arguments from JSON strings.
Description check	✅ Passed	PR description covers all required sections: overview clearly states the purpose, details specify exact paths being modified, starting points identified, tests outlined, and related issue linked with action keyword.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (2)

lib/llm/src/preprocessor/prompt/template/oai.rs (2)
124-156: LGTM! Clean implementation with solid error handling.

The function correctly normalizes JSON string arguments into parsed objects for both current (tool_calls) and legacy (function_call) formats. The graceful handling of malformed JSON (leaving it unchanged) aligns with the PR objectives.

Minor style suggestion: The Result::Ok pattern can be simplified to just Ok:
-                        if let Result::Ok(parsed) = serde_json::from_str(s) {
+                        if let Ok(parsed) = serde_json::from_str(s) {
                             *args = parsed;
                         }
Apply the same simplification at line 149.

740-847: Good test coverage for the primary scenarios.

The tests validate the key behaviors: prevention of double-encoding, support for iteration, legacy format handling, and malformed JSON passthrough. The order-insensitive assertion at line 804 is a nice touch.

Consider adding a few edge-case tests to strengthen coverage:

Arguments already parsed (not strings): Verify that arguments already in object form are left unchanged.

Multiple tool_calls per message: Ensure all tool calls in a single message are normalized.

Array-type arguments: Test with "arguments": "[1,2,3]" to confirm array deserialization works.

Example test for case 1:
#[test]
fn test_normalize_tool_arguments_already_object() {
    let mut messages = serde_json::Value::Array(vec![serde_json::json!({
        "role": "assistant",
        "tool_calls": [{
            "type": "function",
            "function": {
                "name": "f",
                "arguments": {"key": "value"}  // Already an object
            }
        }]
    })]);
    
    normalize_tool_arguments_in_messages(&mut messages);
    
    // Should remain unchanged
    assert_eq!(
        messages[0]["tool_calls"][0]["function"]["arguments"],
        serde_json::json!({"key": "value"})
    );
}

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between f509493 and eaadff1.

📒 Files selected for processing (1)

lib/llm/src/preprocessor/prompt/template/oai.rs (2 hunks)

🧰 Additional context used

🧬 Code graph analysis (1)

lib/llm/src/preprocessor/prompt/template/oai.rs (2)

lib/llm/src/preprocessor/prompt.rs (2)

messages (51-51)

model (50-50)

lib/llm/src/preprocessor/prompt/template/tokcfg.rs (1)

tojson (114-146)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (12)

GitHub Check: sglang (amd64)
GitHub Check: operator (amd64)
GitHub Check: vllm (arm64)
GitHub Check: vllm (amd64)
GitHub Check: Build and Test - dynamo
GitHub Check: tests (launch/dynamo-run)
GitHub Check: clippy (lib/bindings/python)
GitHub Check: tests (lib/runtime/examples)
GitHub Check: tests (lib/bindings/python)
GitHub Check: tests (.)
GitHub Check: clippy (launch/dynamo-run)
GitHub Check: clippy (.)

🔇 Additional comments (1)

lib/llm/src/preprocessor/prompt/template/oai.rs (1)

163-180: Integration looks correct.

The normalization is properly positioned: after serialization and before template rendering. The unconditional application matches the PR objectives, and the order of operations with may_be_fix_msg_content is appropriate since they operate on different message fields.

rmccorm4 · 2025-11-07T05:29:50Z

CC @2ez4bz

lib/llm/src/preprocessor/prompt/template/oai.rs

rmccorm4 · 2025-11-07T20:05:49Z

@ryan-lempka thanks for fixing this! based on the source issue:

looks like this PR addresses the escaped json in bullet point 1
does this PR also address the dictionary arguments case in bullet point 2 below? Seems like it might from test_normalize_tool_arguments_items_loop ?

E.g. this line in Qwen3 Coder's template:
{%- for args_name, args_value in tool_call.arguments|items %}.

For such templates, the dynamo frontend will return a 500 error code as it fails to render the template entirely.

ryan-lempka · 2025-11-07T20:10:26Z

@ryan-lempka thanks for fixing this! based on the source issue:

looks like this PR addresses the escaped json in bullet point 1

does this PR also address the dictionary arguments case in bullet point 2 below? Seems like it might from test_normalize_tool_arguments_items_loop ?

E.g. this line in Qwen3 Coder's template:
{%- for args_name, args_value in tool_call.arguments|items %}.
For such templates, the dynamo frontend will return a 500 error code as it fails to render the template entirely.

@rmccorm4 yes, this PR aims to fix everything outlined in #4161. Everything is validated with unit tests. The example you give there is the unit test: test_normalize_tool_arguments_items_loop

rmccorm4 · 2025-11-07T20:23:13Z

Thanks @ryan-lempka , think it just needs a rebase with merge conflicts resolved from your other merged add_generation_prompt PR, and a quick additional input case mentioned by @indrajit96 - nice work!

2ez4bz

LGTM! Not sure if my approval does anything, but doing it anyway :)

rlempka · 2025-11-07T21:13:04Z

Thanks @ryan-lempka , think it just needs a rebase with merge conflicts resolved from your other merged add_generation_prompt PR, and a quick additional input case mentioned by @indrajit96 - nice work!

@rmccorm4 rebase done and @indrajit96 test request added

Signed-off-by: Ryan Lempka <[email protected]>

rmccorm4 · 2025-11-08T01:27:17Z

Pulled in #4198

ryan-lempka requested review from ayushag-nv, elyasmnvidian and rmccorm4 November 7, 2025 05:00

ryan-lempka self-assigned this Nov 7, 2025

ryan-lempka requested a review from a team as a code owner November 7, 2025 05:00

pull-request-size bot added the size/L label Nov 7, 2025

github-actions bot added the fix label Nov 7, 2025

coderabbitai bot reviewed Nov 7, 2025

View reviewed changes

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 05:17 Inactive

2ez4bz reviewed Nov 7, 2025

View reviewed changes

lib/llm/src/preprocessor/prompt/template/oai.rs Outdated Show resolved Hide resolved

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 16:14 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 16:15 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 16:25 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 16:26 Inactive

ryan-lempka force-pushed the rlempka/fix-deserialize-tool-call-args branch from 3b2196a to 10fedd2 Compare November 7, 2025 16:30

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 16:30 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 16:31 Inactive

ayushag-nv approved these changes Nov 7, 2025

View reviewed changes

indrajit96 self-requested a review November 7, 2025 19:24

indrajit96 reviewed Nov 7, 2025

View reviewed changes

lib/llm/src/preprocessor/prompt/template/oai.rs Show resolved Hide resolved

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 20:08 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 20:09 Inactive

rmccorm4 added the frontend `python -m dynamo.frontend` and `dynamo-run in=http|text|grpc` label Nov 7, 2025

2ez4bz approved these changes Nov 7, 2025

View reviewed changes

ryan-lempka force-pushed the rlempka/fix-deserialize-tool-call-args branch from f57788c to fe8e5a8 Compare November 7, 2025 21:11

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 21:11 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 21:12 Inactive

rmccorm4 approved these changes Nov 7, 2025

View reviewed changes

rmccorm4 enabled auto-merge (squash) November 7, 2025 21:27

ryan-lempka added 6 commits November 7, 2025 22:13

fix: deserialize tool call args

aaea189

Signed-off-by: Ryan Lempka <[email protected]>

style: minor style improvements

b0a3cb2

fix: only normalize tool args for render

5e30d46

style: DRY principle for test import

4cf067c

Signed-off-by: Ryan Lempka <[email protected]>

chore: add test with multimodal content

214e79e

chore: rebase

40bd379

ryan-lempka force-pushed the rlempka/fix-deserialize-tool-call-args branch from fe8e5a8 to 40bd379 Compare November 7, 2025 22:13

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 22:13 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 7, 2025 22:14 Inactive

rmccorm4 mentioned this pull request Nov 8, 2025

ci: Skip broken etcd_ha tests until fixed to unblock unrelated PRs #4198

Merged

Merge branch 'main' into rlempka/fix-deserialize-tool-call-args

bced069

copy-pr-bot bot temporarily deployed to GITLAB November 8, 2025 01:27 Inactive

copy-pr-bot bot temporarily deployed to GITLAB November 8, 2025 01:37 Inactive

rmccorm4 merged commit 51c4fe6 into main Nov 8, 2025
34 of 36 checks passed

rmccorm4 deleted the rlempka/fix-deserialize-tool-call-args branch November 8, 2025 02:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: deserialize tool call args #4176

fix: deserialize tool call args #4176

Uh oh!

ryan-lempka commented Nov 7, 2025 •

edited

Loading

Uh oh!

coderabbitai bot commented Nov 7, 2025 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

rmccorm4 commented Nov 7, 2025

Uh oh!

Uh oh!

Uh oh!

rmccorm4 commented Nov 7, 2025 •

edited

Loading

Uh oh!

ryan-lempka commented Nov 7, 2025

Uh oh!

rmccorm4 commented Nov 7, 2025

Uh oh!

2ez4bz left a comment

Uh oh!

rlempka commented Nov 7, 2025

Uh oh!

rmccorm4 commented Nov 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

fix: deserialize tool call args #4176

fix: deserialize tool call args #4176

Uh oh!

Conversation

ryan-lempka commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Details

Where to start

Tests

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Pre-merge checks

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

rmccorm4 commented Nov 7, 2025

Uh oh!

Uh oh!

Uh oh!

rmccorm4 commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ryan-lempka commented Nov 7, 2025

Uh oh!

rmccorm4 commented Nov 7, 2025

Uh oh!

2ez4bz left a comment

Choose a reason for hiding this comment

Uh oh!

rlempka commented Nov 7, 2025

Uh oh!

rmccorm4 commented Nov 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

ryan-lempka commented Nov 7, 2025 •

edited

Loading

coderabbitai bot commented Nov 7, 2025 •

edited

Loading

rmccorm4 commented Nov 7, 2025 •

edited

Loading