Skip to content

Json Error in evaluation process #10

@wkfdb

Description

@wkfdb

I was running the evaluation of the LongVT-RFT model and I found that the result is extremely low:
"videomme_w_subtitle_reward_tool": {
"alias": "videomme_w_subtitle_reward_tool",
"videomme_perception_score,none": 19.185185185185187,
"videomme_perception_score_stderr,none": "N/A",
"acc_score,none": 0.18555555555555556,
"acc_score_stderr,none": 0.00748283676164936,
"format_score,none": 0.27518518518518487,
"format_score_stderr,none": 0.008474372792357103
}

By checking the reason I found that the LongVT-RFT server gets the following json error:

Error in extracting tool call from response.
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] Traceback (most recent call last):
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] File "xxxxxxxxx/envs/eval-longvt/lib/python3.11/site-packages/vllm/entrypoints/openai/tool_parsers/hermes_tool_parser.py", line 142, in extract_tool_calls
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] raw_function_calls = [
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] ^
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] File "xxxxxxxxx/envs/eval-longvt/lib/python3.11/site-packages/vllm/entrypoints/openai/tool_parsers/hermes_tool_parser.py", line 143, in
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] json.loads(match[0] if match[0] else match[1])
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] File "xxxxxxxxx/envs/eval-longvt/lib/python3.11/json/init.py", line 346, in loads
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] return _default_decoder.decode(s)
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] ^^^^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] File "xxxxxxxxxx/envs/eval-longvt/lib/python3.11/json/decoder.py", line 337, in decode
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] obj, end = self.raw_decode(s, idx=_w(s, 0).end())
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] File "xxxxxxxxxx/envs/eval-longvt/lib/python3.11/json/decoder.py", line 353, in raw_decode
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] obj, end = self.scan_once(s, idx)
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] ^^^^^^^^^^^^^^^^^^^^^^
(APIServer pid=65904) ERROR 12-23 08:25:31 [hermes_tool_parser.py:168] json.decoder.JSONDecodeError: Unterminated string starting at: line 1 column 9 (char 8)

I didn't change any source codes and set the whole framework following the README file, any ideas on why the above JSON error occurs? I'm trying to find out.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions