feat: add JSON mode and reasoning_effort support by kolmogorov-quyet · Pull Request #197 · NevaMind-AI/memU

kolmogorov-quyet · 2026-01-09T09:22:58Z

Summary

Add two new features to improve LLM output quality and control:

1. JSON Mode (`response_format="json_object"`)

Add response_format parameter to summarize() method
Enable JSON mode for LLM rankers in retrieve.py:
- _llm_rank_categories
- _llm_rank_items
- _llm_rank_resources
Enable JSON mode for _preprocess_conversation in memorize.py
Support in openai_sdk.py, wrapper.py, and http_client.py

2. Reasoning Effort (`reasoning_effort`)

Add reasoning_effort field to LLMConfig in settings.py
Accepts values: "low", "medium", "high", or None
Default value is "high" for better reasoning quality
Pass through openai_sdk.py to chat.completions.create()
Wire up in service.py when creating LLM clients

Why these changes?

JSON Mode: Ensures consistent JSON output from LLMs, reducing parsing errors like "No JSON object found"
Reasoning Effort: Enables control over reasoning depth for compatible models (OpenAI o-series, Cerebras, etc.)

Add two new features to improve LLM output quality and control: 1. JSON Mode (response_format="json_object"): - Add response_format parameter to summarize() method - Enable JSON mode for LLM rankers in retrieve.py: - _llm_rank_categories - _llm_rank_items - _llm_rank_resources - Enable JSON mode for _preprocess_conversation in memorize.py - Support in openai_sdk.py, wrapper.py, and http_client.py 2. Reasoning Effort (reasoning_effort="low"|"medium"|"high"): - Add reasoning_effort field to LLMConfig in settings.py - Default value is "high" for better reasoning quality - Pass through openai_sdk.py to chat.completions.create() - Wire up in service.py when creating LLM clients These features help ensure consistent JSON output from LLMs and enable control over reasoning depth for compatible models. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

sairin1202 · 2026-01-10T07:54:00Z

Hi kolmogorov-quyet, thanks for the contribution! One small suggestion: it might be better to make these arguments optional, since not all models support them. As it stands, passing them unconditionally can cause errors for some models, for example:
openai.BadRequestError: Error code: 400 - {'error': {'message': 'Unrecognized request argument supplied: reasoning_effort', 'type': 'invalid_request_error', 'param': None, 'code': None}

Merge branch 'NevaMind-AI:main' into feat/json-mode-reasoning-effort

d41fde2

kolmogorov-quyet closed this by deleting the head repository Jan 12, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add JSON mode and reasoning_effort support#197

feat: add JSON mode and reasoning_effort support#197
kolmogorov-quyet wants to merge 2 commits intoNevaMind-AI:mainfrom
kolmogorov-quyet:feat/json-mode-reasoning-effort

kolmogorov-quyet commented Jan 9, 2026

Uh oh!

sairin1202 commented Jan 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kolmogorov-quyet commented Jan 9, 2026

Summary

1. JSON Mode (response_format="json_object")

2. Reasoning Effort (reasoning_effort)

Why these changes?

Uh oh!

sairin1202 commented Jan 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

1. JSON Mode (`response_format="json_object"`)

2. Reasoning Effort (`reasoning_effort`)