fix: prevent Kimi K2 model from completing tasks without performing actions #6000

roomote · 2025-07-21T09:33:02Z

This PR fixes issue #5999 where the Kimi K2 model immediately marks tasks as completed without actually performing any fixes.

Changes Made

Added validation in attemptCompletionTool to check if actual work was done before allowing completion
- Checks if any tools were used or files were edited
- Specifically targets Kimi K2 models through case-insensitive model ID detection
Added model-specific instructions in the system prompt for Kimi K2 models
- Instructs the model to complete implementation before using attempt_completion
- Emphasizes the need to perform actual work before marking tasks complete
Added telemetry tracking for premature completion attempts
- Tracks when Kimi K2 models attempt to complete without doing work
- Helps monitor the effectiveness of this fix
Comprehensive test coverage for the new validation logic
- Tests various scenarios including tool usage, file edits, and model detection
- Ensures the fix works correctly for Kimi K2 while not affecting other models

Testing

All existing tests pass
Added new test suite specifically for Kimi K2 validation
Manually tested the validation logic

Fixes #5999

Important

Adds validation to ensure Kimi K2 model performs actions before marking tasks as complete, with model-specific instructions and telemetry tracking.

Behavior:
- Adds validation in attemptCompletionTool to ensure Kimi K2 model performs actions before task completion.
- Checks for tool usage or file edits before allowing completion.
- Targets Kimi K2 models using case-insensitive model ID detection.
Prompts:
- Adds Kimi K2 specific instructions in getRulesSection() in rules.ts.
- Instructs model to complete implementation before using attempt_completion.
Telemetry:
- Tracks premature completion attempts for Kimi K2 models in attemptCompletionTool.
Testing:
- Adds comprehensive tests in attemptCompletionTool.spec.ts for Kimi K2 validation logic.
- Tests scenarios including tool usage, file edits, and model detection.

^{This description was created by}^{for 74d6c57. You can customize this summary. It will automatically update as commits are pushed.}

…ctions - Add validation in attempt_completion to check if actual work was done - Add model-specific instructions in system prompt for Kimi K2 - Track premature completion attempts via telemetry - Add comprehensive tests for the new validation logic Fixes #5999

M-505 · 2025-07-21T10:03:21Z

will this be merged ?

daniel-lxs · 2025-07-22T22:47:03Z

This is not how we should fix this, it seems that this is a problem with the AI model itself, not Roo Code

roomote bot requested review from cte, jr and mrubens as code owners July 21, 2025 09:33

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap Jul 21, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Jul 21, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Jul 21, 2025

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. bug Something isn't working labels Jul 21, 2025

roomote bot mentioned this pull request Jul 21, 2025

kimi k2 unable to delegate task and immediatly completing it #5999

Closed

hannesrudolph added the Issue/PR - Triage New issue. Needs quick review to confirm validity and assign labels. label Jul 21, 2025

daniel-lxs closed this Jul 22, 2025

github-project-automation bot moved this from New to Done in Roo Code Roadmap Jul 22, 2025

github-project-automation bot moved this from Triage to Done in Roo Code Roadmap Jul 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: prevent Kimi K2 model from completing tasks without performing actions #6000

fix: prevent Kimi K2 model from completing tasks without performing actions #6000

Uh oh!

roomote bot commented Jul 21, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

M-505 commented Jul 21, 2025

Uh oh!

daniel-lxs commented Jul 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

fix: prevent Kimi K2 model from completing tasks without performing actions #6000

fix: prevent Kimi K2 model from completing tasks without performing actions #6000

Uh oh!

Conversation

roomote bot commented Jul 21, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes Made

Testing

Uh oh!

M-505 commented Jul 21, 2025

Uh oh!

daniel-lxs commented Jul 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

roomote bot commented Jul 21, 2025 •

edited by ellipsis-dev bot

Loading