feat: add experimental flag to disable command execution in attempt_c… #4352

hannesrudolph · 2025-06-04T22:34:33Z

Related GitHub Issue

Closes: #4351

Description

This PR adds an experimental flag to disable the command parameter in the attempt_completion tool. The change addresses the issue where Roo can execute commands without first verifying that the task was completed successfully, violating the principle of step-by-step verification.

Key implementation details:

Added a new experimental flag DISABLE_COMPLETION_COMMAND that users can enable to test the new behavior
Modified attemptCompletionTool.ts to check the experimental flag before executing commands
Made system prompts dynamic - they now exclude command-related instructions when the experiment is enabled
The feature is backward compatible with opt-in behavior (defaults to false)

Important: This is implemented as an experiment to test the behavior. If it works well, we will simply remove the command execution feature entirely from attempt_completion without adding it as a permanent setting.

Design choices:

Used the existing experiments system for feature flagging to ensure consistency
Implemented dynamic prompt generation to provide a seamless experience when the flag is toggled
Added comprehensive test coverage for both enabled and disabled states

Test Procedure

Unit Tests:

Run npm test to execute all tests
New test files added:
- src/core/prompts/tools/__tests__/attempt-completion.experiment.test.ts (6 tests)
- src/core/tools/__tests__/attemptCompletionTool.experiment.test.ts (7 tests)
All existing tests updated to include the new experiment property

Manual Testing:

Open VSCode settings and navigate to Experimental Features
Enable "Disable command execution in attempt_completion"
Create a task that would normally execute a command (e.g., "Create a simple HTML file and show it")
Verify that:
- With flag disabled (default): Commands execute as normal
- With flag enabled: Commands are ignored, task completes without execution
- System prompts don't mention command execution when flag is enabled

Verification:

The AI assistant should use execute_command separately when the flag is enabled
No breaking changes to existing workflows when flag is disabled

Type of Change

🐛 Bug Fix: Non-breaking change that fixes an issue.
✨ New Feature: Non-breaking change that adds functionality.
💥 Breaking Change: Fix or feature that would cause existing functionality to not work as expected.
♻️ Refactor: Code change that neither fixes a bug nor adds a feature.
💅 Style: Changes that do not affect the meaning of the code (white-space, formatting, etc.).
📚 Documentation: Updates to documentation files.
⚙️ Build/CI: Changes to the build process or CI configuration.
🧹 Chore: Other changes that don't modify src or test files.

Pre-Submission Checklist

Screenshots / Videos

Settings UI - New Experimental Toggle:

The new "Disable command execution in attempt_completion" toggle appears in the Experimental Features section

Behavior Comparison:

Flag Disabled (Default): Commands execute normally after task completion
Flag Enabled: Commands are ignored, requiring separate execute_command tool usage

Documentation Updates

No documentation updates are required.
Yes, documentation updates are required.

Note: Since this is an experimental feature that will be removed entirely if successful (not moved to settings), no permanent documentation is needed.

Additional Notes

Files Changed Summary:

13 files modified
513 insertions, 16 deletions
2 new test files created
All tests passing

Impact Analysis:

No performance impact (simple conditional checks)
Fully backward compatible
Users can test their workflows before the feature is removed entirely

Future Plan:
If the experiment proves successful, we will remove the command execution capability from attempt_completion entirely, without moving it to a permanent setting. This ensures cleaner separation of concerns and better adherence to the step-by-step verification principle.

Important

Introduces an experimental flag to disable command execution in attempt_completion, with updates to code, tests, and localization.

Behavior:
- Adds DISABLE_COMPLETION_COMMAND flag to disable command execution in attempt_completion.
- Updates attemptCompletionTool.ts to respect the new flag, preventing command execution when enabled.
- Dynamic prompt generation excludes command instructions when the flag is enabled.
Experiments:
- Adds disableCompletionCommand to experiment.ts and related schemas.
Testing:
- New tests in attempt-completion.experiment.test.ts and attemptCompletionTool.experiment.test.ts to cover both enabled and disabled states.
- Updates existing tests to include the new experiment property.
Localization:
- Updates localization files to include descriptions for the new experimental feature.

^{This description was created by}^{for cdeef22. You can customize this summary. It will automatically update as commits are pushed.}

…ompletion tool

…st.ts

webview-ui/src/context/__tests__/ExtensionStateContext.test.tsx

src/core/tools/attemptCompletionTool.ts

…emove type assertion

daniel-lxs

Looks good to me!

It might be a good idea to expose this setting to the API configuration to allow disabling the suggested command parameter on evals and integration testing.

feat: add experimental flag to disable command execution in attempt_c…

f6004d0

…ompletion tool

github-project-automation bot added this to Roo Code Roadmap and Roo Code Roadmap Jun 4, 2025

github-project-automation bot moved this to Triage in Roo Code Roadmap Jun 4, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Jun 4, 2025

hannesrudolph and others added 5 commits June 4, 2025 16:36

fix: remove deprecation phase comments from attemptCompletionTool

adb13d2

Merge branch 'main' into attempt_completion_fix - resolve conflicts

4e2253c

feat: add translations for disable completion command experiment

5e160ca

fix: revert unintended package.json change

6d13d12

Rename attempt-completion.experiment.test.ts to attempt-completion.te…

d9ecb75

…st.ts

daniel-lxs reviewed Jun 4, 2025

View reviewed changes

webview-ui/src/context/__tests__/ExtensionStateContext.test.tsx Show resolved Hide resolved

webview-ui/src/context/__tests__/ExtensionStateContext.test.tsx Show resolved Hide resolved

src/core/tools/attemptCompletionTool.ts Outdated Show resolved Hide resolved

daniel-lxs moved this from Triage to PR [Draft / In Progress] in Roo Code Roadmap Jun 4, 2025

hannesrudolph added the PR - Draft / In Progress label Jun 5, 2025

fix: address PR feedback - restore autoCondenseContext in tests and r…

cdeef22

…emove type assertion

hannesrudolph marked this pull request as ready for review June 5, 2025 05:00

hannesrudolph requested review from cte and mrubens as code owners June 5, 2025 05:00

dosubot bot added the size:XL This PR changes 500-999 lines, ignoring generated files. label Jun 5, 2025

hannesrudolph moved this from PR [Draft / In Progress] to PR [Needs Prelim Review] in Roo Code Roadmap Jun 5, 2025

dosubot bot added the enhancement New feature or request label Jun 5, 2025

hannesrudolph added PR - Needs Preliminary Review and removed PR - Draft / In Progress labels Jun 5, 2025

daniel-lxs approved these changes Jun 7, 2025

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Jun 7, 2025

daniel-lxs moved this from PR [Needs Prelim Review] to PR [Needs Review] in Roo Code Roadmap Jun 7, 2025

daniel-lxs added PR - Needs Review and removed PR - Needs Preliminary Review labels Jun 7, 2025

mrubens approved these changes Jun 8, 2025

View reviewed changes

mrubens merged commit 67d238a into main Jun 8, 2025
26 checks passed

mrubens deleted the attempt_completion_fix branch June 8, 2025 14:14

github-project-automation bot moved this from New to Done in Roo Code Roadmap Jun 8, 2025

github-project-automation bot moved this from PR [Needs Review] to Done in Roo Code Roadmap Jun 8, 2025

hannesrudolph mentioned this pull request Jun 19, 2025

Remove experimental setting: Make command execution permanently disabled in attempt_completion #4882

Closed

roomote mentioned this pull request Jun 19, 2025

Fixes #4882: Remove experimental setting for command execution in attempt_completion #4884

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add experimental flag to disable command execution in attempt_c… #4352

feat: add experimental flag to disable command execution in attempt_c… #4352

Uh oh!

hannesrudolph commented Jun 4, 2025 •

edited by ellipsis-dev bot

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

daniel-lxs left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

feat: add experimental flag to disable command execution in attempt_c… #4352

feat: add experimental flag to disable command execution in attempt_c… #4352

Uh oh!

Conversation

hannesrudolph commented Jun 4, 2025 • edited by ellipsis-dev bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related GitHub Issue

Description

Test Procedure

Type of Change

Pre-Submission Checklist

Screenshots / Videos

Documentation Updates

Additional Notes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

daniel-lxs left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

hannesrudolph commented Jun 4, 2025 •

edited by ellipsis-dev bot

Loading

daniel-lxs left a comment •

edited

Loading