feat: auto-approve commands based on risk level #2670

KJ7LNW · 2025-04-16T01:18:28Z

Context

This is similar to Cline's competing "auto-approve safe commands" feature—but approval in this PR is much smarter: command risk analysis is performed based on formal predicate logic.

For example, when set to read-only:

Roo will auto-execute only commands that have formally matched the predicate logic for read only (see below):

Implementation

Risk levels are formally defined through predicate logic and the model determines the risk level after producing in the command and risk analysis, so risk level determination is retrospective and accurate. When "disabled" (default), all system instructions for related risk analysis are excluded, so there is no change to Roo's default system prompt size.

Risk level total ordering:
    readOnly < reversibleChanges < complexChanges < serviceInterruptingChanges < destructiveChanges

S = F ∪ P ∪ N ∪ H ∪ M ∪ Z ∪ E
    F = storage: files, databases, git repos, disks, …
    P = processes, services, VMs, containers, …
    N = network configs, …
    H = hardware, physical controls, …
    M = memory states, locks, resources, …
    Z = security: permissions, keys, tokens, …
    E = temporal state: tasks, cron, timers, …

s₀ ∈ S:          initial state before C
s′ ∈ S′:         final state after C
atoms(x):        minimal addressable units
content(x,s):    accessible atoms in s
a ∉ content(x,s) if deleted, truncated, overwritten, corrupted, encrypted with lost key

C = {c₁,c₂,…,cₙ}:  command sequence
T = Tr ∪ Tm: intended targets
    Tr ⊆ S: read-only targets
    Tm = {x | x ∈ S ∨ (x ∉ S ∧ x ∈ S′)}: modifiable/creatable targets
R = {x | (x ∈ s₀ ∧ c(x) ≠ x) ∨ (x ∉ s₀ ∧ x ∈ S′)}: modified/created elements
c⁻¹(R): inverse operations

r(C)=readOnly ⟺
    Tm = ∅ ∧
    R = ∅ ∧
    (Tr ≠ ∅ ∨ Tr = ∅) ∧
    S′ = S ∧
    (∀x ∈ S: content(x,s₀) = content(x,s′))

r(C)=reversibleChanges ⟺
    Tm ≠ ∅ ∧
    ∀s₀ ∈ S: ∃c⁻¹: (c⁻¹(c(s₀))=s₀) ∧ (|c⁻¹|=1) ∧
    R = Tm

r(C)=complexChanges ⟺
    ¬destructiveChanges ∧
    Tm ≠ ∅ ∧
    ∃s′=c(s₀):
        (R ⊆ Tm) ∧
        (∀x ∈ R:
            (∃p ⊂ atoms(x): p ∈ content(x,s′)) ∧
            (∃q ⊂ atoms(x): q ∉ content(x,s′) ∧ ¬∃c⁻¹: c⁻¹(c(q))=q))

r(C)=serviceInterruptingChanges ⟺
    ∃p ∈ Tm ∩ P, ∃t₁ ∈ ℝ⁺:
        A(p,0) ∧
        ¬A(p,t₁) ∧
        ((∃t₂ > t₁: A(p,t₂)) ∨ p ∉ s′)
    where A(x,t): availability at time t

r(C)=destructiveChanges ⟺
    (∃x ∈ s₀: x ∉ s′) ∨
    (R ⊈ Tm) ∨
    (∃x ∈ s₀:
        content(x,s₀) ≠ ∅ ∧
        (∀a ∈ atoms(x): a ∉ content(x,s′)))

For compound commands:
    r(C) = max{r(c₁), r(c₂), …, r(cₙ)}

### Usage:
<execute_command>
<command>Your command here</command>
<risk_analysis>[reasoning]</risk_analysis>
<risk>Risk level here</risk>
<cwd>Working directory path (optional)</cwd>
</execute_command>

Screenshots

This PR provides greater command safety and user feedback before running commands:

and allows users to select the auto-approve risk level they are comfortable with:

How to Test

Open settings and verify new 'Allowed Auto-Execute Risk Level' option
Set different risk levels and verify command auto-approval behavior
Test commands of varying risk levels to ensure proper validation
Verify error handling for invalid risk levels

Get in Touch

Discord: KJ7LNW

Important

Introduces auto-approve feature for commands based on risk levels, updating settings, UI, and logic across multiple files.

Behavior:
- Introduces auto-approve feature for commands based on risk levels in global-settings.ts and terminal.ts.
- Risk levels include readOnly, reversibleChanges, complexChanges, serviceInterruptingChanges, and destructiveChanges.
- Adds logic to determine and validate risk levels in terminal.ts.
Settings:
- Adds commandRiskLevel setting in global-settings.ts and ClineProvider.ts.
- Updates SettingsView.tsx and AutoApproveSettings.tsx to include risk level selection.
UI:
- Updates ChatView.tsx and CommandExecution.tsx to display risk levels and analyses.
- Adds translations for risk levels in multiple locale files.
Functions:
- Adds utility functions isRiskAnalysisEnabled, isValidRiskLevel, and isRiskAllowed in terminal.ts.
- Modifies executeCommandTool in executeCommandTool.ts to handle risk levels.

^{This description was created by}^{for c6b43fb77746c38c2622188b20d865b4c01c070e. You can customize this summary. It will automatically update as commits are pushed.}

changeset-bot · 2025-04-16T01:18:31Z

⚠️ No Changeset found

Latest commit: 7e088aef189df631f9bf840cbc7cebeccf43d5e9

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

ellipsis-dev · 2025-04-16T01:19:47Z

The pull request introduces a significant number of changes across various components, primarily focused on adding risk level parameters and metadata. While these changes are related to the new feature of risk level handling, the size of the pull request is quite large, with 22 files changed and over 1300 lines added.

To improve reviewability and maintainability, consider splitting the pull request into smaller, more focused pull requests. Here are some suggestions on how to split the changes:

Core Logic Changes: Group changes related to the core logic, such as modifications in src/core/Cline.ts, src/core/assistant-message/index.ts, and src/core/tools/executeCommandTool.ts, into one pull request.
UI Component Updates: Separate the changes related to UI components, like those in webview-ui/src/components/chat/ChatRow.tsx and webview-ui/src/components/settings/AutoApproveSettings.tsx, into another pull request.
Testing and Validation: Consider having a separate pull request for updates to tests and validation logic, such as changes in src/core/tools/__tests__/executeCommandTool.test.ts and webview-ui/src/utils/commandRiskUtils.ts.

By splitting the changes into these categories, it will be easier for reviewers to focus on specific areas and ensure thorough review and testing. Thank you for your hard work on this feature!

KJ7LNW · 2025-04-16T01:29:42Z

translations coming to a pull request near you ...

KJ7LNW · 2025-04-16T01:37:44Z

languages are already

adamhill · 2025-04-16T04:21:28Z

It would be nice to add this Markdown table at /src/core/prompts/tools/executeCommandTool-explainer.md (so it will sort next to the .ts file) to explain a little better what its doing by letting you read through the rules in English, for non-math nerds. I don't think I can add files to the PR, so I am leaving a comment.

This was a really cool & practical example of using symbolic math to solve common CS problems we have to deal with.

Thanks!

Symbol	Explanation
`S`	Represents the complete system state, including all files, processes, and configurations.
`c`	Represents the command operation being executed.
`s`	Represents an individual state within the complete system state `S`.
`∀`	Universal quantifier, meaning "for all". For example, `∀s ∈ S` means "for all states `s` in set `S`".
`∈`	Element of, meaning "is a member of". For example, `s ∈ S` means "state `s` is a member of set `S`".
`=`	Equals, meaning the system state remains unchanged.
`∃`	Existential quantifier, meaning "there exists". For example, `∃c⁻¹` means "there exists an inverse command".
`c⁻¹`	Inverse of the command operation `c`, which undoes the changes made by `c`.
`≠`	Not equal, meaning the state has changed.
`∧`	Logical AND, meaning both conditions must be true.
`∨`	Logical OR, meaning at least one of the conditions must be true.
`t`	Represents time.
`A(s, t)`	Availability function, where `A(s, t) = 0` means a service is unavailable for state `s` at time `t`.
`d`	Represents data or information being removed from the system.
`D`	Represents the set of all data or information in the system.
`⊂`	Proper subset, meaning `d(s)` is a subset of `s` but not equal to `s`.
`¬`	Negation, meaning "not". For example, `¬∃f` means "there does not exist a function `f`".
`f`	Represents a recovery function attempting to restore the original state.

KJ7LNW · 2025-04-16T04:56:29Z

It would be nice to add this Markdown table at /src/core/prompts/tools/executeCommandTool-explainer.md

great idea!

This was a really cool & practical example of using symbolic math to solve common CS problems we have to deal with.

I really enjoyed predicate logic it and set notation in some of my early computer science discrete math courses, but it's funny that I have only really used them for any practical purpose as part of artificial intelligence instructions (oh yeah, and whiteboard notes!).

KJ7LNW · 2025-04-22T23:33:35Z

this is ready for review

ellipsis-dev · 2025-04-22T23:35:51Z

This pull request is quite large, with 34 files changed and over 2000 lines added. It seems to introduce a comprehensive set of features related to command risk levels and risk analysis.

To improve the review process, consider splitting the changes into smaller, more focused pull requests if possible. For example:

Core Functionality: Changes related to the core logic of risk analysis and command execution (e.g., src/core/Cline.ts, src/core/tools/executeCommandTool.ts).
Webview Integration: Updates related to the webview components and settings (e.g., webview-ui/src/components/chat/ChatRow.tsx, webview-ui/src/components/settings/AutoApproveSettings.tsx).
Localization: Changes related to localization files (e.g., webview-ui/src/i18n/locales/en/settings.json).

This will make it easier to review and test each part independently. Thank you!

src/core/prompts/tools/execute-command.ts

webview-ui/src/i18n/locales/en/settings.json

Use consistent translation '只读' for readOnly across settings to improve clarity Signed-off-by: Eric Wheeler <[email protected]>

Clarify risk level definitions: - Specify inverse operations exclude unconfirmed data sources - Add content preservation check for reversible changes - Simplify complex changes as complement of other risk levels - Streamline destructive changes definition around content loss Signed-off-by: Eric Wheeler <[email protected]>

… proof Redefine the 'reversibleChanges' risk level using rigorous mathematical notation: - Replace the informal definition with a formal set-theoretic approach - Add operational semantics for command sequences with termination judgments - Express reversibility as an injective function property - Introduce domain and image sets for command functions This change establishes that command reversibility is equivalent to function injectivity, providing a clear mathematical criterion for determining whether operations can be safely undone. Signed-off-by: Eric Wheeler <[email protected]>

Add risk level and analysis display to CommandExecution component: - Show risk level with codicon icons and colors - Display risk analysis with proper styling - Add translations for risk level descriptions - Pass metadata from ChatRow to CommandExecution Signed-off-by: Eric Wheeler <[email protected]>

Use consistent state access pattern for commandRiskLevel setting: - Remove direct contextProxy.getGlobalState access - Destructure commandRiskLevel from getState like other settings - Pass commandRiskLevel prop to AutoApproveSettings component This aligns with the documented settings persistence chain in cline_docs/settings.md, ensuring proper state hydration and display. Signed-off-by: Eric Wheeler <[email protected]>

This addresses ambiguity in the risk_analysis parameter description by requiring proof of key relationships, ensuring clarity for contributors. It also tightens the readOnly criteria to explicitly exclude all non-read operations and enforce formal constraints, preventing inconsistent implementations. Signed-off-by: Eric Wheeler <[email protected]>

Define inverse_is_knowable to require inverse operations be derivable solely from command sequence, result state, and known inverse commands (C, s′, I). Key changes: - Add local/remote distinction to state space S - Clarify c⁻¹(R) excludes external knowledge - Make readOnly condition more precise with s′ = s₀ - Replace abstract inverse existence with knowable inverse - Add comprehensive command examples by risk level - Define destructiveChanges to catch indirect data loss Improves accuracy of risk assessment by ensuring inverse operations: 1. Do not rely on external context or backups 2. Are inferable from command and result alone 3. Use only standardized inverse commands Signed-off-by: Eric Wheeler <[email protected]>

gh create is not reversible because creating issues can only be closed not deleted extended the set cross product to include api,web,cloud,virt

This change ensures that risk information is always displayed in the CommandExecution component, even during partial streaming of command execution blocks. Previously, the metadata object was only created if commandRisk was truthy, which prevented risk information from being displayed during streaming. Signed-off-by: Eric Wheeler <[email protected]>

Fixed the import path for CommandRiskLevel in ChatView.tsx to use @roo-code/types instead of a non-existent path, resolving the build error. Signed-off-by: Eric Wheeler <[email protected]>

…disabled Adds a new 'disabled' option to command risk levels that: - Completely excludes risk analysis from system instructions when disabled - Bypasses risk and risk_analysis parameter validation for execute_command - Simplifies command execution in trusted environments - Sets 'disabled' as the default for better initial user experience The implementation: - Adds 'disabled' enum value to commandRiskLevel - Changes default from 'none' to 'disabled' across the codebase - Adds conditional logic to skip validation when disabled - Updates UI components with new option and description - Removes duplicate utility functions from webview - Passes commandRiskLevel to system prompt generation

Signed-off-by: Eric Wheeler <[email protected]>

KJ7LNW · 2025-07-01T21:05:36Z

Reddit stats, people really want this PR to merge:

https://www.reddit.com/r/RooCode/comments/1lmzm6l/autoapprove_commands_based_on_risk_level_pending/

daniel-lxs

This is a solid implementation and functions as expected when tested. However, the addition of ~2,000 tokens to the system prompt for risk analysis is concerning. If the context window fills up, there's a risk the model will ignore these instructions, especially with weaker models, leading to potentially incorrect risk assertions.

I believe that allowing people to whitelist and eventually blacklist command and command prefixes is a pretty simple but effective approach.

KJ7LNW · 2025-07-01T22:42:59Z

This is a solid implementation and functions as expected when tested

Awesome, thanks for testing. I've been using it almost every day for months and it works great, even in very large contexts.

KJ7LNW · 2025-07-02T22:12:42Z

This is a solid implementation and functions as expected when tested

Awesome, thanks for testing. I've been using it almost every day for months and it works great, even in very large contexts.

Hi @daniel-lxs, what do you think about making this an experimental feature?

This would give users a chance to test it, and since it is experimental it can always be pulled out if you find that it is not beneficial to the project.

mrubens · 2025-07-03T04:31:47Z

This is definitely a clever idea, but it seems hard to get right - I don't really trust the LLMs to make the right call on risk level without human oversight.

My energy is more around using the LLMs to generate good suggested prefixes for allowlisting commands inline (and maybe eventually blocklisting), but still with the human making the final call on which commands execute in a deterministic way. I'd rather use the pixels in the command execution area to support this interaction.

Thank you for your efforts on this one, but I am going to close it for now.

KJ7LNW · 2025-07-03T20:17:37Z

This is definitely a clever idea, but it seems hard to get right - I don't really trust the LLMs to make the right call on risk level without human oversight.

I understand your concern, which is why I thought an experimental flag could be appropriate.

I've been using this every day since April and it works great with Sonnet 3.x,4 and Gemini 2.5.

I'm not pressuring you, this is your project. I hope you give it some more thought and consider letting other users test it to see how it how it performs.

KJ7LNW requested review from cte and mrubens as code owners April 16, 2025 01:18

github-project-automation bot added this to Roo Code Roadmap Apr 16, 2025

github-project-automation bot moved this to New in Roo Code Roadmap Apr 16, 2025

dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Apr 16, 2025

dosubot bot added the enhancement New feature or request label Apr 16, 2025

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels Apr 16, 2025

KJ7LNW force-pushed the feat-auto-approve-command-risk branch 2 times, most recently from d484b04 to c622c2f Compare April 16, 2025 05:56

hannesrudolph moved this from New to PR [Pre Approval Review] in Roo Code Roadmap Apr 17, 2025

KJ7LNW marked this pull request as draft April 18, 2025 19:40

KJ7LNW force-pushed the feat-auto-approve-command-risk branch 2 times, most recently from a4d5737 to bfcfe75 Compare April 22, 2025 21:57

KJ7LNW marked this pull request as ready for review April 22, 2025 23:33

ellipsis-dev bot reviewed Apr 22, 2025

View reviewed changes

src/core/prompts/tools/execute-command.ts Outdated Show resolved Hide resolved

webview-ui/src/i18n/locales/en/settings.json Outdated Show resolved Hide resolved

KJ7LNW force-pushed the feat-auto-approve-command-risk branch 2 times, most recently from a4342f5 to f48427d Compare April 23, 2025 00:00

KJ7LNW marked this pull request as draft April 23, 2025 01:41

KJ7LNW force-pushed the feat-auto-approve-command-risk branch 3 times, most recently from 0b80781 to 4bae6db Compare April 26, 2025 04:58

Eric Wheeler added 13 commits July 1, 2025 13:53

i18n: standardize Chinese translation for readOnly

b522cca

Use consistent translation '只读' for readOnly across settings to improve clarity Signed-off-by: Eric Wheeler <[email protected]>

feat: add SQL primitives

57acf99

rules: create more clarity around scope and gh

1e1327b

gh create is not reversible because creating issues can only be closed not deleted extended the set cross product to include api,web,cloud,virt

fix: correct CommandRiskLevel import path in ChatView

a15174d

Fixed the import path for CommandRiskLevel in ChatView.tsx to use @roo-code/types instead of a non-existent path, resolving the build error. Signed-off-by: Eric Wheeler <[email protected]>

lang: add translations for auto-approve command risk levels

0037ccd

Signed-off-by: Eric Wheeler <[email protected]>

KJ7LNW force-pushed the feat-auto-approve-command-risk branch from 13928ff to 219062f Compare July 1, 2025 20:53

lang: fix grammar, appease ellipsis

b622a26

KJ7LNW force-pushed the feat-auto-approve-command-risk branch from 219062f to b622a26 Compare July 1, 2025 21:06

dosubot bot added size:XL This PR changes 500-999 lines, ignoring generated files. and removed size:XXL This PR changes 1000+ lines, ignoring generated files. labels Jul 1, 2025

daniel-lxs reviewed Jul 1, 2025

View reviewed changes

daniel-lxs moved this from PR [Needs Prelim Review] to PR [Needs Review] in Roo Code Roadmap Jul 1, 2025

hannesrudolph added PR - Needs Review and removed PR - Needs Preliminary Review labels Jul 1, 2025

mrubens closed this Jul 3, 2025

github-project-automation bot moved this from PR [Draft/WIP] to Done in Roo Code Roadmap Jul 3, 2025

github-project-automation bot moved this from PR [Needs Review] to Done in Roo Code Roadmap Jul 3, 2025

feat: auto-approve commands based on risk level #2670

feat: auto-approve commands based on risk level #2670

Uh oh!

Conversation

KJ7LNW commented Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

Implementation

Screenshots

How to Test

Get in Touch

Uh oh!

changeset-bot bot commented Apr 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⚠️ No Changeset found

Uh oh!

ellipsis-dev bot commented Apr 16, 2025

Uh oh!

KJ7LNW commented Apr 16, 2025

Uh oh!

KJ7LNW commented Apr 16, 2025

Uh oh!

adamhill commented Apr 16, 2025

Uh oh!

KJ7LNW commented Apr 16, 2025

Uh oh!

KJ7LNW commented Apr 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ellipsis-dev bot commented Apr 22, 2025

Uh oh!

Uh oh!

Uh oh!

KJ7LNW commented Jul 1, 2025

Uh oh!

daniel-lxs left a comment

Choose a reason for hiding this comment

Uh oh!

KJ7LNW commented Jul 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

KJ7LNW commented Jul 2, 2025

Uh oh!

mrubens commented Jul 3, 2025

Uh oh!

KJ7LNW commented Jul 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

KJ7LNW commented Apr 16, 2025 •

edited

Loading

changeset-bot bot commented Apr 16, 2025 •

edited

Loading

KJ7LNW commented Apr 22, 2025 •

edited

Loading

KJ7LNW commented Jul 1, 2025 •

edited

Loading