fix(tools): adversarial policy exempt list and /status display by bug-ops · Pull Request #2471 · bug-ops/zeph

bug-ops · 2026-03-30T21:07:47Z

Summary

bug(tools): adversarial policy falsely denies internal agent operations (memory_save, write) #2469: Add exempt_tools field to AdversarialPolicyConfig with a default list of internal agent operations (memory_save, memory_search, read_overflow, load_skill, schedule_deferred). The gate now skips LLM validation for these tools, preventing false denials from policies like "Do not write files".
enh(tools): adversarial policy gate not shown in /status output #2467: /status now shows adversarial gate state (provider, policy_count, fail_open) when [tools.adversarial_policy] enabled = true.

Changes

crates/zeph-tools/src/config.rs: exempt_tools: Vec<String> field with default_exempt_tools() returning 5 internal tool names
crates/zeph-tools/src/adversarial_policy.rs: PolicyValidator::new() accepts exempt_tools; early-return Allow at start of validate() for exempt tools
crates/zeph-core/src/agent/state/mod.rs: AdversarialPolicyInfo struct + field on RuntimeConfig (behind policy-enforcer feature)
crates/zeph-core/src/agent/builder.rs: with_adversarial_policy_info() builder method
crates/zeph-core/src/agent/mod.rs: Adv gate: line in handle_status_command()
src/runner.rs: pass exempt_tools to validator; populate and apply AdversarialPolicyInfo

Test plan

7408 tests pass (cargo nextest run --workspace --features full --lib --bins)
cargo +nightly fmt --check passes
cargo clippy --workspace --features full -- -D warnings passes
Live: policy "Do not write files" no longer blocks memory_save
Live: /status shows Adv gate: line when adversarial_policy is enabled

Add exempt_tools to AdversarialPolicyConfig with a default list of internal agent operations (memory_save, memory_search, read_overflow, load_skill, schedule_deferred). The gate skips LLM validation for these tools, preventing false denials from policies like "Do not write files". Add AdversarialPolicyInfo to RuntimeConfig and a with_adversarial_policy_info() builder method. /status now shows adversarial gate state (provider, policy count, fail_open) when the feature is enabled. Closes #2469, #2467.

github-actions bot added documentation Improvements or additions to documentation rust Rust code changes core zeph-core crate bug Something isn't working size/M Medium PR (51-200 lines) labels Mar 30, 2026

This was linked to issues Mar 30, 2026

enh(tools): adversarial policy gate not shown in /status output #2467

Closed

bug(tools): adversarial policy falsely denies internal agent operations (memory_save, write) #2469

Closed

bug-ops enabled auto-merge (squash) March 30, 2026 21:08

bug-ops merged commit ee95a1d into main Mar 30, 2026
27 checks passed

bug-ops deleted the 2469-adversarial-policy-exempt branch March 30, 2026 21:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(tools): adversarial policy exempt list and /status display#2471

fix(tools): adversarial policy exempt list and /status display#2471
bug-ops merged 1 commit intomainfrom
2469-adversarial-policy-exempt

bug-ops commented Mar 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bug-ops commented Mar 30, 2026

Summary

Changes

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant