Commit b70b7a7

github-actions[bot] and Copilot authored
docs: sync smoke-claude toolset and max-turns after token optimization (#1641)
Update docs/test-analysis/ci-smoke.md to reflect changes from commit ac27bc7 (feat(smoke-claude): trim unused tools to reduce token spend ~8%):
- max-turns 15 → 8 (cost-runaway protection)
- Remove cache-memory tool (was removed from workflow)
- Remove edit tool (was removed from workflow)
- Remove MCP container row (no longer in workflow config)
- Update 'What it tests' description to drop MCP server mention

Co-authored-by: Copilot <copilot@github.com>
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
1 parent 07b4e4e commit b70b7a7

1 file changed: +3 -4 lines changed

docs/test-analysis/ci-smoke.md

Lines changed: 3 additions & 4 deletions
@@ -163,14 +163,13 @@ These are gh-aw agentic workflows compiled from `.md` source files into `.lock.y
 
 | Attribute | Value |
 |-----------|-------|
-| **What it tests** | Claude Code engine running inside AWF sandbox with MCP servers, Playwright, file I/O, and bash tools |
-| **Engine** | `claude` (max 15 turns) |
+| **What it tests** | Claude Code engine running inside AWF sandbox with GitHub API, Playwright, file I/O, and bash tools |
+| **Engine** | `claude` (max 8 turns) |
 | **Triggers** | Every 12h (schedule), PR (opened/synchronize/reopened), manual dispatch |
 | **Timeout** | 10 minutes |
 | **Network allowed** | defaults, github, playwright |
-| **Tools** | cache-memory, github (repos, pull_requests), playwright, edit, bash |
+| **Tools** | github (repos, pull_requests), playwright, bash |
 | **Safe outputs** | add-comment (hide older), add-labels (smoke-claude) |
-| **MCP** | ghcr.io/github/gh-aw-mcpg container |
 | **Real-world mapping** | Validates that Claude Code can operate within AWF's network sandbox: GitHub API access via MCP, browser automation via Playwright, local file operations — the core use case for agentic workflows |
 | **Gaps** | Non-deterministic (AI agent may behave differently). No HTTPS blocking verification. |
 | **Integration test relationship** | High-level end-to-end complement. Integration tests verify AWF mechanics (iptables, proxy); this verifies an actual AI agent works through the firewall. |
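
Since this commit exists only to bring the documented table back in line with the workflow, the Engine and Tools rows could in principle be checked mechanically. The following is a minimal sketch, assuming the table keeps its two-column `| **Attribute** | Value |` layout; `parse_attribute_table`, `parse_tools`, `parse_max_turns`, and the `SAMPLE` text are hypothetical names for illustration and are not part of the repository.

```python
import re
from typing import Dict, List, Optional


def parse_attribute_table(markdown: str) -> Dict[str, str]:
    """Map each bolded attribute name in a two-column table to its value cell."""
    rows: Dict[str, str] = {}
    for line in markdown.splitlines():
        # Match rows shaped like: | **Name** | value |
        match = re.match(r"^\|\s*\*\*(.+?)\*\*\s*\|\s*(.*?)\s*\|\s*$", line)
        if match:
            rows[match.group(1)] = match.group(2)
    return rows


def parse_tools(value: str) -> List[str]:
    """Return tool names from the Tools cell, dropping parenthesised details."""
    without_details = re.sub(r"\s*\([^)]*\)", "", value)
    return [tool.strip() for tool in without_details.split(",") if tool.strip()]


def parse_max_turns(value: str) -> Optional[int]:
    """Extract N from a cell containing 'max N turns', or None if absent."""
    match = re.search(r"max (\d+) turns", value)
    return int(match.group(1)) if match else None


# Hypothetical excerpt mirroring the rows added by this commit.
SAMPLE = """\
| Attribute | Value |
|-----------|-------|
| **Engine** | `claude` (max 8 turns) |
| **Tools** | github (repos, pull_requests), playwright, bash |
"""

table = parse_attribute_table(SAMPLE)
print(parse_tools(table["Tools"]))
print(parse_max_turns(table["Engine"]))
```

A check like this would compare the parsed values against whatever the workflow source declares; how that source is read is left out here because its exact format is not shown in this commit.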
