You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
`v0.4.6` is a smaller quality release focused on operator discipline and agent behavior rather than new product surface area.
4
+
5
+
It strengthens the built-in role presets with higher-signal planning, review, debugging, and implementation rules, while keeping the core prompt stack compact and predictable.
6
+
7
+
## Highlights
8
+
9
+
## 1) Built-in role presets are more rigorous
10
+
11
+
This release upgrades the first-wave built-in role presets with stronger execution discipline harvested from high-quality external prompt patterns.
12
+
13
+
Key improvements:
14
+
15
+
- planners now push harder on existing-code leverage, minimum viable scope, and complete-vs-shortcut tradeoffs
16
+
- implementers are pushed more explicitly to finish the real blast radius in one pass, including nearby tests, docs, and obvious edge-case handling
17
+
- reviewers are stricter about reading the full diff first and checking whether docs, diagrams, or adjacent tests went stale
18
+
- debuggers are more firmly rooted in regression checks, hypothesis confirmation, and regression-test follow-through
19
+
- explorers are better at separating true canonical anchor files from nearby noise
20
+
21
+
## 2) The startup prompt stays lean, but a bit smarter
22
+
23
+
`v0.4.6` keeps the startup preamble compact while sharpening one important default:
24
+
25
+
- actors are nudged to reuse working paths first instead of inventing parallel ones
26
+
27
+
This is intentionally a small change. The goal is to improve default judgment without making the startup prompt bloated again.
28
+
29
+
## 3) Built-in help now has a safer growth budget
30
+
31
+
The built-in help surface remains compact, but the enforced ceiling is relaxed from `1300` to `1500` words.
32
+
33
+
Why this matters:
34
+
35
+
- it preserves a hard anti-bloat boundary
36
+
- it leaves room for a few more genuinely universal rules in future iterations
37
+
- it avoids treating the previous `1300` test limit as a product law when it was really just a local compactness guardrail
38
+
39
+
## 4) Release readiness stayed clean
40
+
41
+
This version is intentionally small in scope, and the main goal was to land the quality improvements without reopening broader workflow or UI churn.
42
+
43
+
Validation included:
44
+
45
+
- Python test suite passing cleanly
46
+
- targeted prompt-default tests passing after the compactness-budget change
47
+
- Web lint and type-check remaining clean alongside the role-preset text updates
0 commit comments