Merged
2 changes: 2 additions & 0 deletions .gitignore
@@ -1,3 +1,5 @@
node_modules/
bun.lock
.DS_Store
/screengrabs/
tmp/
Binary file added docs/pr-screenshots-demo/capture-demo.png
Binary file added docs/pr-screenshots-demo/raw-capture.png
Binary file added docs/pr-screenshots-demo/stitch-demo.png
384 changes: 384 additions & 0 deletions plugins/eng/skills/debug/SKILL.md


314 changes: 314 additions & 0 deletions plugins/eng/skills/debug/references/tool-patterns.md
@@ -0,0 +1,314 @@
Use when: Phase 3 (Investigate) or Phase 4 (Fix & Verify) needs specific tool sequences.
Priority: P0
Impact: Without concrete tool patterns, the agent uses ad-hoc tool sequences and misses efficient investigation shortcuts.

# Tool Patterns for Debugging

Specific sequences of tool calls for common debugging scenarios. Each pattern describes WHEN to use it, the exact sequence, and how to interpret results.

---

## §1 Tracing Data Flow Backward from an Error

**When:** You have an error at a specific location and need to find where the bad data originated. This is the most common investigation pattern — use it whenever the bug is a wrong value, wrong type, null/undefined, or unexpected state.

```
SEQUENCE:

1. Read the file at the error location (30-line context)
Tool: Read(file_path, offset=<error_line - 15>, limit=30)
-> Identify the variable/expression with the wrong value

2. Grep for where that variable is assigned:
Tool: Grep(pattern="variableName\\s*=", path="src/", glob="*.ts")
Tool: Grep(pattern="variableName\\s*:", path="src/", glob="*.ts") # object properties
Use output_mode="content" with context lines to see surrounding code

3. For each assignment found, Read that location to understand the source

4. If the value comes from a FUNCTION CALL:
Grep for the function definition:
Tool: Grep(pattern="function functionName|def functionName|functionName\\s*=", path="src/")
Read the function body, find its return statements
-> The return value IS the bad data's source

5. If the value comes from an IMPORT:
Grep for the export in the source module
Read the exported definition

6. If the value comes from a PARAMETER:
Grep for callers of this function:
Tool: Grep(pattern="functionName\\(", path="src/", glob="*.ts")
Read each caller to see what they pass

7. REPEAT until you find the origin of the bad data
TERMINATION: If you've traced through 5+ assignments without finding
the origin, return to Phase 3 and form a hypothesis about where the
value diverges — you have enough context to hypothesize.
```

**Key pattern:** Search for DEFINITIONS (where a value is produced), not usages (where it's consumed). When tracing backward, you want producers, not consumers.

**Optimization:** Use Grep tool parameters to reduce noise:
- `glob="*.py"` for Python only
- `glob="*.{ts,tsx}"` for TypeScript
- Use `output_mode="files_with_matches"` for a first pass, then `output_mode="content"` on matches
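The producer-vs-consumer distinction in step 2 can be sketched with a small filter. This is a hedged illustration only — the source lines and the `userId` variable are invented — but the regex mirrors the two Grep patterns above:

```python
import re

# Hypothetical source lines: keep producers of `userId` (assignments and
# object properties) and drop consumers (plain usages).
lines = [
    "const userId = session.uid;",   # assignment -> producer
    "userId: payload.id,",           # object property -> producer
    "render(userId);",               # usage -> consumer
]

# Mirrors step 2's Grep patterns: `userId\s*=` and `userId\s*:` combined.
producer = re.compile(r"userId\s*[=:]")
producers = [line for line in lines if producer.search(line)]
# Only the first two lines survive; the consumer is filtered out.
```

The same shape works as a mental filter when scanning Grep output by hand: an `=` or `:` immediately after the name marks a producer.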

---

## §2 Strategic Git for Debugging

**When:** Something broke and you need to understand what changed and when. Choose the right git command for your specific question.

```
COMMAND BY QUESTION:

"When did this specific line change?"
git blame -L <start>,<end> <file>
-> Shows commit hash, author, date for each line
-> Then: git show <commit_hash> for full context

"What changed in this file recently?"
git log --oneline -10 -- <file>
-> Last 10 commits touching this file
git log -p -1 -- <file>
-> Most recent diff for this file

"What changed between working and broken states?"
git diff <good_commit>..<bad_commit> -- <file_or_directory>
git diff <good_commit>..<bad_commit> --stat
-> Summary of which files changed and how much

"Who changed the function that's breaking?"
git log -p -S "function_name" -- <file>
-> Commits where the count of "function_name" changed (pickaxe)
-> -S matches commits that change the number of occurrences of the literal string; -G matches commits whose diff contains lines matching a regex

"Find the exact commit that introduced the bug"
git bisect start
git bisect bad HEAD
git bisect good <known_good>
git bisect run <test_script.sh>
-> Automated binary search through history
git bisect reset # when done

"What was this code before the recent change?"
git show <commit>:<file>
-> File contents at a specific commit
-> Compare with current to see what changed

"What recent changes might be relevant?"
git log --oneline --since="3 days ago" -- <directory>
-> Scoped to recent changes in a specific area
```

**Selection heuristic:**

| Question | Command |
|---|---|
| Who/when changed this line | `git blame` |
| When was this function added/removed | `git log -S` |
| Which commit broke this behavior | `git bisect` |
| What changed between two states | `git diff` |
| What does the old version look like | `git show <commit>:<file>` |

---

## §3 Stack Trace Parsing

**When:** You have a stack trace and need to extract actionable information. Use this as a systematic parsing procedure, not as something to skim.

```
PARSING PROCEDURE:

1. FIND THE ERROR TYPE AND MESSAGE
- Usually first or last line of the trace
- This tells you WHAT went wrong
- Example: "TypeError: Cannot read properties of undefined (reading 'id')"
-> Something is undefined when we expected an object with .id

2. EXTRACT FRAMES (each line with file:line info)
- Separate into: YOUR code vs LIBRARY/FRAMEWORK code
- Your code: files in src/, app/, lib/ (not node_modules, vendor, stdlib)
- Note: Python tracebacks show most recent call LAST (bottom).
JavaScript/Java show most recent call FIRST (top).
Adjust frame reading order accordingly.

3. IDENTIFY THE PRIMARY FRAME
- First frame in YOUR code (closest to the error)
- This is where to start investigation

4. IDENTIFY THE CALLER FRAME
- The frame that called into the primary frame
- This tells you what data was passed in

5. FOR CHAINED EXCEPTIONS
("Caused by:", "__cause__", "The above exception was the direct cause")
- Start from the INNERMOST cause
- That's usually the real error; outer exceptions are reactions

THEN:
READ the primary frame file (line +/- 20 lines)
READ the caller frame file (line +/- 10 lines)
If bug isn't obvious:
GREP for the function name to find all callers
READ each caller to understand what data they pass
```

**Frames to skip** (almost never the bug's location):

- Framework middleware / pipeline frames
- Event loop / scheduler frames
- Serialization / deserialization wrappers
- Logging / monitoring interceptors
- Test runner infrastructure frames

**Focus on frames where YOUR code makes DECISIONS about data.**
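The frame-extraction steps above can be sketched for a Python-style traceback. Paths and function names here are invented, and note the ordering caveat from step 2: Python lists the most recent call last, so the primary frame is the last of your frames, not the first:

```python
import re

FRAME_RE = re.compile(r'File "([^"]+)", line (\d+), in (\w+)')
LIBRARY_MARKERS = ("site-packages", "node_modules", "/usr/lib/")

def classify_frames(traceback_text):
    """Step 2: split frames into your code vs library code."""
    frames = []
    for path, line, func in FRAME_RE.findall(traceback_text):
        kind = "library" if any(m in path for m in LIBRARY_MARKERS) else "yours"
        frames.append((path, int(line), func, kind))
    return frames

tb = '''Traceback (most recent call last):
  File "src/api/client.py", line 42, in parse_response
  File "/usr/lib/python3.11/json/decoder.py", line 355, in raw_decode
'''
frames = classify_frames(tb)
# Python: most recent call LAST, so the primary frame (step 3) is the
# last frame belonging to your code.
primary = [f for f in frames if f[3] == "yours"][-1]
```

Here `primary` points at `src/api/client.py:42` even though the exception surfaced inside the stdlib JSON decoder — which is exactly the "first frame in YOUR code" rule.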

---

## §4 Strategic Diagnostic Logging

**When:** You need to understand runtime behavior and can't use an interactive debugger. This is your primary investigation tool for most agent debugging scenarios.

```
WHERE TO INSERT LOGS (priority order):

1. FUNCTION ENTRY POINTS with parameters
Log what data the function receives
"f entered with: x={x}, y={y}"

2. DECISION POINTS (if/else, switch, pattern match)
Log which branch was taken and why
"Taking branch A because condition={value}"

3. DATA TRANSFORMATION OUTPUTS
Log intermediate results after each transform
"After transform: data={repr(data)}"

4. EXTERNAL CALL BOUNDARIES
Log before and after: API calls, DB queries, file I/O
"Calling API: {request}; got: {status}, {body[:200]}"

5. LOOP SUMMARIES (not every iteration)
Log: count, first item, last item, any items failing sanity check
Don't log every iteration of a 10,000-item loop
```
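Placements 1-3 can be sketched on a made-up function (`apply_discount` and its coupon shape are hypothetical, not from any real codebase):

```python
import logging

logging.basicConfig(level=logging.DEBUG, format="%(message)s")
log = logging.getLogger("diag")

def apply_discount(order_total, coupon):
    log.debug("apply_discount entered: order_total=%r, coupon=%r",
              order_total, coupon)                        # 1. entry point
    if coupon is None:
        log.debug("Taking no-coupon branch")              # 2. decision point
        return order_total
    discounted = order_total * (1 - coupon["rate"])
    log.debug("After discount transform: %r", discounted) # 3. transform output
    return discounted
```

Note the logs sit at the boundaries (entry, branch, transform result), not interleaved with the arithmetic itself.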

**What to capture in each log:**

- Variable values — use `repr()` (Python) or `util.inspect` (Node) to expose types and special characters
- Collection sizes — `len(list)`, object key count
- Type information — `type(x).__name__`, `typeof x`
- Timestamps — for performance/timing issues
- Thread/process IDs — for concurrency issues

**How to interpret logs:**

1. Compare logged values against expected values at each point
2. Find the FIRST point where actual diverges from expected
3. That divergence point is your investigation target
4. Watch for: null/undefined appearing, types changing unexpectedly, collections being empty, values being 0 or NaN
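Once the logs are collected, finding the first divergence point is mechanical. A sketch with illustrative values:

```python
# (checkpoint, value) pairs from logs vs what you expected at each point.
expected = [("entry", 100.0), ("branch", "discount"), ("transform", 75.0)]
actual   = [("entry", 100.0), ("branch", "discount"), ("transform", None)]

# Step 2: the FIRST index where actual diverges from expected.
divergence = next(
    (i for i, (e, a) in enumerate(zip(expected, actual)) if e != a),
    None,
)
# divergence == 2 -> the transform step is the investigation target.
```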

**Logging heuristic:** Place logs at BOUNDARIES, not in the middle of logic. Boundaries are: function entry/exit, loop start/end, conditional branch points, external calls. Maximum information, minimum noise.

**Cleanup:** Always remove diagnostic logging after the bug is fixed. It's temporary investigation infrastructure, not permanent code.

---

## §5 Searching for Similar Patterns / Bugs

**When:** You found a bug and want to check if the same pattern exists elsewhere. Also useful in Phase 5 (Harden) to prevent recurrence.

```
SEQUENCE:

1. ABSTRACT THE BUG PATTERN
Don't search for the exact code; search for the pattern
Example: Bug is "using .length on potentially null array"
-> Pattern: something that could be null followed by .length

2. GREP FOR THE PATTERN
Tool: Grep(pattern="\\.length", path="src/", glob="*.ts", output_mode="content", -B=2)
Then filter results for cases where the object might be null

3. For STRUCTURAL PATTERNS, search for the anti-pattern shape:
Example: missing null check before property access
Tool: Grep(pattern="response\\.\\w+", path="src/", glob="*.ts", output_mode="content", -B=3)
Then: are any of these missing a null check above them?

4. SEARCH for the same FUNCTION being called elsewhere:
Tool: Grep(pattern="brokenFunction\\(", path="src/", glob="*.ts")
Each call site might have the same bug

5. SEARCH for similar VARIABLE NAMES (same data, different location):
Tool: Grep(pattern="userData|user_data|userInfo", path="src/", glob="*.{ts,py}")

6. Use git log to find similar PAST FIXES:
git log --all --oneline --grep="null check"
git log --all --oneline --grep="TypeError"
Past fixes for similar issues reveal other vulnerable spots
```

**Efficiency tips:**

- Use Glob to find relevant files first, then Grep within them
- Use `output_mode="files_with_matches"` for a first pass, then `output_mode="content"` on matches
- Search test files too — test assertions reveal expected behavior
- When you find a pattern, use `output_mode="count"` to estimate the scope of the problem
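Step 3's structural filter can be approximated in a few lines. The `response` lines below are invented; the point is that optional chaining breaks the literal `response.` substring, so guarded accesses drop out of the match set:

```python
import re

source = [
    "send(response.status);",   # unguarded property access
    "return response?.data;",   # optional chaining -> already guarded
    "const n = items.length;",  # different object, out of scope
]

# Anti-pattern shape: a plain `response.<prop>` access.
unguarded = [line for line in source if re.search(r"response\.\w+", line)]
```

A crude filter like this over-matches in real code (comments, strings), but it shrinks hundreds of Grep hits down to a reviewable shortlist.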

---

## §6 Targeted Fix Verification

**When:** You've made a fix and want to verify it works before running the full suite. This is the verification sequence for Phase 4.

```
VERIFICATION SEQUENCE:

1. RUN THE SPECIFIC FAILING TEST
pytest path/to/test_file.py::TestClass::test_method # Python
npx jest path/to/test.spec.ts -t "test name" # JavaScript
go test -run TestName ./package/... # Go
cargo test test_name # Rust
mvn test -pl module -Dtest=TestClass#testMethod # Java

2. RUN RELATED TESTS IN THE SAME FILE/MODULE
pytest path/to/test_file.py
npx jest path/to/test.spec.ts

3. RUN TESTS FOR THE MODIFIED MODULE
Find what tests import the modified code:
grep -rl "from.*modified_module\|import.*modified_module" tests/
Run those test files

4. TYPE CHECK (if applicable)
npx tsc --noEmit # TypeScript
mypy src/modified_file.py # Python
cargo check # Rust

5. LINT CHECK
npx eslint src/modified_file.ts # JS/TS
ruff check src/modified_file.py # Python
golangci-lint run ./package/... # Go

6. SMOKE TEST
Can the application start?
Does the specific user-facing behavior work?
```
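Step 3's importer search can be sketched as follows — file paths and the `pricing` module name are hypothetical, and in practice the sources would come from the files `grep -rl` returned:

```python
import re

# Contents of test files, keyed by path.
test_sources = {
    "tests/test_pricing.py": "from app.pricing import apply_discount",
    "tests/test_auth.py": "from app.auth import login",
}

modified = "pricing"
importer = re.compile(rf"(from|import)\s+\S*{modified}")
related = [path for path, src in test_sources.items() if importer.search(src)]
# related -> only the test file that imports the modified module.
```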

**Order rationale:**

- Steps 1-2: Verify the fix works (seconds)
- Step 3: Verify no regressions in related code (seconds to minutes)
- Steps 4-5: Catch type/style issues the fix might introduce (seconds)
- Step 6: Final sanity check before committing

**A fix is ready when:**

1. The originally failing test passes
2. All tests in the same file/module pass
3. Type checker and linter are clean
4. You can explain WHY the fix works (not just that it does)

If you can't explain why it works, you don't understand the root cause. Go back to Phase 3.