[PERF] .claude/rules/ consumes 34% of context window on session start (68k/200k tokens)

## Summary

The `.claude/rules/` directory contains 16 markdown files totaling **288KB raw / ~68,100 tokens**. These are automatically loaded as memory files at session start, consuming **34.1% of the 200k token context window** before any user interaction.

Combined with system prompt, tools, skills metadata, and autocompact buffer, **over 61% of context is consumed before the first user message**. Only ~71k tokens (35.6%) remain for actual work.

## Evidence

Context breakdown observed via `/context` command on a fresh session (Claude Opus 4.6):

| Category | Tokens | % of 200k |
|----------|--------|-----------|
| System prompt | 3,500 | 1.7% |
| System tools | 15,900 | 7.9% |
| **Memory files (rules)** | **68,100** | **34.1%** |
| Skills metadata | 1,800 | 0.9% |
| Autocompact buffer | 33,000 | 16.5% |
| **Free space** | **71,000** | **35.6%** |

### Top offenders by size

| File | Raw bytes | Est. tokens |
|------|-----------|-------------|
| agent-cognition.md | 64,707 | ~16,200 |
| agent-integrity.md | 34,403 | ~8,600 |
| RULE-GROUP-4.md | 32,952 | ~8,200 |
| ANTHROPIC-STANDARDS.md | 27,242 | ~6,800 |
| RULE-GROUP-6.md | 23,618 | ~5,900 |
| RULE-GROUP-5.md | 23,740 | ~5,900 |
| All 16 files total | 288,650 | ~68,100 |

## Architectural Contradiction

`CLAUDE-LITE.md` documents a lazy loading strategy where rules are loaded on-demand via keyword matching (`skill_router.py`). However, all 16 rule files sit in `.claude/rules/` which Claude Code loads **eagerly** at session start. The lazy loading intent is defeated by the file placement.

Additionally, `CLAUDE-LITE.md` itself (~9.6KB) is loaded *alongside* all the full rule files, not *instead of* them.

## Impact

- **Reduced effective context:** Only 35.6% available for actual work
- **Earlier autocompaction:** Conversations lose history sooner
- **Degraded multi-step tasks:** Less room for tool results, code reading, agent operations
- **Cost:** ~$1.02 per session start in input tokens (at Opus rates), repeated every API round-trip

## Recommendations

1. **R1 (HIGH):** Move lazy-loaded rules out of `.claude/rules/` to a directory not auto-loaded (e.g., `core/protocols/`). Keep only essential always-needed rules. Target: <15k tokens.
2. **R2 (MEDIUM):** Consolidate and compress — remove redundant ASCII art/box-drawing, merge RULE-GROUP 1-6, remove agent-cognition.md and agent-integrity.md from auto-load.
3. **R3 (MEDIUM):** Make `skill_router.py` inject rule content via hook output only when keywords match, implementing true lazy loading.
4. **R4 (LOW):** Set a token budget policy for `.claude/rules/` (e.g., max 15k tokens total, max 3k per file).

## Framework Mapping

| Framework | ID | Relevance |
|-----------|----|-----------|
| OWASP LLM Top 10 | LLM10 (Unbounded Consumption) | Self-inflicted resource exhaustion |
| MITRE ATLAS | AML.T0054 (LLM Resource Exhaustion) | Internal config causing token waste |

---

*Reported by riaworks security audit | 2026-02-28 | Method: Runtime observation*

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PERF] .claude/rules/ consumes 34% of context window on session start (68k/200k tokens) #2

Summary

Evidence

Top offenders by size

Architectural Contradiction

Impact

Recommendations

Framework Mapping

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Category	Tokens	% of 200k
System prompt	3,500	1.7%
System tools	15,900	7.9%
Memory files (rules)	68,100	34.1%
Skills metadata	1,800	0.9%
Autocompact buffer	33,000	16.5%
Free space	71,000	35.6%

File	Raw bytes	Est. tokens
agent-cognition.md	64,707	~16,200
agent-integrity.md	34,403	~8,600
RULE-GROUP-4.md	32,952	~8,200
ANTHROPIC-STANDARDS.md	27,242	~6,800
RULE-GROUP-6.md	23,618	~5,900
RULE-GROUP-5.md	23,740	~5,900
All 16 files total	288,650	~68,100

Framework	ID	Relevance
OWASP LLM Top 10	LLM10 (Unbounded Consumption)	Self-inflicted resource exhaustion
MITRE ATLAS	AML.T0054 (LLM Resource Exhaustion)	Internal config causing token waste

[PERF] .claude/rules/ consumes 34% of context window on session start (68k/200k tokens) #2

Description

Summary

Evidence

Top offenders by size

Architectural Contradiction

Impact

Recommendations

Framework Mapping

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions