# RFC: Exposing IDE capabilities

*A natural language interface to VSCode and Language Server Protocol features*

**Tracking Issue**: [#8](https://github.com/socratic-shell/dialectic/issues/8)

## Problem Statement

Currently, AI assistants working with code need many specific MCP tools to interact with the IDE:
- `dialectic___get_selection` for getting selected text
- `builder_mcp___WorkspaceSearch` for finding code patterns
- Separate tools would be needed for each LSP feature (find references, go to definition, etc.)

This creates several problems:
- **Tool selection overwhelm**: Too many specific tools make it hard for AI to choose the right approach
- **Inconsistent interfaces**: Each tool has different parameter formats and return structures
- **Limited composability**: Hard to combine operations (e.g., "find references to the currently selected symbol")
- **Poor discoverability**: AI assistants must memorize many tool names and signatures

## Proposed Solution

Replace multiple specific tools with a single `ideCapability(string)` tool that:

1. **Accepts natural language requests**: "find all references to validateToken"
2. **Returns either results or refinement suggestions**: Success with data, or "ambiguous, try one of these options"
3. **Uses a composable JSON mini-language internally** for precise operations
4. **Provides self-teaching error messages** that guide AI assistants toward successful usage
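
As an illustration of how a natural language request might be translated into the internal mini-language, consider the "currently selected symbol" example above. The function names (`findReferences`, `getSelection`) and the nesting rules here are assumptions; the actual vocabulary and composition rules are left to the sub-RFCs.

```typescript
// Hypothetical round trip: the assistant sends plain English, the tool
// compiles it to a nested JSON program before touching any VSCode APIs.
const request = "find all references to the currently selected symbol";

// One possible internal translation (names illustrative, not final):
// the result of `getSelection` is fed into `findReferences`.
const program = {
  findReferences: {
    symbol: { getSelection: {} },
  },
};
```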

## Interface Design

### Single Entry Point
```typescript
ideCapability(request: string): string
```

### Response Types

**Success:**
```
"Success, results: [{"file": "auth.ts", "line": 42, "context": "validateToken(user)"}]"
```

**Ambiguous request:**
```
"Ambiguous request, consider one of the following:
(1) {"findReferences": {"symbol": {"name": "validateToken", "file": "auth.ts", "line": 42}}}
(2) {"findReferences": {"symbol": {"name": "validateToken", "file": "utils.ts", "line": 15}}}"
```

**Capability not available:**
```
"We don't have the ability to do that :("
```
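
Because every outcome comes back as a single string, a caller ends up dispatching on the response prefix. A minimal sketch, assuming the literal prefixes shown above stay stable:

```typescript
// Dispatch on the assumed response prefixes from the examples above.
function handleResponse(response: string): void {
  if (response.startsWith("Success")) {
    // Parse the JSON payload after the prefix as needed.
    console.log("results:", response);
  } else if (response.startsWith("Ambiguous request")) {
    console.log("pick one of the suggested refinements and retry:", response);
  } else {
    console.log("capability unavailable:", response);
  }
}
```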

## Internal Architecture

The system has three main layers:

### 1. Natural Language Interface
- Converts natural language requests to JSON mini-language programs
- Handles ambiguity resolution and provides refinement suggestions
- Acts as the "front door" for AI assistants

### 2. JSON Mini-Language Runtime
- Executes composable JSON programs
- Manages value types (Symbol, Selection, Location, etc.)
- Handles function composition and error propagation
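
One way this runtime's value model could look in TypeScript is sketched below; the shapes mirror the bullets above, but every name here is an assumption pending the scripting-language sub-RFC.

```typescript
// Assumed value and program shapes for the mini-language runtime.
type IdeLocation = { file: string; line: number; column: number };
type SymbolValue = { name: string; definedAt: IdeLocation };
type SelectionValue = { file: string; start: IdeLocation; end: IdeLocation };
type Value = IdeLocation | SymbolValue | SelectionValue | string | Value[];

// A program is a single function call whose arguments may themselves be programs.
type Program = { [fn: string]: Record<string, Program | Value> };

// Evaluation yields a value or an error that propagates outward to the caller.
type EvalResult = { ok: true; value: Value } | { ok: false; error: string };
```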

### 3. VSCode Integration Layer
- Maps JSON functions to actual VSCode/LSP calls
- Handles async operations and editor state
- Returns results in JSON mini-language format
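
As a sketch of this layer, the mini-language's hypothetical `findReferences` function could map onto VSCode's built-in reference provider command. The `executeCommand` call and `vscode.executeReferenceProvider` are real VSCode APIs; the wrapper shape and result format are assumptions.

```typescript
import * as vscode from "vscode";

// Assumed wrapper: resolve references for a symbol at a known location and
// return them in the success format used by the examples above.
async function findReferences(file: string, line: number, column: number): Promise<string> {
  const uri = vscode.Uri.file(file);
  const position = new vscode.Position(line - 1, column - 1); // VSCode positions are 0-based
  const locations = await vscode.commands.executeCommand<vscode.Location[]>(
    "vscode.executeReferenceProvider",
    uri,
    position
  );
  const results = (locations ?? []).map((loc) => ({
    file: loc.uri.fsPath,
    line: loc.range.start.line + 1,
  }));
  return `Success, results: ${JSON.stringify(results)}`;
}
```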

## Benefits

**For AI Assistants:**
- Single tool to learn instead of many specific ones
- Natural language interface reduces cognitive load
- Self-teaching through error messages
- Composable operations enable complex workflows

**For Users:**
- More capable AI assistance with IDE operations
- Consistent interface across all IDE features
- Better error messages and suggestions
- Extensible system for future capabilities

**For Developers:**
- Clean separation between language runtime and IDE integration
- Easy to add new capabilities
- Testable and maintainable architecture
- Reusable across different editors (future)

## Open Questions

This RFC establishes the overall approach, but several design questions need resolution:

1. **[Scripting Language Design](./ide-capabilities/scripting-language.md)**: How should the JSON mini-language work? What are the core concepts and composition rules?

2. **[Natural Language Interface](./ide-capabilities/natural-language-interface.md)**: How do we convert natural language requests to JSON programs? What's the right confidence threshold for execution vs clarification?

3. **[Capability Registry](./ide-capabilities/capability-registry.md)**: What IDE capabilities should we expose initially? What are their function signatures and required value types?

## Implementation Strategy

### Phase 1: Proof of Concept
- Implement basic JSON mini-language runtime
- Create a few essential capabilities (getSelection, findSymbol, findReferences)
- Build simple natural language interface (possibly rule-based)
- Validate the overall approach
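
For this phase, the "possibly rule-based" front door could start as little more than a couple of pattern matches that either emit a program or fall through to the "not available" response. Purely illustrative; the patterns and program names are placeholders.

```typescript
// Deliberately naive rule-based translation for the proof of concept.
function translate(request: string): object | undefined {
  const refs = request.match(/references to (\w+)/i);
  if (refs) {
    // e.g. "find all references to validateToken"
    return { findReferences: { symbol: { findSymbol: { name: refs[1] } } } };
  }
  if (/current(ly)? select/i.test(request)) {
    return { getSelection: {} };
  }
  return undefined; // maps to: "We don't have the ability to do that :("
}
```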

### Phase 2: Core Capabilities
- Expand capability set to cover common IDE operations
- Improve natural language processing
- Add comprehensive error handling and suggestions
- Replace existing specific MCP tools

### Phase 3: Advanced Features
- Add refactoring operations (rename, extract method, etc.)
- Integrate with more LSP features
- Optimize performance and user experience
- Consider extending to other editors

## Success Criteria

This RFC will be considered successful when:
- AI assistants can perform common IDE operations through natural language
- The tool selection problem is significantly reduced
- Error messages effectively guide AI assistants to successful usage
- The system is extensible enough to add new capabilities easily
- User feedback indicates improved AI assistance quality

## Next Steps

1. Review and refine this overall proposal
2. Work through the detailed design questions in the sub-RFCs
3. Build a minimal prototype to validate core concepts
4. Iterate based on real usage with AI assistants