# Unix IPC Message Bus Implementation Guide

Building a Unix IPC message bus where multiple processes connect through a shared endpoint requires careful consideration of performance, reliability, and architectural patterns. Drawing on technical documentation, benchmarks, and production implementations, this guide provides concrete guidance for implementing such systems.

## Unix domain sockets emerge as the superior foundation

For implementing a message bus with multiple processes connecting via `/tmp/shared-endpoint`, **Unix domain sockets provide the most robust and scalable solution**. Unlike named pipes (FIFOs), where all writers feed a single byte stream, each byte is consumed by whichever reader grabs it first, and the server cannot tell clients apart, Unix sockets offer true multi-client support with an independent connection for each process. A basic implementation creates a server socket at a filesystem path, accepts multiple client connections, and maintains a list of connected file descriptors for message broadcasting.
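
A minimal sketch of that server side, assuming the shared path `/tmp/shared-endpoint` (`create_bus_socket` is an illustrative name and error handling is reduced to early returns):

```c
#include <string.h>
#include <sys/socket.h>
#include <sys/un.h>
#include <unistd.h>

#define SOCKET_PATH "/tmp/shared-endpoint"

/* Create and start listening on the bus's shared endpoint. */
int create_bus_socket(void) {
    int fd = socket(AF_UNIX, SOCK_STREAM, 0);
    if (fd < 0)
        return -1;

    struct sockaddr_un addr;
    memset(&addr, 0, sizeof addr);
    addr.sun_family = AF_UNIX;
    strncpy(addr.sun_path, SOCKET_PATH, sizeof addr.sun_path - 1);

    unlink(SOCKET_PATH);   /* remove a stale socket left by a previous run */
    if (bind(fd, (struct sockaddr *)&addr, sizeof addr) < 0 ||
        listen(fd, SOMAXCONN) < 0) {
        close(fd);
        return -1;
    }
    return fd;
}
```

Each client simply creates its own `AF_UNIX` stream socket and calls `connect()` on the same path; the broker `accept()`s it and adds the new descriptor to its client list.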

The performance characteristics strongly favor Unix sockets over other IPC mechanisms for this use case. While shared memory can achieve higher raw throughput (4-20x faster for small messages), Unix sockets provide essential features like automatic connection management, bidirectional communication, and built-in flow control that make them ideal for message bus architectures. The trade-off between raw speed and architectural cleanliness typically favors Unix sockets unless extreme performance is required.

## Core implementation patterns and message distribution

The **hub-and-spoke architecture** using a central broker process proves most effective for Unix socket-based message buses. The broker maintains connections to all clients using epoll or select for efficient I/O multiplexing, receives messages from any client, and broadcasts them to all other connected processes. This pattern scales linearly with the number of clients and provides a single point for implementing routing logic, authentication, and message transformation.

```c
// Essential broker pattern with epoll: one listening socket, many clients
int epfd = epoll_create1(EPOLL_CLOEXEC);
struct epoll_event ev, events[MAX_EVENTS];

// Watch the listening socket for incoming connections
ev.events = EPOLLIN;
ev.data.fd = server_fd;
epoll_ctl(epfd, EPOLL_CTL_ADD, server_fd, &ev);

// Main event loop
while (running) {
    int nfds = epoll_wait(epfd, events, MAX_EVENTS, -1);
    for (int i = 0; i < nfds; i++) {
        if (events[i].data.fd == server_fd) {
            accept_new_client();   // accept() and EPOLL_CTL_ADD the new descriptor
        } else {
            char buffer[1024];
            ssize_t bytes = recv(events[i].data.fd, buffer, sizeof(buffer), 0);
            if (bytes > 0) {
                broadcast_to_all_except(buffer, bytes, events[i].data.fd);
            } else {
                // 0 means the peer closed, <0 means an error: drop the client
                epoll_ctl(epfd, EPOLL_CTL_DEL, events[i].data.fd, NULL);
                close(events[i].data.fd);
                // also remove it from the broadcast list maintained by the broker
            }
        }
    }
}
```
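
The `broadcast_to_all_except()` helper is left undefined above; a minimal sketch, assuming the broker tracks connected descriptors in a flat array (`client_fds`, `client_count`, and `MAX_CLIENTS` are illustrative names maintained by `accept_new_client()`):

```c
#include <stddef.h>
#include <sys/socket.h>

#define MAX_CLIENTS 64

static int client_fds[MAX_CLIENTS];   /* filled by accept_new_client() */
static int client_count;

/* Forward one message to every connected client except its sender. */
void broadcast_to_all_except(const char *buf, size_t len, int sender_fd) {
    for (int i = 0; i < client_count; i++) {
        if (client_fds[i] == sender_fd)
            continue;
        /* MSG_NOSIGNAL turns a dead peer into an error instead of SIGPIPE;
           cleanup is left to the event loop, which will see the hang-up. */
        send(client_fds[i], buf, len, MSG_NOSIGNAL);
    }
}
```

For payloads larger than the socket buffer, the `send()` here can return a short count; the `send_all()` helper shown later in this guide handles that case.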

Message framing becomes critical when dealing with streaming sockets. The most reliable approach uses **length-prefixed messages** where each message begins with a fixed-size header containing the payload length. This prevents message boundary confusion and enables efficient buffer management. Note that the PIPE_BUF atomicity guarantee (4096 bytes on Linux) applies to writes on pipes and FIFOs, not to stream sockets; if the kernel should preserve message boundaries for you, SOCK_SEQPACKET or SOCK_DGRAM Unix sockets do that directly.
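
A sketch of that framing on blocking sockets (`send_frame` and `recv_frame` are illustrative names; the non-blocking broker side would instead buffer bytes per connection until a complete frame has arrived):

```c
#include <arpa/inet.h>    /* htonl, ntohl */
#include <stdint.h>
#include <sys/socket.h>
#include <sys/types.h>

/* Send one frame: a 4-byte length header followed by the payload.
   Production code should loop on short sends (see send_all later). */
int send_frame(int fd, const void *payload, uint32_t len) {
    uint32_t header = htonl(len);
    if (send(fd, &header, sizeof header, MSG_NOSIGNAL) != (ssize_t)sizeof header)
        return -1;
    if (send(fd, payload, len, MSG_NOSIGNAL) != (ssize_t)len)
        return -1;
    return 0;
}

/* Receive one frame into buf (capacity cap); returns the payload length or -1. */
ssize_t recv_frame(int fd, void *buf, size_t cap) {
    uint32_t header;
    if (recv(fd, &header, sizeof header, MSG_WAITALL) != (ssize_t)sizeof header)
        return -1;
    uint32_t len = ntohl(header);
    if (len > cap)
        return -1;   /* refuse oversized frames rather than corrupt the stream */
    if (recv(fd, buf, len, MSG_WAITALL) != (ssize_t)len)
        return -1;
    return (ssize_t)len;
}
```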

## Performance optimization through hybrid approaches

When extreme performance is required, a **hybrid architecture** combining multiple IPC mechanisms yields optimal results. The pattern uses Unix sockets for control messages and connection management while employing shared memory ring buffers for high-throughput data transfer. This approach can achieve 10-100x better performance than pure socket-based solutions while maintaining the architectural benefits of socket-based connection management.

Lock-free ring buffer implementations in shared memory can achieve over 20 million messages per second for single-producer/single-consumer scenarios. The key is careful attention to memory ordering and cache-line alignment:

```c
#include <stdatomic.h>
#include <stdint.h>

#define BUFFER_SIZE (1 << 20)   /* capacity; an illustrative choice */

struct ring_buffer {
    _Alignas(64) _Atomic uint64_t write_pos;  /* producer cursor on its own cache line */
    _Alignas(64) _Atomic uint64_t read_pos;   /* consumer cursor on its own cache line */
    char data[BUFFER_SIZE];
};
```

For multi-producer scenarios, more sophisticated synchronization is required. POSIX semaphores or robust mutexes provide process-safe synchronization; robust mutexes additionally let the next locker detect (via EOWNERDEAD) that a previous owner died while holding the lock and restore the shared state to a consistent condition.
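
A sketch of that robust, process-shared setup (the helper names are illustrative; the mutex itself must live in a mapping, such as one created with shm_open() and mmap(), that every participating process shares):

```c
#include <pthread.h>
#include <errno.h>

/* Initialize a mutex that lives in shared memory and survives the death
   of a lock holder; mtx must point into a mapping shared by all processes. */
int init_shared_mutex(pthread_mutex_t *mtx) {
    pthread_mutexattr_t attr;
    pthread_mutexattr_init(&attr);
    pthread_mutexattr_setpshared(&attr, PTHREAD_PROCESS_SHARED);
    pthread_mutexattr_setrobust(&attr, PTHREAD_MUTEX_ROBUST);
    return pthread_mutex_init(mtx, &attr);
}

/* Lock, recovering the mutex if its previous owner died while holding it. */
int lock_shared_mutex(pthread_mutex_t *mtx) {
    int rc = pthread_mutex_lock(mtx);
    if (rc == EOWNERDEAD) {
        /* The shared state may be half-updated; repair it here if needed,
           then mark the mutex usable again. */
        pthread_mutex_consistent(mtx);
        rc = 0;
    }
    return rc;
}
```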

## Process lifecycle and connection management

Proper handling of process connections and disconnections is crucial for production reliability. The message bus must detect when clients disconnect (gracefully or through crashes) and clean up resources accordingly. Unix domain sockets provide several mechanisms for this:

**Socket-level detection** through EPOLLHUP events or failed send operations immediately identifies disconnected clients. Because both endpoints share the same kernel, a peer's death surfaces at once as EPOLLHUP, a `recv()` returning 0, or EPIPE/ECONNRESET on send, so TCP-style keepalive probes add nothing here; long-lived but idle connections are better verified with lightweight application-level heartbeat messages. For shared memory implementations, robust mutexes (PTHREAD_MUTEX_ROBUST) let the next process that takes the lock detect that the previous holder died and recover.
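
In the epoll-based broker, that detection amounts to a few lines in the event loop (a sketch; `remove_client` is a hypothetical bookkeeping helper that drops the descriptor from the broadcast list):

```c
/* Inside the event loop: treat hang-ups and socket errors as disconnects. */
if (events[i].events & (EPOLLHUP | EPOLLERR)) {
    int fd = events[i].data.fd;
    epoll_ctl(epfd, EPOLL_CTL_DEL, fd, NULL);
    close(fd);
    remove_client(fd);
    continue;
}
```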

Signal handling requires careful design to avoid race conditions. The standard pattern uses signal-safe atomic flags checked in the main event loop rather than performing cleanup directly in signal handlers:

```c
#include <signal.h>

/* Async-signal-safe: the handler only sets a flag that the event loop polls. */
volatile sig_atomic_t shutdown_requested = 0;

void signal_handler(int sig) {
    if (sig == SIGTERM || sig == SIGINT) {
        shutdown_requested = 1;
    }
}
```
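
Registering the handler might look like the following sketch (`install_signal_handlers` is an illustrative name; SA_RESTART is deliberately left unset so a blocked call is interrupted):

```c
#include <signal.h>
#include <string.h>

/* Install handlers; without SA_RESTART a blocked epoll_wait() returns -1
   with errno == EINTR after the signal, so the loop re-checks the flag. */
void install_signal_handlers(void) {
    struct sigaction sa;
    memset(&sa, 0, sizeof sa);
    sa.sa_handler = signal_handler;
    sigemptyset(&sa.sa_mask);
    sigaction(SIGTERM, &sa, NULL);
    sigaction(SIGINT, &sa, NULL);
}
```

The event loop then treats an interrupted `epoll_wait()` as a cue to re-check `shutdown_requested`, close client sockets, and `unlink()` the socket path before exiting.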

## Concurrency, synchronization, and scalability

For high-concurrency scenarios, **epoll with edge-triggered mode** provides the best performance on Linux systems. This approach scales to tens of thousands of connections with O(1) event notification complexity. The event-driven architecture avoids the thread-per-connection model's memory overhead and context switching costs.
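
A sketch of what edge-triggered registration and draining look like (`add_client_edge_triggered` and `drain_client` are illustrative names):

```c
#include <errno.h>
#include <fcntl.h>
#include <sys/epoll.h>
#include <sys/socket.h>
#include <sys/types.h>

/* Edge-triggered descriptors must be non-blocking, and every EPOLLIN must be
   drained until EAGAIN, or the next readiness notification may never come. */
void add_client_edge_triggered(int epfd, int client_fd) {
    fcntl(client_fd, F_SETFL, fcntl(client_fd, F_GETFL, 0) | O_NONBLOCK);

    struct epoll_event ev;
    ev.events = EPOLLIN | EPOLLET;
    ev.data.fd = client_fd;
    epoll_ctl(epfd, EPOLL_CTL_ADD, client_fd, &ev);
}

/* Called when EPOLLIN fires for fd: read until the kernel buffer is empty. */
void drain_client(int fd) {
    char buffer[4096];
    for (;;) {
        ssize_t n = recv(fd, buffer, sizeof buffer, 0);
        if (n > 0)
            continue;          /* hand the bytes to the framing layer here */
        if (n == 0)
            break;             /* peer closed the connection */
        if (errno == EAGAIN || errno == EWOULDBLOCK)
            break;             /* fully drained */
        if (errno == EINTR)
            continue;          /* interrupted: retry */
        break;                 /* real error: caller should drop the client */
    }
}
```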

Synchronization between multiple writers requires careful consideration. For shared memory approaches, atomic operations and memory barriers enable lock-free implementations for specific patterns. However, most production systems benefit from the simplicity of mutex-based synchronization with proper error handling for partial operations and EINTR interruptions.
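
On the socket path, that error handling mostly means never assuming a single call transferred everything; a sketch of a write-everything helper (`send_all` is an illustrative name):

```c
#include <errno.h>
#include <sys/socket.h>
#include <sys/types.h>

/* Write the whole buffer, retrying after EINTR and short sends. */
int send_all(int fd, const void *buf, size_t len) {
    const char *p = buf;
    while (len > 0) {
        ssize_t n = send(fd, p, len, MSG_NOSIGNAL);
        if (n < 0) {
            if (errno == EINTR)
                continue;      /* interrupted before anything was sent */
            return -1;         /* EAGAIN, EPIPE, ...: let the caller decide */
        }
        p   += n;
        len -= (size_t)n;
    }
    return 0;
}
```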

## Security hardening and production considerations

Production message bus implementations must address several security concerns. Unix domain sockets support credential passing through SO_PEERCRED, enabling authentication based on process UID/GID. File permissions on the socket path provide basic access control, though abstract namespace sockets (Linux-specific) avoid filesystem permission issues entirely; they also forgo that permission check, so any process can connect and access control must then rest on SO_PEERCRED or an LSM policy.
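
A sketch of the SO_PEERCRED check (`authenticate_peer` and the UID-based policy are illustrative; glibc exposes `struct ucred` when `_GNU_SOURCE` is defined):

```c
#define _GNU_SOURCE            /* struct ucred */
#include <sys/socket.h>
#include <sys/types.h>

/* Allow the connection only for a specific UID (or root); real deployments
   might instead check group membership or consult a configuration file. */
int authenticate_peer(int client_fd, uid_t allowed_uid) {
    struct ucred cred;
    socklen_t len = sizeof cred;
    if (getsockopt(client_fd, SOL_SOCKET, SO_PEERCRED, &cred, &len) < 0)
        return -1;
    return (cred.uid == allowed_uid || cred.uid == 0) ? 0 : -1;
}
```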

Rate limiting prevents denial-of-service attacks from misbehaving clients. A simple token bucket algorithm per client connection effectively limits message rates while allowing burst traffic:

```c
#include <stdbool.h>
#include <time.h>

#define MAX_TOKENS 100          /* per-client budget per one-second window */

typedef struct {                /* fields this snippet assumes per connection */
    int    tokens;
    time_t last_reset;
} client_t;

/* Coarse token bucket: the full budget is restored at most once per second. */
bool check_rate_limit(client_t *client) {
    time_t now = time(NULL);
    if (now > client->last_reset) {
        client->tokens = MAX_TOKENS;
        client->last_reset = now;
    }
    if (client->tokens > 0) {
        client->tokens--;
        return true;
    }
    return false;
}
```

## Real-world implementations and architectural choices

Production systems demonstrate various architectural trade-offs. **D-Bus**, the Linux desktop standard, uses Unix domain sockets with a central daemon providing message routing, service activation, and security policy enforcement. Its hub-and-spoke architecture handles system-wide and per-user session buses effectively but incurs ~2.5x overhead compared to direct IPC.

**Redis** configured with Unix sockets for local communication provides a pragmatic pub/sub message bus with persistence options and rich data structures. While not as performant as custom solutions, Redis offers battle-tested reliability and extensive language bindings.

For embedded systems or performance-critical applications, **nanomsg/nng** provides a socket-like API with multiple messaging patterns including bus topology. It abstracts the underlying IPC mechanism while providing zero-copy message passing and automatic reconnection.

## Conclusion

Implementing a Unix IPC message bus requires balancing performance, reliability, and complexity. **Unix domain sockets provide the best foundation for most use cases**, offering natural multi-client support, connection management, and sufficient performance for typical messaging workloads. When extreme performance is required, hybrid approaches combining sockets for control with shared memory for data transfer can achieve orders of magnitude better throughput.

The key to a successful implementation lies in careful attention to process lifecycle management, proper error handling for partial operations, and appropriate synchronization mechanisms. Whether building a simple pub/sub system or a complex service bus, the patterns and techniques outlined here provide a solid foundation for robust inter-process communication on Unix systems.