feat: Implement Lifecycle Hooks System - Phase 1 Complete #2

mikeumus · 2025-05-24T03:38:01Z

Summary

This PR implements a comprehensive lifecycle hooks system for Codex, enabling external scripts, webhooks, and integrations to be triggered at specific points in the Codex execution lifecycle.

What's Implemented

✅ Phase 1.1: Hook Type Definitions and Core Types

Core Type System: Complete LifecycleEvent enum with 12 event types covering all major Codex lifecycle points
Hook Types: Support for scripts, webhooks, MCP tools, and custom executables
Hook Context: Environment variable injection, temporary file management, and template substitution
Configuration System: TOML-based configuration with validation and schema checking
Infrastructure: Hook registry, manager, and async executor framework

✅ Phase 1.2: Hook Registry System

Priority Management: Automatic sorting and priority-based execution ordering
Conditional Execution: Expression parser supporting equality, contains, and boolean conditions
Advanced Features: Hook filtering by tags/priority, runtime registration, statistics
Configuration Integration: Full integration with Codex's main configuration system

Key Features

Lifecycle Events

The system supports hooks for all major Codex lifecycle points:

Session lifecycle (start, end)
Task lifecycle (start, complete)
Execution lifecycle (before/after command execution)
Patch lifecycle (before/after patch application)
MCP tool lifecycle (before/after tool calls)
Agent interactions (messages, reasoning)
Error handling

Hook Types

Script Hooks: Execute shell scripts/commands with event context
Webhook Hooks: Send HTTP requests to external APIs with event data
MCP Tool Hooks: Call MCP tools as lifecycle hooks
Custom Executable Hooks: Execute any binary with event data

Configuration Example

[hooks]
enabled = true
timeout_seconds = 30

# Session lifecycle hooks
[[hooks.session]]
event = "session_start"
type = "script"
command = ["./scripts/session-start.sh"]
environment = { CODEX_SESSION_ID = "{session_id}" }

# Task completion webhook
[[hooks.task]]
event = "task_complete"
type = "webhook"
url = "https://api.example.com/codex/task-complete"
method = "POST"
condition = "success == true"

# Error notification
[[hooks.agent]]
event = "agent_message"
type = "script"
command = ["./scripts/notify-error.sh"]
condition = "message.contains('ERROR')"
priority = 0  # High priority

Advanced Features

Conditional Execution

# Only run on successful commands
condition = "exit_code == 0"

# Only run on error messages
condition = "message.contains('ERROR')"

# Environment-based conditions
condition = "env.ENVIRONMENT == production"

Priority Management

HookPriority::HIGHEST  // 0 - Critical hooks
HookPriority::HIGH     // 25 - Important hooks
HookPriority::NORMAL   // 100 - Default priority
HookPriority::LOW      // 200 - Background hooks
HookPriority::LOWEST   // u32::MAX - Cleanup hooks

Template Substitution

# Hook scripts receive context via environment variables
echo "Task {task_id} completed with status {success}"
# Becomes: "Task task_123 completed with status true"

Architecture

The hooks system is built around these core components:

Hook Manager: Central coordinator for all lifecycle hooks
Hook Registry: Stores hook definitions and manages event routing
Hook Executors: Execute different types of hooks (scripts, webhooks, MCP tools)
Hook Context: Provides execution context and data to hooks

Integration

Configuration: Seamlessly integrated with Codex's existing config system
Events: Built on top of Codex's existing event architecture
MCP: Reuses existing MCP infrastructure for tool-based hooks
Security: Respects existing sandbox policies and permission systems

Testing

12 passing tests covering all major functionality
Unit tests for core types, configuration, and registry
Integration tests with the main configuration system
Test coverage for priority sorting, condition evaluation, and hook execution

Files Added/Modified

New Files

TODO.md - Comprehensive implementation roadmap
codex-rs/core/src/hooks/mod.rs - Main hooks module
codex-rs/core/src/hooks/types.rs - Core type definitions
codex-rs/core/src/hooks/context.rs - Hook execution context
codex-rs/core/src/hooks/config.rs - Configuration parsing
codex-rs/core/src/hooks/registry.rs - Hook registry implementation
codex-rs/core/src/hooks/manager.rs - Hook manager
codex-rs/core/src/hooks/executor.rs - Hook execution framework

Modified Files

codex-rs/core/Cargo.toml - Added dependencies (chrono, tempfile, async-trait)
codex-rs/core/src/lib.rs - Added hooks module export
codex-rs/core/src/config.rs - Integrated hooks configuration

Next Steps

This PR completes Phase 1 of the lifecycle hooks implementation. The next phases will include:

Phase 2: Hook Execution Engine (manager coordination, timeout management)
Phase 3: Event System Integration (protocol extensions, core integration)
Phase 4: Client-Side Integration (CLI support, event processing)
Phase 5: Configuration and Documentation (examples, comprehensive docs)
Phase 6: Testing and Validation (e2e tests, performance testing)
Phase 7: Advanced Features (monitoring, additional hook types)

Breaking Changes

None. This is a purely additive feature that doesn't affect existing functionality.

Performance Impact

Minimal. The hooks system is designed to be:

Async-first: Non-blocking execution
Opt-in: Disabled by default, zero overhead when not used
Efficient: Event-driven architecture with minimal overhead

Ready for Review: This PR establishes a solid foundation for the lifecycle hooks system with comprehensive type safety, configuration management, and extensibility.

Pull Request opened by Augment Code with guidance from the PR author

- Add detailed implementation plan for Codex lifecycle hooks system - Define 7-phase implementation approach with 21 major sections - Include architecture overview, security considerations, and progress tracking - Establish foundation for programmatic interface to Codex lifecycle events - Support for scripts, webhooks, MCP tools, and custom executables as hooks

- Add comprehensive lifecycle hooks system foundation - Create core hook types and lifecycle events - Implement hook execution context with template substitution - Add hook configuration parsing and validation - Create hook registry and manager infrastructure - Add async hook executor framework with trait definitions - Update Cargo.toml with required dependencies (chrono, tempfile, async-trait) - Add hooks module to core library exports Completed Phase 1.1 tasks: ✅ Main hooks module with comprehensive documentation ✅ LifecycleEvent enum covering all major Codex lifecycle points ✅ HookType definitions for scripts, webhooks, MCP tools, executables ✅ HookContext with environment variables and temp file management ✅ Hook configuration system with TOML parsing and validation ✅ Hook registry for event routing and hook management ✅ Hook manager for coordinating hook execution ✅ Async hook executor framework with trait definitions Next: Phase 1.2 - Complete hook registry implementation

- Implement comprehensive hook registry with priority management - Add conditional execution support with expression evaluation - Support for equality, contains, and boolean conditions - Add hook filtering by tags and priority ranges - Implement hook statistics and registry management - Add hooks configuration integration with main Codex config - Support for runtime hook registration and removal - Add comprehensive test coverage for all registry features Completed Phase 1.2 tasks: ✅ Hook priority and dependency management with automatic sorting ✅ Conditional execution support with expression parser ✅ Hook registry statistics and management functions ✅ Integration with main Codex configuration system ✅ Runtime hook registration and removal capabilities ✅ Comprehensive test coverage with 12 passing tests Features implemented: - Priority-based hook execution ordering - Conditional hook execution with field-based expressions - Hook filtering by tags, priority ranges, and conditions - Registry statistics for monitoring and debugging - Integration with Codex Config and ConfigToml structs - Support for environment variable conditions - Template variable substitution in hook contexts Next: Phase 2.1 - Hook execution coordination and management

- Add comprehensive parallel development strategy to TODO.md - Create TODO-DEVELOPER-A.md for backend/execution engine work - Create TODO-DEVELOPER-B.md for frontend/documentation work - Define clear file ownership and coordination protocols - Establish branch strategy and communication guidelines - Set success criteria and progress tracking for both developers Workstream Split: 🔵 Developer A: Core Execution Engine (Backend Focus) - Phase 2: Hook Execution Engine - Phase 3: Event System Integration - Phase 6: Testing and Validation - 30 backend-focused tasks 🟢 Developer B: Client Integration & Documentation (Frontend/Docs Focus) - Phase 4: Client-Side Integration - Phase 5: Configuration and Documentation - Phase 7: Advanced Features - 40 frontend/docs-focused tasks Benefits: - Minimal file conflicts (clear ownership boundaries) - Parallel development without blocking dependencies - Clear communication protocols and merge strategies - Focused expertise areas for each developer - Comprehensive progress tracking and success metrics

- Add Phase 8: Magentic-One QA Integration to main TODO.md - Assign Magentic-One QA tasks to Developer B workstream - Add comprehensive Magentic-One implementation guide - Include safety protocols and container isolation - Add automated testing workflows and examples - Update progress tracking for all TODO files Phase 8 Features: 🤖 Magentic-One Setup and Configuration - Multi-agent system for automated QA - GPT-4o powered Orchestrator agent - Secure containerized execution environment 🔍 Automated QA Agent Implementation - FileSurfer for configuration validation - WebSurfer for webhook endpoint testing - Coder agent for test script generation - ComputerTerminal for CLI automation ⚡ QA Workflow Integration - Automated test suite generation - End-to-end testing scenarios - Performance benchmarking automation - Regression testing workflows 🛡️ Safety and Monitoring - Container isolation protocols - Comprehensive logging and monitoring - Human oversight and access restrictions - Prompt injection protection Benefits: - Autonomous testing and validation of hooks system - Comprehensive QA coverage with minimal manual effort - Integration with existing Codex testing infrastructure - Advanced multi-agent coordination for complex test scenarios - Safety-first approach with proper isolation and monitoring

- Add Phase 9: Comprehensive E2E Testing as dedicated testing phase - Expand Phase 6.3 with detailed E2E testing scenarios - Enhance Phase 8.3 with comprehensive Magentic-One E2E workflows - Split Phase 9 between both developers (backend/frontend portions) - Update progress tracking to reflect enhanced testing coverage Enhanced E2E Testing Coverage: 🧪 Phase 6.3: Traditional E2E Tests (Developer A) - Complete hook workflows testing (6 scenarios) - Integration testing with existing Codex (5 scenarios) - Cross-platform E2E testing (4 environments) - Real-world scenario testing (5 scenarios) - Performance and security E2E (4 scenarios) 🤖 Phase 8.3: AI-Powered E2E with Magentic-One (Developer B) - Automated test suite generation (4 capabilities) - Hook configuration validation automation (4 validations) - Multi-agent E2E testing scenarios (5 scenarios) - Performance benchmarking automation (4 capabilities) - Regression testing workflows (4 workflows) 🎭 Phase 9: Comprehensive E2E Testing (Both Developers) - Playwright E2E test suite for CLI (5 tests) - Real-world integration testing (5 scenarios) - Cross-environment E2E validation (5 environments) Total E2E Coverage: - 24 traditional E2E test scenarios - 21 AI-powered automated E2E workflows - 15 comprehensive cross-environment tests - 60+ individual E2E test cases across all phases Benefits: - Comprehensive coverage of all hook functionality - Both manual and automated testing approaches - Cross-platform and cross-environment validation - Real-world scenario testing with actual services - AI-powered test generation and execution - Continuous regression testing capabilities

mikeumus added 6 commits May 24, 2025 01:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: Implement Lifecycle Hooks System - Phase 1 Complete #2

feat: Implement Lifecycle Hooks System - Phase 1 Complete #2

Uh oh!

mikeumus commented May 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: Implement Lifecycle Hooks System - Phase 1 Complete #2

Are you sure you want to change the base?

feat: Implement Lifecycle Hooks System - Phase 1 Complete #2

Uh oh!

Conversation

mikeumus commented May 24, 2025

Summary

What's Implemented

✅ Phase 1.1: Hook Type Definitions and Core Types

✅ Phase 1.2: Hook Registry System

Key Features

Lifecycle Events

Hook Types

Configuration Example

Advanced Features

Conditional Execution

Priority Management

Template Substitution

Architecture

Integration

Testing

Files Added/Modified

New Files

Modified Files

Next Steps

Breaking Changes

Performance Impact

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants