v0.8.0
What's New
- TAU-bench integration: Full benchmark framework for evaluating agents on TAU-bench tasks
- Recursive Reflector: New reflector module with sandbox execution, trace context, and sub-agent support
- Skillbook tools: Clean, consolidate, and merge skillbooks via new utility scripts
Full Changelog: v0.7.0...v0.8.0