Skip to content

Conversation

@strickvl
Copy link
Contributor

Summary

Please provide a short summary explaining the motivation behind these changes.

Checklist

  • I have read the contributing guidelines.
  • I have run the necessary tests and linters.
  • I have updated relevant documentation where applicable.

Related Issues

Please link to any relevant issues or discussions.

strickvl and others added 2 commits May 28, 2025 00:53
This commit implements the design document for splitting the monolithic ResearchState into separate, named artifacts with custom visualizations. Major changes include:

## New Artifact Classes
- **QueryContext**: Immutable context containing the research query and sub-questions
- **SearchData**: All search results and cost tracking information
- **SynthesisData**: Synthesized information from searches (including enhanced versions)
- **AnalysisData**: Cross-viewpoint analysis and reflection metadata
- **FinalReport**: The generated HTML report with metadata

## Custom Materializers
Each artifact now has its own materializer with beautiful HTML visualizations:
- Interactive charts using Chart.js for search costs
- Collapsible sections for better organization
- Consistent styling across all artifact views
- Metadata tables with key statistics

## Pipeline Updates
- All steps refactored to use the new artifact-based approach
- Proper dependencies established between parallel steps
- Fixed merge step to run after parallel sub-question processing
- Updated metadata logging and tagging throughout

## Bug Fixes
- Fixed log_metadata calls to include infer_artifact=True parameter
- Fixed template variable names in final report generation
- Corrected enhanced_info merging logic to preserve original synthesis data
- Added proper step dependencies in parallel pipeline

## Test Updates
- Updated all tests to use the new artifact-based interface
- Tests now create individual artifacts instead of ResearchState
- Maintained test coverage for all functionality

This refactoring improves modularity, enables better artifact visualization in the ZenML dashboard, and makes the pipeline more maintainable and extensible.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <[email protected]>
@dagshub
Copy link

dagshub bot commented May 27, 2025

@strickvl strickvl changed the title Feature/split research state Refactor ResearchState into smaller artifacts May 27, 2025
@strickvl strickvl merged commit 19eb71a into feat/deep-research May 28, 2025
4 of 5 checks passed
@strickvl strickvl deleted the feature/split-research-state branch May 28, 2025 06:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants