-
Notifications
You must be signed in to change notification settings - Fork 45
Open
Labels
Description
Description
Add a new agent to handle the creation and evaluation of complex benchmarks.
For example, if the create_portfolio agent creates a portfolio composed of 70% stocks and 30% bonds with a set of different ETFs, it would be interesting if the benchmark agent created a benchmark like this:
- 1 global equity ETF for a total of 70% of the benchmark
- 1 bond ETF for a total of 30% of the benchmark
Why
Centralize benchmark logic for portfolio comparisons and reproducible performance tests.
Acceptance criteria
- Directory created: src/nodes/analyst_agents/benchmark
- Includes scaffold: README.md, init.py, agent.py (interface), tests/test_benchmark_agent.py
- Agent supports: loading benchmarks, running evaluations, returning metrics
Testing
- Unit tests verify agent can load a sample benchmark and return expected metric keys.
Reactions are currently unavailable