-
Notifications
You must be signed in to change notification settings - Fork 249
Multi-agent collaboration with inline evals #2002
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
|
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
examples/experimental/trulens-no-plan-eval-multi-agent-collaboration.ipynb
Show resolved
Hide resolved
examples/experimental/trulens-no-plan-eval-multi-agent-collaboration.ipynb
Show resolved
Hide resolved
examples/experimental/trulens-no-plan-eval-multi-agent-collaboration.ipynb
Show resolved
Hide resolved
| @@ -0,0 +1,888 @@ | |||
| { | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
examples/experimental/trulens-no-plan-eval-multi-agent-collaboration.ipynb
Show resolved
Hide resolved
| @@ -0,0 +1,888 @@ | |||
| { | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Line #50. parsed_eval = json.loads(result.content)
this will fail right? result does not have .content attribute as it's just a dict
Reply via ReviewNB
Description
Researcher and chart generator multi-agent system using Trulens evaluation functions. Includes step-wise evaluations for researcher (RAG triad) and chart generator (custom chart accuracy, formatting, relevance evaluations). Also includes trajectory execution evaluation. Includes orchestrator but does NOT include planning.
Other details good to know for developers
Version still needs work. Currently facing OpenAIEndpoint request failed issues.
Type of change
not work as expected)