Replies: 2 comments 1 reply
-
|
I also join your feeling, would it be possible to have some acceptance criteria and test results ? |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
An example: How does this prescription in the agent instructions effect to desired behavior of detecting "ambiguous schemas"? Seems a little ambiguous. How would this "clarification" be ranked among the nearly infinite possibilities that it encompasses? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Is there any discussion on how these components (prompts, skills, etc.) are evaluated and scored for utility? How are changes evaluated? If skills and prompts are later updated, how do we know the effect one way or another? Are there efforts to study this?
For the moment, it looks like 99% of this is caveat emptor.
Beta Was this translation helpful? Give feedback.
All reactions