Allow LangSmith evaluators to access tool calls #24894
alexnodeland announced in Ideas
Replies: 0 comments
Feature request
It would be useful to allow LangSmith evaluators to assess tool calls. Currently, if a user runs the prompt playground with function calling against examples in a dataset, the playground automatically appends a `StrOutputParser` to the LLM response and extracts the `output` from the response `content`. Then, when building an evaluator and selecting the `Submission`, I cannot select `tool_calls`, only the `output` from the `StrOutputParser`. I would like to be able to build an evaluator that looks at the `tool_calls` attribute of the response and compares it to an expectation defined in the dataset.
Motivation
Essentially, I just want to be able to build evaluators for function calling that leverage the prompt playground and dataset features in LangSmith directly.
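For illustration, here is a minimal sketch of the kind of evaluator the request describes. It is not something the playground supports today. It assumes the evaluator receives the run outputs and the dataset's reference outputs as plain dicts, each with a `tool_calls` list whose entries carry `name` and `args` keys (the shape LangChain uses for `AIMessage.tool_calls`). The function names, the `outputs` / `reference_outputs` parameter names, and the sample data are all hypothetical.

```python
from typing import Any


def normalize_tool_calls(tool_calls: list[dict[str, Any]]) -> list[tuple[str, dict[str, Any]]]:
    """Reduce each tool call to (name, args), dropping run-specific fields such as `id`."""
    return [(call["name"], call.get("args", {})) for call in tool_calls]


def tool_calls_evaluator(outputs: dict[str, Any], reference_outputs: dict[str, Any]) -> dict[str, Any]:
    """Return score 1 if the predicted tool calls equal the expected ones (order-sensitive), else 0."""
    predicted = normalize_tool_calls(outputs.get("tool_calls", []))
    expected = normalize_tool_calls(reference_outputs.get("tool_calls", []))
    return {"key": "tool_calls_match", "score": int(predicted == expected)}


# Hypothetical model output: what the evaluator would need access to
# if `tool_calls` were selectable instead of only the parsed string output.
model_output = {
    "tool_calls": [
        {"name": "get_weather", "args": {"city": "Paris"}, "id": "call_1"},
    ]
}

# Hypothetical expectation stored alongside the example in the dataset.
dataset_expectation = {
    "tool_calls": [
        {"name": "get_weather", "args": {"city": "Paris"}},
    ]
}

result = tool_calls_evaluator(model_output, dataset_expectation)
print(result)
```

The comparison ignores the call `id`, since it differs between runs, and treats the order of calls as significant. A looser evaluator could sort the normalized calls first or score partial matches.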
Proposal (If applicable)
No response