You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_tool_call_quality/_tool_call_quality.py
message=f"Invalid score value: {score}. Expected a number in range [{ToolCallAccuracyEvaluator._MIN_TOOL_CALL_ACCURACY_SCORE}, {ToolCallAccuracyEvaluator._MAX_TOOL_CALL_ACCURACY_SCORE}].",
248
+
message=f"Invalid score value: {score}. Expected a number in range [{ToolCallQualityEvaluator._MIN_TOOL_CALL_QUALITY_SCORE}, {ToolCallQualityEvaluator._MAX_TOOL_CALL_QUALITY_SCORE}].",
Copy file name to clipboardExpand all lines: sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_tool_call_quality/tool_call_quality.prompty
+2-2Lines changed: 2 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
---
2
-
name: Tool Call Accuracy
3
-
description: Evaluates Tool Call Accuracy for tool used by agent
2
+
name: Tool Call Quality
3
+
description: Evaluates Tool Call Quality for tool used by agent
"The Tool Call Accuracy evaluator assesses how accurately an AI uses tools by examining:\n",
36
+
"The Tool Call Quality evaluator assesses how accurately an AI uses tools by examining:\n",
37
37
"- Relevance to the conversation\n",
38
38
"- Parameter correctness according to tool definitions\n",
39
39
"- Parameter value extraction from the conversation\n",
@@ -53,18 +53,18 @@
53
53
"cell_type": "markdown",
54
54
"metadata": {},
55
55
"source": [
56
-
"Tool Call Accuracy requires following input:\n",
56
+
"Tool Call Quality requires following input:\n",
57
57
"- Query - This can be a single query or a list of messages(conversation history with agent). Latter helps to determine if Agent used the information in history to make right tool calls.\n",
58
58
"- Tool Calls - Tool Call(s) made by Agent to answer the query. Optional - if response has tool calls, if not provided evaluator will look for tool calls in response.\n",
59
-
"- Response - (Optional)Response from Agent (or any GenAI App). This can be a single text response or a list or messages generated as part of Agent Response. If tool calls are not provide Tool Call Accuracy Evaluator will look at response for tool calls.\n",
59
+
"- Response - (Optional)Response from Agent (or any GenAI App). This can be a single text response or a list or messages generated as part of Agent Response. If tool calls are not provide Tool Call Quality Evaluator will look at response for tool calls.\n",
60
60
"- Tool Definitions - Tool(s) definition used by Agent to answer the query. \n"
0 commit comments