You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/concepts/observability.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -61,7 +61,7 @@ To learn more, see [Textual similarity evaluators](./evaluation-evaluators/textu
61
61
|--|--|--|
62
62
| Retrieval | Measures how effectively the system retrieves relevant information. | Query , context |
63
63
| Document Retrieval | Measures accuracy in retrieval results given ground truth. | Ground truth , retrieved documents, |
64
-
| Groundedness | Measures how consistent the response is with respect to the retrieved context. | (optional) Query, context, response |
64
+
| Groundedness | Measures how consistent the response is with respect to the retrieved context. | Query (optional), context, response |
65
65
| Groundedness Pro | Measures whether the response is consistent with respect to the retrieved context. | Query, context, response |
66
66
| Relevance | Measures how relevant the response is with respect to the query. | Query, response|
67
67
| Response Completeness | Measures to what extent the response is complete (not missing critical information) with respect to the ground truth. | Response, ground truth |
@@ -88,7 +88,7 @@ To learn more, see [Risk and safety evaluators](./evaluation-evaluators/risk-saf
88
88
| Evaluator | Purpose | Inputs |
89
89
|--|--|--|
90
90
| Intent Resolution | Measures how accurately the agent identifies and addresses user intentions.| Query, response |
91
-
| Task Adherence | Measures how well the agent follows through on identified tasks. | Query, response, (optional) tool_definitions|
91
+
| Task Adherence | Measures how well the agent follows through on identified tasks. | Query, response, tool definitions (optional) |
92
92
| Tool Call Accuracy | Measures how well the agent selects and calls the correct tools to.| Query, either response or tool calls, tool definitions |
93
93
94
94
To learn more, see [Agent evaluators](./evaluation-evaluators/agent-evaluators.md).
0 commit comments