articles/ai-foundry/how-to/evaluation-github-action.md
Lines changed: 2 additions & 2 deletions
@@ -49,7 +49,7 @@ The input of ai-agent-evals includes:
 - `deployment-name`: the deployed model name.
 - `data-path`: Path to the input data file containing the conversation starters. Each conversation starter is sent to each agent for a pairwise comparison of evaluation results.
 - `evaluators`: built-in evaluator names.
-- `data`: a set of conversation starters/queries and ground truth. Ground-truth is optional and only required for a subset of evaluators. (See which [evaluator requires ground-truth](./develop/evaluate-sd.md#data-requirements-for-built-in-evaluators))
+- `data`: a set of conversation starters/queries and ground truth. Ground-truth is optional and only required for a subset of evaluators. (See which [evaluator requires ground-truth](./develop/evaluate-sdk.md#data-requirements-for-built-in-evaluators))
 - Only single agent turn is supported.
 - `agent-ids`: a unique identifier for the agent and comma-separated list of agent IDs to evaluate.
 - When only one `agent-id` is specified, the evaluation results include the absolute values for each metric along with the corresponding confidence intervals.
@@ -159,7 +159,7 @@ The input of genai-evals includes (some of them are optional depending on the ev
 
 Evaluation configuration file:
 
-- `data`: a set of queries and ground truth. Ground-truth is optional and only required for a subset of evaluators. (See which [evaluator requires ground-truth](./develop/evaluate-sd.md#data-requirements-for-built-in-evaluators)).
+- `data`: a set of queries and ground truth. Ground-truth is optional and only required for a subset of evaluators. (See which [evaluator requires ground-truth](./develop/evaluate-sdk.md#data-requirements-for-built-in-evaluators)).
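
Both hunks fix the same relative link target (`evaluate-sd.md` to `evaluate-sdk.md`). A minimal sketch of how a broken target like this could be caught before merge is shown here. It assumes links are plain inline Markdown links and that relative targets resolve against the directory of the file containing them. The function names `relative_link_targets` and `broken_links` are hypothetical and are not part of this repository or its tooling.

```python
import re
from pathlib import Path

# Matches inline Markdown links of the form [text](target).
LINK_RE = re.compile(r"\[[^\]]*\]\(([^)\s]+)\)")

# Matches a URL scheme prefix such as "https:" or "mailto:".
SCHEME_RE = re.compile(r"[a-zA-Z][a-zA-Z0-9+.-]*:")


def relative_link_targets(markdown_text):
    """Return the file part of each relative link, with any #anchor removed."""
    targets = []
    for target in LINK_RE.findall(markdown_text):
        if SCHEME_RE.match(target):
            continue  # absolute URL, not a file in the repository
        path = target.split("#", 1)[0]
        if path:  # a bare "#anchor" points into the same file
            targets.append(path)
    return targets


def broken_links(markdown_file):
    """List relative link targets in markdown_file that do not exist on disk."""
    markdown_file = Path(markdown_file)
    text = markdown_file.read_text(encoding="utf-8")
    return [
        target
        for target in relative_link_targets(text)
        if not (markdown_file.parent / target).exists()
    ]
```

Calling `broken_links("articles/ai-foundry/how-to/evaluation-github-action.md")` from the repository root would list targets such as `./develop/evaluate-sd.md` if no such file exists. The sketch checks only that the file exists, not that the `#data-requirements-for-built-in-evaluators` anchor is present in it.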