You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/how-to/evaluation-github-action.md
+44-10Lines changed: 44 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,7 +5,7 @@ description: How to run evaluation in GitHub Action to streamline the evaluation
5
5
manager: scottpolly
6
6
ms.service: azure-ai-foundry
7
7
ms.topic: how-to
8
-
ms.date: 05/28/2025
8
+
ms.date: 06/1/2025
9
9
ms.reviewer: hanch
10
10
ms.author: lagayhar
11
11
author: lgayhardt
@@ -47,11 +47,11 @@ The input of ai-agent-evals includes:
47
47
48
48
# [Foundry project](#tab/foundry-project)
49
49
50
-
-`azure-ai-project-endpoint`: The endpoint of the Azure AI project. This is used to connect to Azure OpenAI to simulate conversations with each agent, and to connect to the Azure AI evaluation SDK to perform the evaluation.
50
+
-`azure-ai-project-endpoint`: The endpoint of the Azure AI project. This is used to connect to your AI project to simulate conversations with each agent, and to connect to the Azure AI evaluation SDK to perform the evaluation.
51
51
52
52
# [Hub based project](#tab/hub-project)
53
53
54
-
-`azure-aiproject-connection-string`: The connection string of the Azure AI project. This is used to connect to Azure OpenAI to simulate conversations with each agent, and to connect to the Azure AI evaluation SDK to perform the evaluation.
54
+
-`azure-aiproject-connection-string`: The connection string of the Azure AI project. This is used to connect to your AI project to simulate conversations with each agent, and to connect to the Azure AI evaluation SDK to perform the evaluation.
55
55
56
56
---
57
57
-`deployment-name`: the deployed model name for evaluation judgement.
@@ -96,20 +96,52 @@ Here's a sample of the dataset:
96
96
97
97
To use the GitHub Action, add the GitHub Action to your CI/CD workflows and specify the trigger criteria (for example, on commit) and file paths to trigger your automated workflows.
98
98
99
+
> [!TIP]
100
+
> To minimize costs, you should avoid running evaluation on every commit.
101
+
102
+
This example illustrates how Azure Agent AI Evaluation can be run when comparing different agents with agent IDs.
103
+
99
104
# [Foundry project](#tab/foundry-project)
100
105
101
-
Specify v2-beta.
106
+
```YAML
107
+
name: "AI Agent Evaluation"
102
108
103
-
# [Hub based project](#tab/hub-project)
109
+
on:
110
+
workflow_dispatch:
111
+
push:
112
+
branches:
113
+
- main
104
114
105
-
Specify v1-beta.
115
+
permissions:
116
+
id-token: write
117
+
contents: read
106
118
107
-
---
119
+
jobs:
120
+
run-action:
121
+
runs-on: ubuntu-latest
122
+
steps:
123
+
- name: Checkout
124
+
uses: actions/checkout@v4
108
125
109
-
> [!TIP]
110
-
> To minimize costs, you should avoid running evaluation on every commit.
0 commit comments