Commit e453b20
update tool accuracy for new behavior around built-in tools (#40829)
* update tool accuracy for new behavior around built-in tools
* include changes for converter
* fix imports
* test tool definition conversion
* move constants to base eval
* check yourself
* assistant_id --> agent_id
* off by one version, fix
---------
Co-authored-by: spon <[email protected]>
1 parent 9fe6528 commit e453b20
File tree
7 files changed, +749 -58 lines changed
- sdk/evaluation/azure-ai-evaluation
  - azure/ai/evaluation
    - _converters
    - _evaluators
      - _common
      - _tool_call_accuracy
  - tests
    - converters/ai_agent_converter
    - unittests
Lines changed: 3 additions & 41 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 18 | 18 | 1 line replaced |
| 35-74 | | 40 lines removed |
| | 165 | 1 line added |
| | 177 | 1 line added |
Lines changed: 65 additions & 3 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | 24-74 | 51 lines added |
| | 152-153 | 2 lines added |
| | 161 | 1 line added |
| | 248-251 | 4 lines added |
| 220 | 278 | 1 line replaced |
| 222 | 280 | 1 line replaced |
| 224 | 282 | 1 line replaced |
| | 292-295 | 4 lines added |
Lines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | 89-92 | 4 lines added |
Lines changed: 70 additions & 10 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 217-222 | 217-228 | 6 lines replaced by 12 |
| | 240-248 | 9 lines added |
| 237 | 252-255 | 1 line replaced by 4 |
| | 259-288 | 30 lines added |
| 263 | 311-324 | 1 line replaced by 14 |
| 265 | 326 | 1 line replaced |
| 267 | | 1 line removed |