Commit c3bd145
Modify max_completion_tokens parameter for tool call accuracy evaluator (#42544)
* support 5 levels, evaluate all tools at once
* Update sample notebook and change log
* Add missing import
* Modify test cases to match the new output format
* Modify other test file to match the new output format
* Fixed parsing of results
* Change key name in output
* Spell check fixes
* Minor prompt update
* Update result key to tool_call_accuracy
* Delete test_new_evaluator.ipynb
* Added field names and messages as constants
* Additional note in prompt
* Re-add the temperature to the prompty file
* Removed 'applicable' field and print statement
* Move excess/missing tool calls fields under additional details
* Typo fix and removal of redundant field in the prompt
* Modify per_tool_call_details field's name to details
* Revert "Modify per_tool_call_details field's name to details"
This reverts commit 2c3ce50.
* Revert "Merge branch 'main' into selshafey/improve_tool_call_accuracy"
* Black reformat
* Reformat with black
* To re-trigger build pipelines
* Add notebook for bugbash
* modify bugbash notebook
* Add support for built-in tools for Tool Call Accuracy Evaluator
* Remove bugbash notebook
* Resolve issues with merge
* Fix id value
* Use existing built-in tool definitions
* Run black
* Prompt modifications
* Add test cases for built-in tools
* Handle converter format
* Add test cases for converter format
* Support only converter format
* Revert tool definitions to be required, run black
* Change max_completion_tokens to max_tokens
* merged with main
* re-merge with main
* run black
---------
Co-authored-by: Salma Elshafey <[email protected]>
1 parent d0d6eb9 commit c3bd145
1 file changed: 2 additions and 1 deletion
sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_tool_call_accuracy
Diff (line content not captured): original line 7 removed; new lines 7 and 8 added. Original lines 4-6 and 8-10 are unchanged context.