
Reasoning support for evaluators #42482


Open · wants to merge 79 commits into main

Conversation

nagkumar91
Member

@nagkumar91 nagkumar91 commented Aug 12, 2025

Description

Please add an informative description that covers the changes made by the pull request and link all relevant issues.

If an SDK is being regenerated based on a new API spec, a link to the pull request containing these API spec changes should be included above.

All SDK Contribution checklist:

  • The pull request does not introduce breaking changes.
  • CHANGELOG is updated for new features, bug fixes or other significant changes.
  • I have read the contribution guidelines.

General Guidelines and Best Practices

  • Title of the pull request is clear and informative.
  • There are a small number of commits, each of which has an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

  • Pull request includes test coverage for the included changes.

@Copilot Copilot AI review requested due to automatic review settings August 12, 2025 15:40
@nagkumar91 nagkumar91 requested a review from a team as a code owner August 12, 2025 15:40
@github-actions github-actions bot added the Evaluation label (Issues related to the client library for Azure AI Evaluation) Aug 12, 2025
Copilot

This comment was marked as outdated.


github-actions bot commented Aug 12, 2025

API Change Check

APIView identified API-level changes in this PR and created the following API reviews:

azure-ai-evaluation

Contributor

@Copilot Copilot AI left a comment


Pull Request Overview

This PR adds support for reasoning models to evaluators by introducing an is_reasoning_model keyword parameter. When set, this parameter updates the evaluator configuration appropriately for reasoning models, enabling better integration with Azure OpenAI's reasoning capabilities (see the usage sketch after the key changes below).

Key Changes:

  • Added is_reasoning_model parameter to all evaluators' constructors
  • Updated QAEvaluator to propagate this parameter to child evaluators
  • Added defensive parameter checking in GroundednessEvaluator for backward compatibility
  • Updated documentation across evaluators to describe the new parameter
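
By way of illustration, a minimal usage sketch, assuming the flag is exposed on the public evaluator constructors as described; the model_config values and the deployment name are placeholders, not from this PR:

# Minimal usage sketch for the new flag; all config values are placeholders.
from azure.ai.evaluation import GroundednessEvaluator

model_config = {
    "azure_endpoint": "https://<your-resource>.openai.azure.com",
    "azure_deployment": "<reasoning-model-deployment>",
    "api_key": "<your-api-key>",
}

# Per this PR, is_reasoning_model=True adjusts the evaluator's prompty
# configuration for reasoning models.
evaluator = GroundednessEvaluator(model_config, is_reasoning_model=True)

result = evaluator(
    query="What is the capital of France?",
    response="Paris is the capital of France.",
    context="France's capital city is Paris.",
)
print(result)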

Reviewed Changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 2 comments.

Summary per file:

_similarity/_similarity.py: Added is_reasoning_model parameter and updated docstrings
_retrieval/_retrieval.py: Added is_reasoning_model parameter support
_response_completeness/_response_completeness.py: Added is_reasoning_model parameter and improved formatting
_relevance/_relevance.py: Added is_reasoning_model parameter support
_qa/_qa.py: Updated to propagate is_reasoning_model to child evaluators
_groundedness/_groundedness.py: Added parameter support with backward-compatibility checks
_fluency/_fluency.py: Added is_reasoning_model parameter and updated docstrings
_base_prompty_eval.py: Updated to pass is_reasoning_model to AsyncPrompty.load
_base_multi_eval.py: Minor import formatting improvement
_coherence/_coherence.py: Added is_reasoning_model parameter and updated docstrings
CHANGELOG.md: Documented the new feature and bug fix
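
As a rough sketch of the _base_prompty_eval.py change noted above, the flag plausibly gates which model parameters are passed to AsyncPrompty.load. This is a hypothetical illustration, not the PR's actual code; the parameter names (max_completion_tokens, temperature, max_tokens) are assumptions based on how reasoning models differ from standard chat models:

# Hypothetical sketch; not the PR's actual implementation.
from promptflow.core import AsyncPrompty

def _load_prompty(prompty_path: str, model_config: dict, is_reasoning_model: bool = False):
    # Assumption: reasoning models reject sampling knobs such as temperature
    # and take max_completion_tokens instead of max_tokens.
    parameters = (
        {"max_completion_tokens": 800}
        if is_reasoning_model
        else {"temperature": 0.0, "max_tokens": 800}
    )
    return AsyncPrompty.load(
        source=prompty_path,
        model={"configuration": model_config, "parameters": parameters},
    )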


@@ -282,4 +304,4 @@ def _get_context_from_agent_response(self, response, tool_definitions):
             logger.debug(f"Error extracting context from agent response : {str(ex)}")
             context = ""

-        return context if context else None
+        return context

Copilot AI Aug 12, 2025


The function _get_context_from_agent_response should return None when context is empty, not an empty string. The original code returned context if context else None, which properly handles the case where no context is found. Returning an empty string may cause issues in downstream processing that expects None for missing context.

Suggested change:
-        return context
+        return context if context else None
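
To make the suggestion concrete, a small hypothetical downstream caller (not from this PR) shows why code that expects None for missing context behaves differently when it receives an empty string:

# Hypothetical downstream caller; illustrates the reviewer's point.
def build_eval_payload(query: str, response: str, context) -> dict:
    payload = {"query": query, "response": response}
    # Downstream code that expects None for "no context" typically checks:
    if context is not None:
        # With the PR's `return context`, a missing context arrives as ""
        # and an empty context field is sent; with the suggested
        # `return context if context else None`, this branch is skipped.
        payload["context"] = context
    return payload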


"""

_PROMPTY_FILE = "coherence.prompty"
_RESULT_KEY = "coherence"

id = "azureai://built-in/evaluators/coherence"
"""Evaluator identifier, experimental and to be used only with evaluation in cloud."""
"""Evaluator identifier, experimental to be used only with cloud evaluation"""

Copilot AI Aug 12, 2025


The docstring is missing a comma. It should read 'Evaluator identifier, experimental, to be used only with cloud evaluation' or 'Evaluator identifier (experimental) to be used only with cloud evaluation'.

Suggested change:
-    """Evaluator identifier, experimental to be used only with cloud evaluation"""
+    """Evaluator identifier, experimental, to be used only with cloud evaluation"""


Labels
Evaluation: Issues related to the client library for Azure AI Evaluation

3 participants