Skip to content

Commit d6d0823

Browse files
committed
file from Lauryn
1 parent 498432e commit d6d0823

File tree

1 file changed

+29
-29
lines changed

1 file changed

+29
-29
lines changed

articles/ai-foundry/includes/evaluation-github-action-azure-devops-features.md

Lines changed: 29 additions & 29 deletions
Original file line numberDiff line numberDiff line change
@@ -19,36 +19,36 @@ ms.custom: include file
1919

2020
| Category | Evaluator class/Metrics | AI Agent evaluations | GenAI evaluations |
2121
|--|--|--|--|
22-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `GroundednessEvaluator` | Not Supported | Supported |
23-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `GroundednessProEvaluator` | Not Supported | Supported |
24-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `RetrievalEvaluator` | Not Supported | Supported |
25-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `RelevanceEvaluator` | Supported | Supported |
26-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `CoherenceEvaluator` | Supported | Supported |
27-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `FluencyEvaluator` | Supported | Supported |
28-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `SimilarityEvaluator` | Not Supported | Supported |
29-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `IntentResolutionEvaluator` | Supported | Supported |
30-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `TaskAdherenceEvaluator` | Supported | Supported |
31-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `ToolCallAccuracyEvaluator` | Not Supported | Not Supported |
32-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `ResponseCompletenessEvaluator` | Not Supported | Supported |
33-
| [Performance and quality (AI-assisted)](../how-to/develop/evaluate-sdk.md) | `DocumentRetrievalEvaluator` | Not Supported | Not Supported |
34-
| [Performance and quality (NLP)](../how-to/develop/evaluate-sdk.md) | `F1ScoreEvaluator` | Not Supported | Supported |
35-
| [Performance and quality (NLP)](../how-to/develop/evaluate-sdk.md) | `RougeScoreEvaluator` | Not Supported | Not Supported |
36-
| [Performance and quality (NLP)](../how-to/develop/evaluate-sdk.md) | `GleuScoreEvaluator` | Not Supported | Supported |
37-
| [Performance and quality (NLP)](../how-to/develop/evaluate-sdk.md) | `BleuScoreEvaluator ` | Not Supported | Supported |
38-
| [Performance and quality (NLP)](../how-to/develop/evaluate-sdk.md) | `MeteorScoreEvaluator` | Not Supported | Supported |
39-
| [Risk and safety (AI-assisted)](../how-to/develop/evaluate-sdk.m) | `ViolenceEvaluator` | Supported | Supported |
40-
| [Risk and safety (AI-assisted)](../how-to/develop/evaluate-sdk.m) | `SexualEvaluator` | Supported | Supported |
41-
| [Risk and safety (AI-assisted)](../how-to/develop/evaluate-sdk.m) | `SelfHarmEvaluator` | Supported | Supported |
42-
| [Risk and safety (AI-assisted)](../how-to/develop/evaluate-sdk.m) | `HateUnfairnessEvaluator` | Supported | Supported |
43-
| [Risk and safety (AI-assisted)](../how-to/develop/evaluate-sdk.m) | `IndirectAttackEvaluator` | Supported | Supported |
44-
| [Risk and safety (AI-assisted)](../how-to/develop/evaluate-sdk.m) | `ProtectedMaterialEvaluator` | Supported | Supported |
45-
| [Risk and safety (AI-assisted)](../how-to/develop/evaluate-sdk.m) | `CodeVulnerabilityEvaluator` | Supported | Supported |
46-
| [Risk and safety (AI-assisted)](../how-to/develop/evaluate-sdk.m) | `UngroundedAttributesEvaluator` | Not Supported | Supported |
47-
| [Composite](../how-to/develop/evaluate-sdk.md#composite-evaluators) | `QAEvaluator` | Not Supported | Supported |
48-
| [Composite](../how-to/develop/evaluate-sdk.md#composite-evaluators) | `ContentSafetyEvaluator` | Supported | Supported |
49-
| [Composite](../how-to/develop/evaluate-sdk.md#composite-evaluators) | `AgentOverallEvaluator` | Not Supported | Not Supported |
22+
| Performance and quality (AI-assisted) | `GroundednessEvaluator` | Not Supported | Supported |
23+
| Performance and quality (AI-assisted) | [`GroundednessProEvaluator`](../concepts/evaluation-evaluators/rag-evaluators#groundedness-pro) | Not Supported | Supported |
24+
| Performance and quality (AI-assisted) | `RetrievalEvaluator` | Not Supported | Supported |
25+
| Performance and quality (AI-assisted)| `RelevanceEvaluator` | Supported | Supported |
26+
| Performance and quality (AI-assisted) | `CoherenceEvaluator` | Supported | Supported |
27+
| Performance and quality (AI-assisted) | `FluencyEvaluator` | Supported | Supported |
28+
| Performance and quality (AI-assisted)| `SimilarityEvaluator` | Not Supported | Supported |
29+
| Performance and quality (AI-assisted) | `IntentResolutionEvaluator` | Supported | Supported |
30+
| Performance and quality (AI-assisted)| `TaskAdherenceEvaluator` | Supported | Supported |
31+
| Performance and quality (AI-assisted) | `ToolCallAccuracyEvaluator` | Not Supported | Not Supported |
32+
| Performance and quality (AI-assisted) | `ResponseCompletenessEvaluator` | Not Supported | Supported |
33+
| Performance and quality (AI-assisted) | `DocumentRetrievalEvaluator` | Not Supported | Not Supported |
34+
| Performance and quality (NLP) | `F1ScoreEvaluator` | Not Supported | Supported |
35+
| Performance and quality (NLP) | `RougeScoreEvaluator` | Not Supported | Not Supported |
36+
| Performance and quality (NLP) | `GleuScoreEvaluator` | Not Supported | Supported |
37+
| Performance and quality (NLP) | `BleuScoreEvaluator ` | Not Supported | Supported |
38+
| Performance and quality (NLP) | `MeteorScoreEvaluator` | Not Supported | Supported |
39+
| Risk and safety (AI-assisted)| `ViolenceEvaluator` | Supported | Supported |
40+
| Risk and safety (AI-assisted) | `SexualEvaluator` | Supported | Supported |
41+
| Risk and safety (AI-assisted) | `SelfHarmEvaluator` | Supported | Supported |
42+
| Risk and safety (AI-assisted)| `HateUnfairnessEvaluator` | Supported | Supported |
43+
| Risk and safety (AI-assisted)| `IndirectAttackEvaluator` | Supported | Supported |
44+
| Risk and safety (AI-assisted)| `ProtectedMaterialEvaluator` | Supported | Supported |
45+
| Risk and safety (AI-assisted)| `CodeVulnerabilityEvaluator` | Supported | Supported |
46+
| Risk and safety (AI-assisted)| `UngroundedAttributesEvaluator` | Not Supported | Supported |
47+
| Composite| `QAEvaluator` | Not Supported | Supported |
48+
| Composite | `ContentSafetyEvaluator` | Supported | Supported |
49+
| Composite| `AgentOverallEvaluator` | Not Supported | Not Supported |
5050
| Operational metrics | Client run duration | Supported | Not Supported |
5151
| Operational metrics | Server run duration | Supported | Not Supported |
5252
| Operational metrics | Completion tokens | Supported | Not Supported |
5353
| Operational metrics | Prompt tokens | Supported | Not Supported |
54-
| [Custom evaluators](../how-to/develop/evaluate-sdk.md#custom-evaluators) | | Not Supported | Not Supported |
54+
| [Custom evaluators](../concepts/evaluation-evaluators/custom-evaluators.md) | | Not Supported | Not Supported |

0 commit comments

Comments
 (0)