fixes

lgayhardt · lgayhardt · commit d7f7efdd1469 · 2024-03-28T01:07:15.000-07:00
diff --git a/articles/ai-studio/concepts/evaluation-metrics-built-in.md b/articles/ai-studio/concepts/evaluation-metrics-built-in.md
@@ -81,8 +81,8 @@ We support the following AI-Assisted metrics for the above task types:
 
 | Task type | Question and Generated Answers Only (No context or ground truth needed)  | Question and Generated Answers + Context | Question and Generated Answers + Context + Ground Truth  |
 | --- | --- | --- | --- |
-| Question Answering | - Risk and safety metrics (all AI-Assisted): hateful and unfair content defect rate, sexual content defect rate, violent content defect rate, self-harm-related content defect rate, and jailbreak defect rate <br> - Generation quality metrics (all AI-Assisted): Coherence, Fluency |Previous Column Metrics <br> + <br> Generation quality metrics (all AI-Assisted): <br> - Groundedness <br> - Relevance |Previous Column Metrics <br> + <br> Generation quality metrics: <br> Similarity (AI-assisted) <br> F1-Score (traditional ML metric) |
-| Conversation | - Risk and safety metrics (all AI-Assisted): hateful and unfair content defect rate, sexual content defect rate, violent content defect rate, self-harm-related content defect rate, and jailbreak defect rate <br> - Generation quality metrics (all AI-Assisted): Coherence, Fluency | Previous Column Metrics <br> + <br> Generation quality metrics (all AI-Assisted): <br> - Groundedness <br> - Retrieval Score | N/A |
+| [Question Answering](#question-answering-single-turn) | - Risk and safety metrics (all AI-Assisted): hateful and unfair content defect rate, sexual content defect rate, violent content defect rate, self-harm-related content defect rate, and jailbreak defect rate <br> - Generation quality metrics (all AI-Assisted): Coherence, Fluency |Previous Column Metrics <br> + <br> Generation quality metrics (all AI-Assisted): <br> - Groundedness <br> - Relevance |Previous Column Metrics <br> + <br> Generation quality metrics: <br> Similarity (AI-assisted) <br> F1-Score (traditional ML metric) |
+| [Conversation](#conversation-single-turn-and-multi-turn) | - Risk and safety metrics (all AI-Assisted): hateful and unfair content defect rate, sexual content defect rate, violent content defect rate, self-harm-related content defect rate, and jailbreak defect rate <br> - Generation quality metrics (all AI-Assisted): Coherence, Fluency | Previous Column Metrics <br> + <br> Generation quality metrics (all AI-Assisted): <br> - Groundedness <br> - Retrieval Score | N/A |
 
 > [!NOTE]
 > While we are providing you with a comprehensive set of built-in metrics that facilitate the easy and efficient evaluation of the quality and safety of your generative AI application, it is best practice to adapt and customize them to your specific task types. Furthermore, we empower you to introduce entirely new metrics, enabling you to measure your applications from fresh angles and ensuring alignment with your unique objectives.
diff --git a/articles/ai-studio/how-to/evaluate-flow-results.md b/articles/ai-studio/how-to/evaluate-flow-results.md
@@ -79,7 +79,7 @@ Evaluation results might have different meanings for different audiences. For ex
 
 When understanding each content risk metric, you can easily view each metric definition and severity scale by selecting the metric name above the chart to see a detailed explanation in a pop-up.
 
-:::image type="content" source="../media/evaluations/view-results/ risk-safety-metric-definition-popup.png" alt-text="Screenshot of risk and safety metrics detailed explanation pop-up." lightbox="../media/evaluations/view-results/ risk-safety-metric-definition-popup.png":::
+:::image type="content" source="../media/evaluations/view-results/risk-safety-metric-definition-popup" alt-text="Screenshot of risk and safety metrics detailed explanation pop-up." lightbox="../media/evaluations/view-results/risk-safety-metric-definition-popup.png":::
 
 If there's something wrong with the run, you can also debug your evaluation run with the log and trace. 
 
diff --git a/articles/ai-studio/includes/evaluations/from-data/python.md b/articles/ai-studio/includes/evaluations/from-data/python.md
@@ -40,7 +40,7 @@ When using AI-assisted performance and quality metrics, you must specify a GPT m
 When using AI-assisted risk and safety metrics, you do not need to provide a connection and deployment. The Azure AI Studio safety evaluations back-end service provisions a GPT-4 model that can generate content risk severity scores and reasoning to enable you to evaluate your application for content harms. 
 
 > [!NOTE]
-> Currently AI-assisted risk and safety metrics are only available in the following regions: East US 2, France Central, UK South, Sweden Central. Groundedness measurement leveraging Azure AI Content Safety Groundedness Detection is only supported following regions: East US 2 and Sweden Central. Read more about the [supported metrics](../../../concepts/evaluation-metrics-built-in.md#metrics-for-single-turn-question-answering-without-retrieval-non-rag) and when to use which metric.
+> Currently AI-assisted risk and safety metrics are only available in the following regions: East US 2, France Central, UK South, Sweden Central. Groundedness measurement leveraging Azure AI Content Safety Groundedness Detection is only supported following regions: East US 2 and Sweden Central. Read more about the [supported metrics](../../../concepts/evaluation-metrics-built-in.md) and when to use which metric.
 
 ### Supported input data format for question answering
 
diff --git a/articles/ai-studio/includes/evaluations/from-data/studio.md b/articles/ai-studio/includes/evaluations/from-data/studio.md
@@ -146,7 +146,7 @@ For guidance on the specific data mapping requirements for each metric, refer to
 | Violent content            | Required: list |
 | Sexual content             | Required: list |
 
-Messages: message key that follows the chat protocol format defined by Azure Open AI for [conversations](../../../concepts/evaluation-metrics-built-in.md#Conversation-single-turn-and-multi-turn). For Groundedness, Relevance and Retrieval score, the citations key is required within your messages list.
+Messages: message key that follows the chat protocol format defined by Azure Open AI for [conversations](../../../concepts/evaluation-metrics-built-in.md#conversation-single-turn-and-multi-turn). For Groundedness, Relevance and Retrieval score, the citations key is required within your messages list.
 
 #### Review and finish