Acrolinx

ttorble · web-flow · commit aba3ae3fad36 · 2024-04-02T11:40:44.000+01:00
diff --git a/articles/machine-learning/prompt-flow/how-to-bulk-test-evaluate-flow.md b/articles/machine-learning/prompt-flow/how-to-bulk-test-evaluate-flow.md
@@ -126,7 +126,7 @@ You can select **Evaluate** to start another round of evaluation.
 
 After setting up the configuration, you can select **"Submit"** for this new round of evaluation. After submission, you'll be able to see a new record in the prompt flow run list.
 
-After the evaluation run completed, similarly, you can check the result of evaluation in the **"Outputs"** tab of the batch run detail panel. You need select the new evaluation run to view its result.
+After the evaluation run completes, you can similarly check the evaluation result in the **"Outputs"** tab of the batch run detail panel. You need to select the new evaluation run to view its result.
 
 :::image type="content" source="./media/how-to-bulk-test-evaluate-flow/batch-run-detail-output-new-evaluation.png" alt-text="Screenshot of batch run detail page on the output tab with checking the new evaluation output." lightbox = "./media/how-to-bulk-test-evaluate-flow/batch-run-detail-output-new-evaluation.png":::
 
@@ -183,7 +183,7 @@ System message, sometimes referred to as a metaprompt or [system prompt](../../c
 
 ## Further reading: Guidance for creating Golden Datasets used for Copilot quality assurance
 
-The creation of copilot that use Large Language Models (LLMs) typically involves grounding the model in reality using source datasets. However, to ensure that the LLMs provide the most accurate and useful responses to customer queries, a "Golden Dataset" is necessary.
+The creation of a copilot that uses Large Language Models (LLMs) typically involves grounding the model in reality using source datasets. However, to ensure that the LLMs provide the most accurate and useful responses to customer queries, a "Golden Dataset" is necessary.
 
 A Golden Dataset is a collection of realistic customer questions and expertly crafted answers. It serves as a Quality Assurance tool for LLMs used by your copilot. Golden Datasets are not used to train an LLM or inject context into an LLM prompt. Instead, they are utilized to assess the quality of the answers generated by the LLM.