articles/machine-learning/prompt-flow/how-to-evaluate-semantic-kernel.md
> [!IMPORTANT]
> Prior to developing the flow, it's essential to install the [Semantic Kernel package](/semantic-kernel/get-started/quick-start-guide/?toc=%2Fsemantic-kernel%2Ftoc.json&tabs=python) in the runtime environment of your executor.
To learn more, see [Customize environment for runtime](./how-to-customize-environment-runtime.md).
> [!IMPORTANT]
> The approach to consuming OpenAI or Azure OpenAI in Semantic Kernel is to obtain the keys you specified in environment variables or stored in a `.env` file.
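As a minimal sketch of that pattern, the helper below reads OpenAI or Azure OpenAI settings from environment variables. The variable names here are illustrative assumptions; check the Semantic Kernel quick-start for the exact names your kernel version expects.

```python
import os


def load_openai_settings(env=os.environ):
    """Read OpenAI / Azure OpenAI settings from environment variables.

    The variable names are illustrative; align them with what your
    Semantic Kernel version actually reads (e.g. from a .env file
    loaded into the environment).
    """
    settings = {
        "api_key": env.get("OPENAI_API_KEY"),
        # The two entries below apply to Azure OpenAI only.
        "endpoint": env.get("AZURE_OPENAI_ENDPOINT"),
        "deployment": env.get("AZURE_OPENAI_DEPLOYMENT"),
    }
    if not settings["api_key"]:
        raise RuntimeError("OPENAI_API_KEY is not set")
    return settings
```

Because the executor resolves these values at run time, the same flow code works locally and in the runtime environment, as long as the variables (or the `.env` file) are present.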
For example, we can create a flow with a Semantic Kernel planner that solves math problems. Follow this [documentation](/semantic-kernel/ai-orchestration/planners/evaluate-and-deploy-planners/create-a-prompt-flow-with-semantic-kernel) for the steps to create a simple Prompt flow with Semantic Kernel at its core.
:::image type="content" source="./media/how-to-evaluate-semantic-kernel/semantic-kernel-flow.png" alt-text="Screenshot of creating a flow with semantic kernel planner." lightbox = "./media/how-to-evaluate-semantic-kernel/semantic-kernel-flow.png":::
:::image type="content" source="./media/how-to-evaluate-semantic-kernel/set-connection-in-python.png" alt-text="Screenshot of setting custom connection in python node." lightbox = "./media/how-to-evaluate-semantic-kernel/set-connection-in-python.png":::
Select the connection object in the node input, and set the model name (for OpenAI) or deployment name (for Azure OpenAI).
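Inside the Python node, the flow code then resolves those inputs before calling the model. The sketch below uses a plain mapping as a stand-in for the connection object; the real Prompt flow connection may expose these values as attributes or secrets instead, so treat the field names as assumptions.

```python
def resolve_model_settings(connection: dict) -> dict:
    """Pull the API key and model/deployment name from a connection-like
    mapping. A stand-in for the real Prompt flow connection object, whose
    exact shape differs; the field names here are illustrative."""
    api_key = connection.get("api_key")
    # OpenAI identifies the target by model name; Azure OpenAI by
    # deployment name. Accept either field.
    model = connection.get("model") or connection.get("deployment")
    if not api_key or not model:
        raise ValueError("connection must provide api_key and model/deployment")
    return {"api_key": api_key, "model": model}
```

Validating the connection inputs up front like this makes a misconfigured node fail fast with a clear error instead of deep inside the planner run.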
:::image type="content" source="./media/how-to-evaluate-semantic-kernel/set-key-model.png" alt-text="Screenshot of setting model and key in node input." lightbox = "./media/how-to-evaluate-semantic-kernel/set-key-model.png":::
### Batch testing your plugins and planners
Instead of manually testing different scenarios one by one, you can now automatically run large batches of tests using Prompt flow and benchmark data.
:::image type="content" source="./media/how-to-evaluate-semantic-kernel/using-batch-runs-with-prompt-flow.png" alt-text="Screenshot of batch runs with prompt flow for Semantic kernel." lightbox = "./media/how-to-evaluate-semantic-kernel/using-batch-runs-with-prompt-flow.png":::
Once the flow has passed the single test run in the previous step, you can create a batch test in Prompt flow by following these steps:
1. Create benchmark data in a *jsonl* file that contains a list of JSON objects, each with the input and the correct ground truth.
1. Select *Batch run* to create a batch test.
1. Complete the batch run settings, especially the data part.
1. Submit the run without evaluation (for this specific batch test, the *Evaluation step* can be skipped).
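To illustrate step 1, the snippet below writes a small benchmark *jsonl* file for math word problems: one JSON object per line, holding a flow input and its ground truth. The field names `question` and `answer` are assumptions for illustration; match them to your flow's actual input and output names.

```python
import json

# Each line of the jsonl file is one JSON object pairing a flow input
# with the correct ground truth. Field names are illustrative.
benchmark = [
    {"question": "What is 12 * 7?", "answer": "84"},
    {"question": "A train travels 60 km in 1.5 hours. What is its speed in km/h?",
     "answer": "40"},
]

with open("math_benchmark.jsonl", "w", encoding="utf-8") as f:
    for row in benchmark:
        f.write(json.dumps(row) + "\n")

# Read it back the way a batch run consumes it: one object per line.
with open("math_benchmark.jsonl", encoding="utf-8") as f:
    rows = [json.loads(line) for line in f]
```

In the batch run settings, you then map each jsonl field to the corresponding flow input.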
In our [Running batches with Prompt flow](/semantic-kernel/ai-orchestration/planners/evaluate-and-deploy-planners/running-batches-with-prompt-flow?tabs=gpt-35-turbo) documentation, we demonstrate how you can use this functionality to run batch tests on a planner that uses a math plugin. By defining a set of word problems, we can quickly test any changes we make to our plugins or planners and catch regressions early and often.
:::image type="content" source="./media/how-to-evaluate-semantic-kernel/semantic-kernel-test-data.png" alt-text="Screenshot of data of batch runs with prompt flow for Semantic kernel." lightbox = "./media/how-to-evaluate-semantic-kernel/semantic-kernel-test-data.png":::
In your workspace, you can go to the **Run list** in Prompt flow, select the **Details** button, and then select the **Output** tab to view the batch run result.
:::image type="content" source="./media/how-to-evaluate-semantic-kernel/run.png" alt-text="Screenshot of the run list." lightbox = "./media/how-to-evaluate-semantic-kernel/run.png":::
:::image type="content" source="./media/how-to-evaluate-semantic-kernel/run-detail.png" alt-text="Screenshot of the run detail." lightbox = "./media/how-to-evaluate-semantic-kernel/run-detail.png":::
:::image type="content" source="./media/how-to-evaluate-semantic-kernel/run-output.png" alt-text="Screenshot of the run output." lightbox = "./media/how-to-evaluate-semantic-kernel/run-output.png":::
### Evaluating the accuracy
Once a batch run completes, you need an easy way to determine whether the test results are adequate. This information can then be used to develop accuracy scores, which can be improved incrementally.
:::image type="content" source="./media/how-to-evaluate-semantic-kernel/evaluation-batch-run-with-prompt-flow.png" alt-text="Screenshot of evaluating batch run with prompt flow." lightbox = "./media/how-to-evaluate-semantic-kernel/evaluation-batch-run-with-prompt-flow.png":::
Evaluation flows in Prompt flow enable this functionality. Using the sample evaluation flows offered by Prompt flow, you can assess various metrics such as **classification accuracy**, **perceived intelligence**, **groundedness**, and more.
Follow this [documentation](/semantic-kernel/ai-orchestration/planners/evaluate-and-deploy-planners/evaluating-plugins-and-planners-with-prompt-flow?tabs=gpt-35-turbo) for Semantic Kernel to learn how to use the [math accuracy evaluation flow](https://github.com/microsoft/promptflow/tree/main/examples/flows/evaluation/eval-accuracy-maths-to-code) to test how well your planner solves word problems.
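At its core, an accuracy-style evaluation compares each batch-run answer against its ground truth and reports the fraction that match. The stand-alone sketch below shows that idea with a simple exact-match comparison; the real math accuracy flow normalizes numeric answers more carefully, so treat this as a simplified stand-in, not the flow's actual logic.

```python
def score_accuracy(predictions, ground_truths):
    """Fraction of predictions that exactly match the ground truth after
    trimming whitespace. A simplified stand-in for the sample math
    accuracy evaluation flow, which normalizes answers more carefully."""
    if len(predictions) != len(ground_truths):
        raise ValueError("predictions and ground truths must align one-to-one")
    correct = sum(
        p.strip() == g.strip() for p, g in zip(predictions, ground_truths)
    )
    return correct / len(predictions)
```

Tracking this single number across batch runs is what lets you catch a regression as soon as a plugin or planner change lowers it.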
To check the metrics, go back to the batch run detail page, select the **Details** button, select the **Output** tab, and then select the evaluation run name in the dropdown list to view the evaluation result.