Skip to content

Commit 270eda2

Browse files
committed
screenshots
1 parent af2369b commit 270eda2

File tree

6 files changed

+9
-0
lines changed

6 files changed

+9
-0
lines changed

articles/machine-learning/prompt-flow/how-to-bulk-test-evaluate-flow.md

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -44,6 +44,8 @@ A batch run allows you to run your flow with a large dataset and generate output
4444

4545
To start a batch run with evaluation, you can select on the **"Batch run"** button on the top right corner of your flow page.
4646

47+
:::image type="content" source="./media/how-to-bulk-test-evaluate-flow/batch-run-button.png" alt-text="Screenshot of Web Classification with batch run highlighted. " lightbox = "./media/how-to-bulk-test-evaluate-flow/batch-run-button.png":::
48+
4749
To submit batch run, you can select a dataset to test your flow with. You can also select an evaluation method to calculate metrics for your flow output. If you don't want to use an evaluation method, you can skip this step and run the batch run without calculating any metrics. You can also start a new round of evaluation later.
4850

4951
First, you're asked to give your batch run a descriptive and recognizable name. You can also write a description and add tags (key-value pairs) to your batch run. After you finish the configuration, select **"Next"** to continue.
@@ -94,13 +96,16 @@ After submission, you can find the submitted batch run in the run list tab in pr
9496

9597
In the run detail page, you can select **Overview** to check the details of this batch run.
9698

99+
:::image type="content" source="./media/how-to-bulk-test-evaluate-flow/batch-run-overview.png" alt-text="Screenshot of batch run detail page where you view detailed information. " lightbox = "./media/how-to-bulk-test-evaluate-flow/batch-run-overview.png":::
97100

98101
In the overview panel, you can check the metadata of this run. You can also go to the **Outputs** tab in the batch run detail page to check the outputs/responses generated by the flow with the dataset that you provided. You can also select **"Export"** to export and download the outputs in a `.csv` file.
99102

100103
:::image type="content" source="./media/how-to-bulk-test-evaluate-flow/batch-run-detail-output.png" alt-text="Screenshot of batch run detail page on the outputs tab where you check batch run outputs. " lightbox = "./media/how-to-bulk-test-evaluate-flow/batch-run-detail-output.png":::
101104

102105
You can **select an evaluation run** from the dropdown box and you'll see appended columns at the end of the table showing the evaluation result for each row of data. You can locate the result that is falsely predicted with the output column "grade".
103106

107+
:::image type="content" source="./media/how-to-bulk-test-evaluate-flow/batch-run-detail-output-evaluation.png" alt-text="Screenshot of batch run detail page on the outputs tab where evaluation results are appended. " lightbox = "./media/how-to-bulk-test-evaluate-flow/batch-run-detail-output-evaluation.png":::
108+
104109
To view the overall performance, you can select the **Metrics** tab, and you can see various metrics that indicate the quality of each variant.
105110

106111
:::image type="content" source="./media/how-to-bulk-test-evaluate-flow/batch-run-detail-metrics.png" alt-text="Screenshot of batch run detail page on the metrics tab where you check the overall performance in the metrics tab. " lightbox = "./media/how-to-bulk-test-evaluate-flow/batch-run-detail-metrics.png":::
@@ -117,6 +122,8 @@ If you have already completed a batch run, you can start another round of evalua
117122

118123
You can select **New evaluation** to start another round of evaluation.
119124

125+
:::image type="content" source="./media/how-to-bulk-test-evaluate-flow/batch-run-detail-new-evaluation.png" alt-text="Screenshot of batch run detail page on where to start a new round of evaluation. " lightbox = "./media/how-to-bulk-test-evaluate-flow/batch-run-detail-new-evaluation.png":::
126+
120127
After setting up the configuration, you can select **"Submit"** for this new round of evaluation. After submission, you'll be able to see a new record in the prompt flow run list.
121128

122129
After the evaluation run completed, similarly, you can check the result of evaluation in the **"Overview->Output"** tab of the batch run detail page. You need select the new evaluation run to view its result.
@@ -131,6 +138,8 @@ In some scenarios, you'll modify your flow to improve its performance. You can s
131138

132139
To check the batch run history of your flow, you can select the **"View batch run"** button on the top right corner of your flow page. You'll see a list of batch runs that you have submitted for this flow.
133140

141+
:::image type="content" source="./media/how-to-bulk-test-evaluate-flow/batch-run-history.png" alt-text="Screenshot of Web Classification with the view bulk runs button selected." lightbox = "./media/how-to-bulk-test-evaluate-flow/batch-run-history.png":::
142+
134143
You can select on each batch run to check the detail. You can also select multiple batch runs and select on the **"Visualize outputs"** to compare the metrics and the outputs of these batch runs.
135144

136145
:::image type="content" source="./media/how-to-bulk-test-evaluate-flow/batch-run-history-list.png" alt-text="Screenshot of batch run runs showing the history." lightbox = "./media/how-to-bulk-test-evaluate-flow/batch-run-history-list.png":::
505 KB
Loading
638 KB
Loading
Loading
495 KB
Loading
475 KB
Loading

0 commit comments

Comments
 (0)