articles/ai-studio/how-to/flow-process-image.md

@@ -1,33 +1,29 @@
---
title: Process images in prompt flow
titleSuffix: Azure AI Studio
description: Learn how to use images in prompt flow.
ms.service: azure-ai-studio
ms.topic: how-to
ms.date: 2/26/2024
ms.reviewer: jinzhong
ms.author: lagayhar
author: lgayhardt
---

# Process images in prompt flow

[!INCLUDE [Azure AI Studio preview](../includes/preview-ai-studio.md)]

Multimodal Large Language Models (LLMs), which can process and interpret diverse forms of data inputs, present a powerful tool that can elevate the capabilities of language-only systems to new heights. Among the various data types, images are important for many real-world applications. The incorporation of image data into AI systems provides an essential layer of visual understanding.

In this article, you learn:

> [!div class="checklist"]
> - How to use image data in prompt flow.
> - How to use the built-in GPT-4V tool to analyze image inputs.
> - How to build a chatbot that can process image and text inputs.
> - How to create a batch run using image data.
> - How to consume an online endpoint with image data.

## Image type in prompt flow

Prompt flow input and output support Image as a new data type.
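For example, an image input is declared in the flow definition like any other input type. A minimal sketch of the relevant `flow.dag.yaml` fragment, assuming an input named `input_image` and a node named `gpt4v_node` (both illustrative; check the promptflow DAG schema for your version):

```yaml
inputs:
  input_image:
    type: image      # the Image data type described above
outputs:
  answer:
    type: string
    reference: ${gpt4v_node.output}
```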
@@ -38,10 +34,10 @@ To use image data in prompt flow authoring page:

   :::image type="content" source="../media/prompt-flow/how-to-process-image/add-image-type-input.png" alt-text="Screenshot of flow authoring page showing adding flow input as Image type." lightbox = "../media/prompt-flow/how-to-process-image/add-image-type-input.png":::

2. Preview the image. If the image isn't displayed correctly, delete the image and add it again.
3. You might want to preprocess the image using the [Python tool](./prompt-flow-tools/python-tool.md) before feeding it to the LLM. For example, you can resize or crop the image to a smaller size.

   :::image type="content" source="../media/prompt-flow/how-to-process-image/process-image-using-python.png" alt-text="Screenshot of using python tool to do image preprocessing." lightbox = "../media/prompt-flow/how-to-process-image/process-image-using-python.png":::

   > [!IMPORTANT]
   > To process images using a Python function, you need to use the `Image` class that you import from the `promptflow.contracts.multimedia` package. The `Image` class represents the image type within prompt flow. It's designed to work with image data in byte format, which is convenient when you need to handle or manipulate the image data directly.
   >
   > To return the processed image data, use the `Image` class to wrap the image data. Create an `Image` object by providing the image data in bytes and its [MIME type](https://developer.mozilla.org/docs/Web/HTTP/Basics_of_HTTP/MIME_types/Common_types) `mime_type`. The MIME type lets the system understand the format of the image data, or it can be `*` for an unknown type.
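The resize-then-wrap pattern described in the note above can be sketched as a Python tool function. This is a minimal sketch, assuming the `pillow` and `promptflow` packages are installed; the function names and the 512-pixel limit are illustrative, and the exact `Image` constructor may vary by promptflow version:

```python
import io

from PIL import Image as PILImage


def resize_image_bytes(data: bytes, max_side: int = 512) -> bytes:
    """Downscale raw image bytes so the longest side is at most max_side."""
    img = PILImage.open(io.BytesIO(data))
    img.thumbnail((max_side, max_side))  # resizes in place, keeping aspect ratio
    buf = io.BytesIO()
    img.save(buf, format="PNG")
    return buf.getvalue()


def resize_tool(input_image):
    # Imported lazily so resize_image_bytes stays usable without promptflow.
    # In a real flow this function would also carry promptflow's @tool decorator.
    from promptflow.contracts.multimedia import Image

    resized = resize_image_bytes(bytes(input_image))
    # Wrap the bytes in Image so prompt flow treats the node output as an image.
    return Image(resized, mime_type="image/png")
```

Because the flow output is an `Image` object, downstream nodes and the flow preview can render it directly.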
@@ -51,7 +47,7 @@ If the Image object from Python node is set as the flow output, you can preview 

## Use GPT-4V tool

The [Azure OpenAI GPT-4 Turbo with Vision tool](./prompt-flow-tools/azure-open-ai-gpt-4v-tool.md) and OpenAI GPT-4V are built-in tools in prompt flow that can use the OpenAI GPT-4V model to answer questions based on input images. You can find the tools by selecting **+ More tools** in the flow authoring page.

Add the [Azure OpenAI GPT-4 Turbo with Vision tool](./prompt-flow-tools/azure-open-ai-gpt-4v-tool.md) to the flow. Make sure you have an Azure OpenAI connection with GPT-4 vision-preview models available.
@@ -65,11 +61,11 @@ You can assign a value to the image input through the following ways:

- Reference from the flow input of Image type.
- Reference from another node's output of Image type.
- Upload, drag, or paste an image, or specify an image URL or the relative image path.

## Build a chatbot to process images

In this section, you learn how to build a chatbot that can process image and text inputs.

Assume you want to build a chatbot that can answer any questions about the image and text together. You can achieve this by following the steps below:
@@ -120,13 +116,13 @@ If the batch run outputs contain images, you can check the **flow_outputs datase

You can [deploy a flow to an online endpoint for real-time inference](./flow-deploy.md).

Currently the **Test** tab in the deployment detail page doesn't support image inputs or outputs.

For now, you can test the endpoint by sending a request that includes image inputs.

To consume the online endpoint with image input, you should represent the image by using the format `{"data:<mime type>;<representation>": "<value>"}`. In this case, `<representation>` can be either `url` or `base64`.

If the flow generates image output, it's returned in `base64` format, for example, `{"data:<mime type>;base64": "<base64 string>"}`.
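The request format above can be assembled with a short helper. A minimal sketch, assuming a flow input named `image` and placeholder endpoint details; take the real scoring URL and key from your deployment's consume page:

```python
import base64
import json


def image_payload(image_bytes: bytes, mime_type: str = "image/png") -> dict:
    """Represent an image as {"data:<mime type>;base64": "<base64 string>"}."""
    encoded = base64.b64encode(image_bytes).decode("utf-8")
    return {f"data:{mime_type};base64": encoded}


# Hypothetical flow input named "image". The URL representation works the
# same way: {"data:image/png;url": "https://example.com/cat.png"}
body = json.dumps({"image": image_payload(b"\x89PNG\r\n")})

# Send with any HTTP client, for example:
# requests.post("https://<endpoint>/score", data=body,
#               headers={"Content-Type": "application/json",
#                        "Authorization": "Bearer <key>"})
```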

articles/ai-studio/how-to/prompt-flow-tools/azure-open-ai-gpt-4v-tool.md
@@ -5,7 +5,7 @@ description: This article introduces the Azure OpenAI GPT-4 Turbo with Vision to

manager: nitinme
ms.service: azure-ai-studio
ms.topic: how-to
ms.date: 2/26/2024
ms.reviewer: keli19
ms.author: lagayhar
author: lgayhardt
@@ -27,18 +27,51 @@ The prompt flow *Azure OpenAI GPT-4 Turbo with Vision* tool enables you to use y

- An [Azure AI hub resource](../../how-to/create-azure-ai-resource.md) with a GPT-4 Turbo with Vision model deployed in one of the regions that support GPT-4 Turbo with Vision: Australia East, Switzerland North, Sweden Central, and West US. When you deploy from your project's **Deployments** page, select `gpt-4` as the model name and `vision-preview` as the model version.

## Build with the Azure OpenAI GPT-4 Turbo with Vision tool

1. Create or open a flow in [Azure AI Studio](https://ai.azure.com). For more information, see [Create a flow](../flow-develop.md).
1. Select **+ More tools** > **Azure OpenAI GPT-4 Turbo with Vision** to add the Azure OpenAI GPT-4 Turbo with Vision tool to your flow.

   :::image type="content" source="../../media/prompt-flow/azure-openai-gpt-4-vision-tool.png" alt-text="Screenshot of the Azure OpenAI GPT-4 Turbo with Vision tool added to a flow in Azure AI Studio." lightbox="../../media/prompt-flow/azure-openai-gpt-4-vision-tool.png":::

1. Select the connection to your Azure OpenAI Service. For example, you can select the **Default_AzureOpenAI** connection. For more information, see [Prerequisites](#prerequisites).
1. Enter values for the Azure OpenAI GPT-4 Turbo with Vision tool input parameters described in [Inputs](#inputs). For example, you can use this example prompt:

   ```jinja
   # system:
   As an AI assistant, your task involves interpreting images and responding to questions about the image.
   Remember to provide accurate answers based on the information present in the image.

   # user:
   Can you tell me what the image depicts?
   
   ```

1. Select **Validate and parse input** to validate the tool inputs.
1. Specify an image to analyze in the `image_input` input parameter. For example, you can upload an image or enter the URL of an image to analyze. Otherwise, you can paste or drag and drop an image into the tool.
1. Add more tools to your flow as needed, or select **Run** to run the flow.
1. View the outputs, which are described in [Outputs](#outputs).

Here's an example output response:

```json
{
  "system_metrics": {
    "completion_tokens": 96,
    "duration": 4.874329,
    "prompt_tokens": 1157,
    "total_tokens": 1253
  },
  "output": "The image depicts a user interface for Azure's OpenAI GPT-4 service. It is showing a configuration screen where settings related to the AI's behavior can be adjusted, such as the model (GPT-4), temperature, top_p, frequency penalty, etc. There's also an area where users can enter a prompt to generate text, and an option to include an image input for the AI to interpret, suggesting that this particular interface supports both text and image inputs."
}
```
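Under the hood, the system and user sections of the jinja prompt map onto the chat messages format, with the image sent as an inline `data:` URL content part. A hedged sketch of building that request body outside prompt flow (the message layout follows the GPT-4 Turbo with Vision chat contract; the function name and input values are illustrative):

```python
import base64


def vision_messages(question: str, image_bytes: bytes, mime_type: str = "image/png") -> list:
    """Build chat messages mirroring the jinja prompt above: a system turn
    plus a user turn combining text and an inline base64-encoded image."""
    data_url = f"data:{mime_type};base64," + base64.b64encode(image_bytes).decode("utf-8")
    return [
        {
            "role": "system",
            "content": (
                "As an AI assistant, your task involves interpreting images "
                "and responding to questions about the image."
            ),
        },
        {
            "role": "user",
            "content": [
                {"type": "text", "text": question},
                {"type": "image_url", "image_url": {"url": data_url}},
            ],
        },
    ]
```

The tool builds an equivalent request from your prompt and `image_input` automatically; this sketch is only meant to show what the validated prompt resolves to.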