Skip to content

Commit 72c4137

Browse files
Merge pull request #277533 from lgayhardt/aistudiopfdeploy0524
AI Studio Prompt flow deployment updates
2 parents 2892a02 + 4dcb202 commit 72c4137

File tree

1 file changed

+12
-1
lines changed

1 file changed

+12
-1
lines changed

articles/ai-studio/how-to/flow-deploy.md

Lines changed: 12 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,10 @@ To deploy a prompt flow as an online endpoint, you need:
3737
* An Azure subscription. If you don't have one, create a free account before you begin.
3838
* An Azure AI Studio project.
3939

40+
41+
> [!NOTE]
42+
> Managed online endpoint only supports managed virtual network. If your hub is in custom vnet, you need to try other deployment options, such as [deploy to other platforms such as Docker](https://microsoft.github.io/promptflow/how-to-guides/deploy-a-flow/index.html).
43+
4044
## Create an online deployment
4145

4246
Now that you have built a flow and tested it properly, it's time to create your online endpoint for real-time inference.
@@ -106,7 +110,6 @@ This step allows you to configure the basic settings of the deployment.
106110
|Virtual machine| The VM size to use for the deployment.|
107111
|Instance count| The number of instances to use for the deployment. Specify the value on the workload you expect. For high availability, we recommend that you set the value to at least `3`. We reserve an extra 20% for performing upgrades.|
108112
|Inference data collection| If you enable this, the flow inputs and outputs are auto collected in an Azure Machine Learning data asset, and can be used for later monitoring.|
109-
|Application Insights diagnostics| If you enable this, system metrics during inference time (such as token count, flow latency, and flow request) will be collected into Azure AI Studio hub default Application Insights.|
110113

111114
After you finish the basic settings, you can directly **Review + Create** to finish the creation, or you can select **Next** to configure advanced settings.
112115

@@ -149,6 +152,9 @@ If you created the associated endpoint with **User Assigned Identity**, the user
149152

150153
See detailed guidance about how to grant permissions to the endpoint identity in [Grant permissions to the endpoint](#grant-permissions-to-the-endpoint).
151154

155+
> [!IMPORTANT]
156+
> If your flow uses Microsoft Entra ID based authentication connections, no matter you use system-assigned identity or user-assigned identity, you always need to grant the managed identity appropriate roles of the corresponding resources so that it can make API calls to that resource. For example, if your Azure OpenAI connection uses Microsoft Entra ID based authentication, you need to grant your endpoint managed identity **Cognitive Services OpenAI User or Cognitive Services OpenAI Contributor role** of the corresponding Azure OpenAI resources.
157+
152158
### Advanced settings - Outputs & Connections
153159

154160
In this step, you can view all flow outputs, and specify which outputs to include in the response of the endpoint you deploy. By default all flow outputs are selected.
@@ -164,6 +170,10 @@ Once you configured and reviewed all the steps above, you can select **Review +
164170
>
165171
> You can understand the deployment creation progress via the notification starts by **Prompt flow deployment**.
166172
173+
#### Enable tracing by turning-on Application Insights diagnostics (preview)
174+
175+
If you enable this, tracing data and system metrics during inference time (such as token count, flow latency, flow request, and etc.) will be collected into workspace linked Application Insights. To learn more, see [prompt flow serving tracing data and metrics](./develop/trace-production-sdk.md).
176+
167177
## Grant permissions to the endpoint
168178

169179
> [!IMPORTANT]
@@ -245,3 +255,4 @@ If you aren't going use the endpoint after completing this tutorial, you should
245255

246256
- Learn more about what you can do in [Azure AI Studio](../what-is-ai-studio.md)
247257
- Get answers to frequently asked questions in the [Azure AI FAQ article](../faq.yml)
258+
- [Enable trace and collect feedback for your deployment] (./develop/trace-production-sdk.md)

0 commit comments

Comments
 (0)