You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-studio/how-to/flow-deploy.md
+12-1Lines changed: 12 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -37,6 +37,10 @@ To deploy a prompt flow as an online endpoint, you need:
37
37
* An Azure subscription. If you don't have one, create a free account before you begin.
38
38
* An Azure AI Studio project.
39
39
40
+
41
+
> [!NOTE]
42
+
> Managed online endpoint only supports managed virtual network. If your hub is in custom vnet, you need to try other deployment options, such as [deploy to other platforms such as Docker](https://microsoft.github.io/promptflow/how-to-guides/deploy-a-flow/index.html).
43
+
40
44
## Create an online deployment
41
45
42
46
Now that you have built a flow and tested it properly, it's time to create your online endpoint for real-time inference.
@@ -106,7 +110,6 @@ This step allows you to configure the basic settings of the deployment.
106
110
|Virtual machine| The VM size to use for the deployment.|
107
111
|Instance count| The number of instances to use for the deployment. Specify the value on the workload you expect. For high availability, we recommend that you set the value to at least `3`. We reserve an extra 20% for performing upgrades.|
108
112
|Inference data collection| If you enable this, the flow inputs and outputs are auto collected in an Azure Machine Learning data asset, and can be used for later monitoring.|
109
-
|Application Insights diagnostics| If you enable this, system metrics during inference time (such as token count, flow latency, and flow request) will be collected into Azure AI Studio hub default Application Insights.|
110
113
111
114
After you finish the basic settings, you can directly **Review + Create** to finish the creation, or you can select **Next** to configure advanced settings.
112
115
@@ -149,6 +152,9 @@ If you created the associated endpoint with **User Assigned Identity**, the user
149
152
150
153
See detailed guidance about how to grant permissions to the endpoint identity in [Grant permissions to the endpoint](#grant-permissions-to-the-endpoint).
151
154
155
+
> [!IMPORTANT]
156
+
> If your flow uses Microsoft Entra ID based authentication connections, no matter you use system-assigned identity or user-assigned identity, you always need to grant the managed identity appropriate roles of the corresponding resources so that it can make API calls to that resource. For example, if your Azure OpenAI connection uses Microsoft Entra ID based authentication, you need to grant your endpoint managed identity **Cognitive Services OpenAI User or Cognitive Services OpenAI Contributor role** of the corresponding Azure OpenAI resources.
157
+
152
158
### Advanced settings - Outputs & Connections
153
159
154
160
In this step, you can view all flow outputs, and specify which outputs to include in the response of the endpoint you deploy. By default all flow outputs are selected.
@@ -164,6 +170,10 @@ Once you configured and reviewed all the steps above, you can select **Review +
164
170
>
165
171
> You can understand the deployment creation progress via the notification starts by **Prompt flow deployment**.
166
172
173
+
#### Enable tracing by turning-on Application Insights diagnostics (preview)
174
+
175
+
If you enable this, tracing data and system metrics during inference time (such as token count, flow latency, flow request, and etc.) will be collected into workspace linked Application Insights. To learn more, see [prompt flow serving tracing data and metrics](./develop/trace-production-sdk.md).
176
+
167
177
## Grant permissions to the endpoint
168
178
169
179
> [!IMPORTANT]
@@ -245,3 +255,4 @@ If you aren't going use the endpoint after completing this tutorial, you should
245
255
246
256
- Learn more about what you can do in [Azure AI Studio](../what-is-ai-studio.md)
247
257
- Get answers to frequently asked questions in the [Azure AI FAQ article](../faq.yml)
258
+
-[Enable trace and collect feedback for your deployment] (./develop/trace-production-sdk.md)
0 commit comments