Commit 77cfaad

edit pass: deploy-and-monitor-flows
1 parent 86c47a9 commit 77cfaad

3 files changed: +11 −13 lines changed

articles/ai-foundry/how-to/develop/trace-production-sdk.md

Lines changed: 8 additions & 8 deletions
@@ -75,14 +75,14 @@ The **Dependency** type event records calls from your deployments. The name of t
 
 | Metrics name | Type | Dimensions | Description |
 |--------------------------------------|-----------|-------------------------------------------|---------------------------------------------------------------------------------|
-| token_consumption | counter | - flow <br> - node<br> - llm_engine<br> - token_type: `prompt_tokens`: LLM API input tokens; `completion_tokens`: LLM API response tokens; `total_tokens` = `prompt_tokens + completion tokens` | OpenAI token consumption metrics |
-| flow_latency | histogram | flow, response_code, streaming, response_type | request execution cost, response_type means whether it's full/firstbyte/lastbyte|
-| flow_request | counter | flow, response_code, exception, streaming | flow request count |
-| node_latency | histogram | flow, node, run_status | node execution cost |
-| node_request | counter | flow, node, exception, run_status | node execution count |
-| rpc_latency | histogram | flow, node, api_call | rpc cost |
-| rpc_request | counter | flow, node, api_call, exception | rpc count |
-| flow_streaming_response_duration | histogram | flow | streaming response sending cost, from sending first byte to sending last byte |
+| `token_consumption` | counter | - flow <br> - node<br> - `llm_engine`<br> - `token_type`: `prompt_tokens`: LLM API input tokens; `completion_tokens`: LLM API response tokens; `total_tokens` = `prompt_tokens` + `completion_tokens` | OpenAI token consumption metrics |
+| `flow_latency` | histogram | flow, `response_code`, streaming, `response_type` | request execution cost; `response_type` indicates whether it's full/firstbyte/lastbyte |
+| `flow_request` | counter | flow, `response_code`, exception, streaming | flow request count |
+| `node_latency` | histogram | flow, node, `run_status` | node execution cost |
+| `node_request` | counter | flow, node, exception, `run_status` | node execution count |
+| `rpc_latency` | histogram | flow, node, `api_call` | RPC cost |
+| `rpc_request` | counter | flow, node, `api_call`, exception | RPC count |
+| `flow_streaming_response_duration` | histogram | flow | streaming response sending cost, from sending the first byte to sending the last byte |
 
 You can find the workspace default Application Insights metrics on your workspace overview page in the Azure portal.
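Because these metrics land in Application Insights, you can also query them outside the portal. The following is a minimal sketch, assuming the `azure-monitor-query` and `azure-identity` packages, a placeholder Log Analytics workspace ID, and a workspace-based Application Insights resource (where metrics appear in the `AppMetrics` table; classic resources use `customMetrics` with different column names instead). Verify the dimension names against your own telemetry.

```python
# Minimal sketch: sum token_consumption by token_type over the last day.
# Assumes azure-monitor-query and azure-identity are installed; the workspace
# ID is a placeholder, not a value from the article.
from datetime import timedelta

from azure.identity import DefaultAzureCredential
from azure.monitor.query import LogsQueryClient

WORKSPACE_ID = "<log-analytics-workspace-guid>"  # placeholder

client = LogsQueryClient(DefaultAzureCredential())

# AppMetrics is the workspace-based Application Insights metrics table;
# dimensions such as token_type are stored in the Properties bag.
query = """
AppMetrics
| where Name == "token_consumption"
| extend token_type = tostring(Properties["token_type"])
| summarize total_tokens = sum(Sum) by token_type
"""

response = client.query_workspace(WORKSPACE_ID, query, timespan=timedelta(days=1))
for table in response.tables:
    for row in table.rows:
        print(row)
```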

articles/ai-foundry/how-to/flow-deploy.md

Lines changed: 0 additions & 2 deletions
@@ -237,8 +237,6 @@ If you aren't going to use the endpoint after you finish this tutorial, delete t
 
 ## Related content
 
-## Next steps
-
 - Learn more about what you can do in [Azure AI Foundry](../what-is-ai-foundry.md).
 - Get answers to frequently asked questions in the [Azure AI Foundry FAQ](../faq.yml).
 - [Enable trace and collect feedback for your deployment](./develop/trace-production-sdk.md).

articles/ai-foundry/how-to/prompt-flow-troubleshoot.md

Lines changed: 3 additions & 3 deletions
@@ -37,7 +37,7 @@ Errors related to compute session failures that use a custom base image are disc
 
 Flow run-related issues are discussed.
 
-### How do I find the raw inputs and outputs of in the LLM tool for further investigation?
+### How do I find the raw inputs and outputs of the LLM tool for further investigation?
 
 In a prompt flow, on a **Flow** page with a successful run and run detail page, you can find the raw inputs and outputs of the LLM tool in the output section. Select **View full output** to view the full output.
 
@@ -87,7 +87,7 @@ You might encounter a 409 error from Azure OpenAI. This error means that you rea
 
 Flow deployment-related issues are discussed.
 
-### Upstream request time-out issue when consuming the endpoint
+### How do I resolve an upstream request time-out issue?
 
 If you use the Azure CLI or SDK to deploy the flow, you might encounter a time-out error. By default, `request_timeout_ms` is 5000. You can specify a maximum of five minutes, which is 300,000 ms. The following example shows how to specify a `request timeout` in the deployment yaml file. To learn more, see [deployment schema](/azure/machine-learning/reference-yaml-deployment-managed-online).
 
@@ -96,7 +96,7 @@ request_settings:
   request_timeout_ms: 300000
 ```
 
-### What do I do when OpenAI API hits an authentication error?
+### What do I do when the OpenAI API returns an authentication error?
 
 If you regenerate your Azure OpenAI key and manually update the connection used in a prompt flow, you might encounter errors like "Unauthorized. Access token is missing, invalid, audience is incorrect or have expired." You might see these messages when you invoke an existing endpoint that was created before the key was regenerated.
 
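If you deploy with the Python SDK (v2) rather than a YAML file, the same timeout can be set in code. Below is a minimal sketch, assuming the `azure-ai-ml` package; the endpoint and deployment names are hypothetical placeholders, and the rest of the deployment configuration is elided.

```python
# Minimal sketch: set the five-minute request timeout from the Python SDK (v2)
# instead of the deployment YAML. Names are placeholders, not from the article.
from azure.ai.ml.entities import ManagedOnlineDeployment, OnlineRequestSettings

deployment = ManagedOnlineDeployment(
    name="blue",                       # hypothetical deployment name
    endpoint_name="my-flow-endpoint",  # hypothetical endpoint name
    request_settings=OnlineRequestSettings(
        request_timeout_ms=300_000,    # maximum allowed value (five minutes)
    ),
    # ...model, environment, instance_type, and so on, omitted here...
)
```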