Skip to content

Commit 6e2a16b

Browse files
committed
Edit content
1 parent 523a0d0 commit 6e2a16b

File tree

1 file changed

+36
-40
lines changed

1 file changed

+36
-40
lines changed

articles/machine-learning/how-to-monitor-online-endpoints.md

Lines changed: 36 additions & 40 deletions
Original file line numberDiff line numberDiff line change
@@ -128,43 +128,33 @@ You can configure deployments to scale automatically based on metrics. To turn o
128128

129129
There are three logs that you can turn on for online endpoints:
130130

131-
* **AmlOnlineEndpointTrafficLog**: This traffic log provides a way for you to check the information of a request to the endpoint. You can use this log in the following cases:
132-
133-
* If the request response isn't 200, check the value of the `ResponseCodeReason` column to see the reason. Also check the reason in [HTTPS status codes](how-to-troubleshoot-online-endpoints.md#http-status-codes), in the article about troubleshooting online endpoints.
134-
135-
* You could check the response code and response reason of your model from the column "ModelStatusCode" and "ModelStatusReason".
136-
137-
* You want to check the duration of the request like total duration, the request/response duration, and the delay caused by the network throttling. You could check it from the logs to see the breakdown latency.
138-
139-
* If you want to check how many requests or failed requests recently. You could also enable the logs.
140-
141-
* **AmlOnlineEndpointConsoleLog**: Contains logs that the containers output to the console. Below are some cases:
142-
143-
* If the container fails to start, the console log can be useful for debugging.
144-
145-
* Monitor container behavior and make sure that all requests are correctly handled.
146-
147-
* Write request IDs in the console log. Joining the request ID, the AmlOnlineEndpointConsoleLog, and AmlOnlineEndpointTrafficLog in the Log Analytics workspace, you can trace a request from the network entry point of an online endpoint to the container.
148-
149-
* You can also use this log for performance analysis in determining the time required by the model to process each request.
150-
151-
* **AmlOnlineEndpointEventLog**: Contains event information regarding the container's life cycle. Currently, we provide information on the following types of events:
152-
153-
| Name | Message |
154-
| ----- | ----- |
155-
| BackOff | Back-off restarting failed container
156-
| Pulled | Container image "\<IMAGE\_NAME\>" already present on machine
157-
| Killing | Container inference-server failed liveness probe, will be restarted
158-
| Created | Created container image-fetcher
159-
| Created | Created container inference-server
160-
| Created | Created container model-mount
161-
| LivenessProbeFailed | Liveness probe failed: \<FAILURE\_CONTENT\>
162-
| ReadinessProbeFailed | Readiness probe failed: \<FAILURE\_CONTENT\>
163-
| Started | Started container image-fetcher
164-
| Started | Started container inference-server
165-
| Started | Started container model-mount
166-
| Killing | Stopping container inference-server
167-
| Killing | Stopping container model-mount
131+
* **AmlOnlineEndpointTrafficLog**: This traffic log provides a way for you to check the information of requests to the endpoint. This log is useful in the following cases:
132+
* A request response isn't 200, and you want more information. The `ResponseCodeReason` column in the log lists the reason. For a description of status codes and reasons, you can also see [HTTPS status codes](how-to-troubleshoot-online-endpoints.md#http-status-codes) in the article about troubleshooting online endpoints.
133+
* You want to look up the response code and response reason of your model for a request. The `ModelStatusCode` and `ModelStatusReason` columns provide this information.
134+
* You want to know the duration of a request. The logs provide a breakdown of the latency that shows the total duration, the request duration, the response duration, and the delay caused by network throttling.
135+
* You want to check how many recent requests succeeded and failed. The logs provide this information.
136+
* **AmlOnlineEndpointConsoleLog**: This log contains statements that the containers write as output to the console. This log is useful in the following cases:
137+
* A container fails to start. The console log can be useful for debugging.
138+
* You want to monitor container behavior and make sure that all requests are correctly handled.
139+
* You want to write request IDs in the console log. Joining the request ID, the AmlOnlineEndpointConsoleLog, and AmlOnlineEndpointTrafficLog in the Log Analytics workspace, you can trace a request from the network entry point of an online endpoint to the container.
140+
* You want to run a performance analysis. For instance, you want to determine the time the model needs to process each request.
141+
* **AmlOnlineEndpointEventLog**: This log contains event information about the container life cycle. Currently, the log provides information about the following types of events:
142+
143+
| Name | Message |
144+
| ----- | ----- |
145+
| BackOff | Back-off restarting failed container |
146+
| Pulled | Container image "\<IMAGE\_NAME\>" already present on machine |
147+
| Killing | Container inference-server failed liveness probe, will be restarted |
148+
| Created | Created container image-fetcher |
149+
| Created | Created container inference-server |
150+
| Created | Created container model-mount |
151+
| LivenessProbeFailed | Liveness probe failed: \<FAILURE\_CONTENT\> |
152+
| ReadinessProbeFailed | Readiness probe failed: \<FAILURE\_CONTENT\> |
153+
| Started | Started container image-fetcher |
154+
| Started | Started container inference-server |
155+
| Started | Started container model-mount |
156+
| Killing | Stopping container inference-server |
157+
| Killing | Stopping container model-mount |
168158

169159
### Turn logs on or off
170160

@@ -193,7 +183,9 @@ There are three logs that you can turn on for online endpoints:
193183

194184
1. From either the online endpoint properties or the Log Analytics workspace, select **Logs** from the left of the screen.
195185

196-
1. Close the **Queries** dialog that automatically opens, and then double-click the **AmlOnlineEndpointConsoleLog**. If you don't see it, use the **Search** field.
186+
1. Close the **Queries hub** window that automatically opens.
187+
188+
1. Under **Other**, double-click **AmlOnlineEndpointConsoleLog**. If you don't see **AmlOnlineEndpointConsoleLog**, enter that value into the search field.
197189

198190
:::image type="content" source="./media/how-to-monitor-online-endpoints/online-endpoints-log-queries.png" alt-text="Screenshot showing the log queries.":::
199191

@@ -203,13 +195,17 @@ There are three logs that you can turn on for online endpoints:
203195

204196
### Example queries
205197

206-
You can find example queries on the __Queries__ tab while viewing logs. Search for __Online endpoint__ to find example queries.
198+
Example queries are available for you to use. Take the following steps to view them:
199+
200+
1. On the Logs page, select **Queries** to open the Queries tab.
201+
202+
1. In the search box, enter **Online endpoint**.
207203

208204
:::image type="content" source="./media/how-to-monitor-online-endpoints/example-queries.png" alt-text="Screenshot of the example queries.":::
209205

210206
### Log column details
211207

212-
The following tables provide details on the data stored in each log:
208+
The following tables provide detailed information about the data that's stored in each log:
213209

214210
**AmlOnlineEndpointTrafficLog**
215211

0 commit comments

Comments
 (0)