Commit c4192b9

Added shoeboxed metrics
1 parent 9062bea commit c4192b9

1 file changed: +10 -0 lines changed

articles/operator-nexus/troubleshoot-memory-limits.md

Lines changed: 10 additions & 0 deletions
@@ -15,6 +15,16 @@ author: matternst7258

It's recommended to have alerts set up for the Operator Nexus cluster to look for Kubernetes pods restarting from OOMKill errors. These alerts allow customers to know if a component on a server is working appropriately.

+Metrics exposed to identify memory limits:
+
+| Metric Name                          | Description                                      |
+| ------------------------------------ | ------------------------------------------------ |
+| Container Restarts                   | `kube_pod_container_status_restarts_total`       |
+| Container Status Terminated Reason   | `kube_pod_container_status_terminated_reason`    |
+| Container Resource Limits            | `kube_pod_container_resource_limits`             |
+
+`Container Status Terminated Reason` displays the OOMKill reason for impacted pods.
+
## Identifying Out of Memory (OOM) pods

Start by identifying any components that are restarting or show OOMKill.
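
As a rough illustration of how the metrics added in this commit could be combined, the sketch below flags containers that report an OOM kill and also have a restart count at or above a threshold. It is not part of the commit. The sample record shape (`name`, `labels`, `value`), the `find_oom_killed` helper, and the example namespace and pod names are hypothetical. The label names (`namespace`, `pod`, `container`, `reason`) and the `OOMKilled` reason value are assumed to follow the usual kube-state-metrics conventions and may differ in a given deployment.

```python
from collections import defaultdict

RESTARTS = "kube_pod_container_status_restarts_total"
TERMINATED_REASON = "kube_pod_container_status_terminated_reason"


def find_oom_killed(samples, min_restarts=1):
    """Return sorted (namespace, pod, container, restarts) tuples for
    containers whose terminated reason is OOMKilled and whose restart
    count is at least min_restarts."""
    restarts = defaultdict(float)
    oom_killed = set()
    for sample in samples:
        labels = sample["labels"]
        # Assumed label names; missing labels fall back to an empty string.
        key = (
            labels.get("namespace", ""),
            labels.get("pod", ""),
            labels.get("container", ""),
        )
        if sample["name"] == RESTARTS:
            # Keep the highest restart count seen for this container.
            restarts[key] = max(restarts[key], sample["value"])
        elif (
            sample["name"] == TERMINATED_REASON
            and labels.get("reason") == "OOMKilled"  # assumed reason value
            and sample["value"] > 0
        ):
            oom_killed.add(key)
    return sorted(
        (ns, pod, ctr, int(restarts[(ns, pod, ctr)]))
        for ns, pod, ctr in oom_killed
        if restarts[(ns, pod, ctr)] >= min_restarts
    )


# Hypothetical samples: pod-a was OOM killed, pod-b exited with an error.
EXAMPLE_SAMPLES = [
    {"name": RESTARTS, "value": 3.0,
     "labels": {"namespace": "example-ns", "pod": "example-pod-a", "container": "app"}},
    {"name": TERMINATED_REASON, "value": 1.0,
     "labels": {"namespace": "example-ns", "pod": "example-pod-a", "container": "app",
                "reason": "OOMKilled"}},
    {"name": RESTARTS, "value": 1.0,
     "labels": {"namespace": "example-ns", "pod": "example-pod-b", "container": "app"}},
    {"name": TERMINATED_REASON, "value": 1.0,
     "labels": {"namespace": "example-ns", "pod": "example-pod-b", "container": "app",
                "reason": "Error"}},
]

if __name__ == "__main__":
    for row in find_oom_killed(EXAMPLE_SAMPLES):
        print(row)  # ('example-ns', 'example-pod-a', 'app', 3)
```

Raising `min_restarts` narrows the result to containers that are restarting repeatedly, which is closer to what an alert rule would typically match.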
