Commit 595f4ac

Final section of metrics topic.
1 parent a204515 commit 595f4ac

1 file changed: +23 -0 lines changed

articles/operator-nexus/reference-operator-nexus-observability-metrics.md

Lines changed: 23 additions & 0 deletions
@@ -152,3 +152,26 @@ All of the measurements are collected at 5-minute intervals.
| Lldp Frame in | LLDPFrameIn in a network device refers to the number of Link Layer Discovery Protocol (LLDP) frames that the device received. |
| Lldp Frame out | LLDPFrameOut in a network device refers to the number of Link Layer Discovery Protocol (LLDP) frames that the device sent. LLDP is used by network devices to advertise their identity and capabilities to other devices on the same network. |
| Lldp TLV unknown | LLDPTLVUnknown in a network device refers to the number of Link Layer Discovery Protocol (LLDP) frames received that contain unknown Type-Length-Value (TLV) entries. TLVs are used in network protocols to specify optional information. An "unknown" TLV suggests the device received data that it doesn't recognize or can't interpret, which could indicate compatibility issues within the network. |

## Network fabric device resource utilization

Resource utilization metrics show how efficiently a network fabric device uses its resources and give a combined view of the device's performance and health. They measure CPU workload, cooling efficiency, memory availability, power performance, and heat levels. Use these metrics to optimize system performance, manage resources effectively, and prevent hardware damage due to excessive heat.

| Metrics Category | Description/Usage | Collection Interval | Measured Unit |
|--|--|--|--|
| CPU Utilization Avg | The mean percentage of computing resources used by a processor over a specific time interval. | 1 min | Percentage |
| CPU Utilization Instant | The percentage of computing resources that a processor is using at a specific point in time. It provides real-time insight into the CPU's workload and performance. | 1 min | Percentage |
| CPU Utilization Max | The highest percentage of computing resources used by a processor over a specific time interval. | 1 min | Percentage |
| CPU Utilization Min | The lowest percentage of computing resources used by a processor over a specific time interval. | 1 min | Percentage |
| Fan Speed | The current rate at which the cooling fan is spinning. It's a key metric for maintaining optimal operating temperature and ensuring the longevity of the device's components. | 1 min | RPM (not available) |
| Memory Available | The available memory that is physically installed or logically allocated to the component. | 1 min | Bytes |
| Power Supply Input Current | The amount of electrical current, measured in amps, that the power supply unit (PSU) draws from the source. It's a key factor in determining the PSU's efficiency and the overall power consumption of the device. | 1 min | Amps (not available) |
| Power Supply Input Voltage | The electrical potential, measured in volts, that the source supplies to the power supply unit (PSU). It's a key parameter in ensuring the PSU can adequately convert AC power to DC power for the device's needs. | 1 min | Volts (not available) |
| Power Supply Max Power Capacity | The maximum power capacity of the power supply. | 1 min | Watts (not available) |
| Power Supply Output Current | The output current supplied by the power supply. | 1 min | Amps (not available) |
| Power Supply Output Voltage | The output voltage supplied by the power supply. | 1 min | Volts (not available) |
| Power Supply Output Power | The amount of electrical power, measured in watts, that the power supply unit (PSU) delivers to the device's components. It's a key factor in ensuring the device has sufficient power for optimal performance. | 1 min | |
| Temperature Instantaneous | The real-time temperature of the device's components. | 1 min | |
| Temperature Max | The highest safe operating temperature for the device's components. Exceeding this limit can lead to overheating, which might cause performance issues, component damage, or device failure. Monitor and manage the device's temperature to ensure its longevity and optimal performance. | 1 min | |
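
To illustrate how the four CPU utilization metrics relate to one another, the following Python sketch derives them from raw utilization samples taken within a single interval. It's a minimal illustration, not a description of how the device computes these values: the function name `summarize_cpu`, the sample values, and the choice to treat the most recent sample as the "instant" value are all assumptions.

```python
from statistics import mean


def summarize_cpu(samples):
    """Summarize CPU utilization samples (percentages) from one interval.

    Hypothetical helper: assumes `samples` is ordered oldest to newest and
    that the "instant" value is simply the most recent sample.
    """
    if not samples:
        raise ValueError("at least one sample is required")
    return {
        "avg": mean(samples),     # CPU Utilization Avg
        "instant": samples[-1],   # CPU Utilization Instant (assumed: latest sample)
        "max": max(samples),      # CPU Utilization Max
        "min": min(samples),      # CPU Utilization Min
    }


# Made-up sample values for one collection interval.
summary = summarize_cpu([20.0, 35.0, 50.0, 15.0])
print(summary)
```

A wide gap between the `max` and `avg` values in such a summary suggests short bursts of load rather than sustained utilization, which is why the table lists the four metrics separately.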