You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/azure-monitor/insights/network-performance-monitor-faq.md
+32-31Lines changed: 32 additions & 31 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -27,7 +27,7 @@ More information on the various capabilities supported by [Network Performance M
27
27
### What are the platform requirements for the nodes to be used for monitoring by NPM?
28
28
Listed below are the platform requirements for NPM's various capabilities:
29
29
30
-
- NPM's Performance Monitor and Service Connectivity Monitor capabilities support both Windows server and Windows desktops/client operating systems. Windows server OS versions supported are 2008 SP1 or later. Windows desktops/client versions supported are Windows 10, Windows 8.1, Windows 8 and Windows 7.
30
+
- NPM's Performance Monitor and Service Connectivity Monitor capabilities support both Windows server and Windows desktops/client operating systems. Windows server OS versions supported are 2008 SP1 or later. Windows desktops/client versions supported are Windows 10, Windows 8.1, Windows 8, and Windows 7.
31
31
- NPM's ExpressRoute Monitor capability supports only Windows server (2008 SP1 or later) operating system.
32
32
33
33
### Can I use Linux machines as monitoring nodes in NPM?
@@ -97,38 +97,39 @@ Refer to [alerts section in the documentation](https://docs.microsoft.com/azure/
97
97
### What are the default Log Analytics queries for alerts
98
98
Performance monitor query
99
99
100
-
NetworkMonitoring
101
-
| where (SubType == "SubNetwork" or SubType == "NetworkPath")
102
-
| where (LossHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy") and RuleName == "<<yourrulename>>"
100
+
NetworkMonitoring
101
+
| where (SubType == "SubNetwork" or SubType == "NetworkPath")
102
+
| where (LossHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy") and RuleName == "<<your rule name>>"
103
103
104
104
Service connectivity monitor query
105
105
106
-
NetworkMonitoring
107
-
| where (SubType == "EndpointHealth" or SubType == "EndpointPath")
108
-
| where (LossHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy" or ServiceResponseHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy") and TestName == "<<yourtestname>>"
106
+
NetworkMonitoring
107
+
| where (SubType == "EndpointHealth" or SubType == "EndpointPath")
108
+
| where (LossHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy" or ServiceResponseHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy") and TestName == "<<your test name>>"
109
109
110
110
ExpressRoute monitor queries:
111
111
Circuits query
112
112
113
-
NetworkMonitoring
114
-
| where (SubType == "ERCircuitTotalUtilization") and (UtilizationHealthState == "Unhealthy") and CircuitResourceId == "<<yourcircuitresourceID>>"
113
+
NetworkMonitoring
114
+
| where (SubType == "ERCircuitTotalUtilization") and (UtilizationHealthState == "Unhealthy") and CircuitResourceId == "<<yourcircuitresourceID>>"
115
115
116
116
Private peering
117
117
118
-
NetworkMonitoring
119
-
| where (SubType == "ExpressRoutePeering" or SubType == "ERVNetConnectionUtilization" or SubType == "ExpressRoutePath")
120
-
| where (LossHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy" or UtilizationHealthState == "Unhealthy") and CircuitName == "<<yourcircuitname>>" and VirtualNetwork == "<<vnetname>>"
118
+
NetworkMonitoring
119
+
| where (SubType == "ExpressRoutePeering" or SubType == "ERVNetConnectionUtilization" or SubType == "ExpressRoutePath")
120
+
| where (LossHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy" or UtilizationHealthState == "Unhealthy") and CircuitName == "<<yourcircuitname>>" and VirtualNetwork == "<<vnetname>>"
121
121
122
122
Microsoft peering
123
123
124
-
NetworkMonitoring
125
-
| where (SubType == "ExpressRoutePeering" or SubType == "ERMSPeeringUtilization" or SubType == "ExpressRoutePath")
126
-
| where (LossHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy" or UtilizationHealthState == "Unhealthy") and CircuitName == ""<<yourcircuitname>>" and PeeringType == "MicrosoftPeering"
124
+
NetworkMonitoring
125
+
| where (SubType == "ExpressRoutePeering" or SubType == "ERMSPeeringUtilization" or SubType == "ExpressRoutePath")
126
+
| where (LossHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy" or UtilizationHealthState == "Unhealthy") and CircuitName == ""<<yourcircuitname>>" and PeeringType == "MicrosoftPeering"
127
+
128
+
Common query
127
129
128
-
Common query
129
-
NetworkMonitoring
130
-
| where (SubType == "ExpressRoutePeering" or SubType == "ERVNetConnectionUtilization" or SubType == "ERMSPeeringUtilization" or SubType == "ExpressRoutePath")
131
-
| where (LossHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy" or UtilizationHealthState == "Unhealthy")
130
+
NetworkMonitoring
131
+
| where (SubType == "ExpressRoutePeering" or SubType == "ERVNetConnectionUtilization" or SubType == "ERMSPeeringUtilization" or SubType == "ExpressRoutePath")
132
+
| where (LossHealthState == "Unhealthy" or LatencyHealthState == "Unhealthy" or UtilizationHealthState == "Unhealthy")
132
133
133
134
### Can NPM monitor routers and servers as individual devices?
134
135
NPM only identifies the IP and host name of underlying network hops (switches, routers, servers, etc.) between the source and destination IPs. It also identifies the latency between these identified hops. It does not individually monitor these underlying hops.
@@ -144,21 +145,21 @@ Incoming and outgoing values for both Primary and Secondary bandwidth can be cap
144
145
145
146
For MS peering level information, use the below mentioned query in Log Search
### Which regions are supported for NPM's Performance Monitor?
164
165
NPM can monitor connectivity between networks in any part of the world, from a workspace that is hosted in one of the [supported regions](../../azure-monitor/insights/network-performance-monitor.md#supported-regions)
@@ -182,8 +183,8 @@ A hop may not respond to a traceroute in one or more of the below scenarios:
182
183
* The network devices are not allowing ICMP_TTL_EXCEEDED traffic.
183
184
* A firewall is blocking the ICMP_TTL_EXCEEDED response from the network device.
184
185
185
-
### I get alerts for unhealthy tests but I do not see the high values in NPM's loss and latency graph. How do I check what is unhealthy?
186
-
NPM raises an alert if end to end latency between source and destination crosses the threshhold for any path between them. Some networks have more than one paths connecting the same source and destination. NPM raises an alert is any path is unhealthy. The loss and latency seen in the graphs is the average value for all the paths, hence it may not show the exact value of a single path. To understand where the threshold has been breached, look for the "SubType" column in the alert. If the issue is caused by a path the SubType value will be NetworkPath (for Performance Monitor tests), EndpointPath (for Service Connectivity Monitor tests) and ExpressRoutePath (for ExpressRotue Monitor tests).
186
+
### I get alerts for unhealthy tests but I do not see the high values in NPM's loss and latency graph. How do I check what is unhealthy?
187
+
NPM raises an alert if end to end latency between source and destination crosses the threshold for any path between them. Some networks have multiple paths connecting the same source and destination. NPM raises an alert is any path is unhealthy. The loss and latency seen in the graphs is the average value for all the paths, hence it may not show the exact value of a single path. To understand where the threshold has been breached, look for the "SubType" column in the alert. If the issue is caused by a path the SubType value will be NetworkPath (for Performance Monitor tests), EndpointPath (for Service Connectivity Monitor tests) and ExpressRoutePath (for ExpressRotue Monitor tests).
187
188
188
189
Sample Query to find is path is unhealthy:
189
190
@@ -193,7 +194,7 @@ Sample Query to find is path is unhealthy:
### Why does my test show unhealthy but the topology does not
196
-
NPM monitors end-to-end loss, latency, and topology at different intervals. Loss and latency are measured once every 5 seconds and aggregated every three minutes (for Performance Monitor and Express Route Monitor) while topology is calculated using traceroute once every 10 minutes. For example, between 3:44 and 4:04, topology may be updated three times (3:44, 3:54, 4:04), but loss and latency are updated about seven times (3:44, 3:47, 3:50, 3:53, 3:56, 3:59, 4:02). The topology generated at 3:54 will be rendered for the loss and latency that gets calculated at 3:56, 3:59 and 4:02. Suppose you get an alert that your ER circuit was unhealthy at 3:59. You log on to NPM and try to set the topology time to 3:59. NPM will render the topology generated at 3:54. To understand the last known topology of your network, compare the fields TimeProcessed (time at which loss and latency was calculated) and TracerouteCompletedTime(time at which topology was calculated).
197
+
NPM monitors end-to-end loss, latency, and topology at different intervals. Loss and latency are measured once every 5 seconds and aggregated every three minutes (for Performance Monitor and Express Route Monitor) while topology is calculated using traceroute once every 10 minutes. For example, between 3:44 and 4:04, topology may be updated three times (3:44, 3:54, 4:04), but loss and latency are updated about seven times (3:44, 3:47, 3:50, 3:53, 3:56, 3:59, 4:02). The topology generated at 3:54 will be rendered for the loss and latency that gets calculated at 3:56, 3:59 and 4:02. Suppose you get an alert that your ER circuit was unhealthy at 3:59. You log on to NPM and try to set the topology time to 3:59. NPM will render the topology generated at 3:54. To understand the last known topology of your network, compare the fields TimeProcessed (time at which loss and latency was calculated) and TracerouteCompletedTime(time at which topology was calculated).
197
198
198
199
### What is the difference between the fields E2EMedianLatency and AvgHopLatencyList in the NetworkMonitoring table
199
200
E2EMedianLatency is the latency updated every three minutes after aggregating the results of tcp ping tests, whereas AvgHopLatencyList is updated every 10 mins based on traceroute. To understand the exact time at which E2EMedianLatency was calculated, use the field TimeProcessed. To understand the exact time at which traceroute completed and updated AvgHopLatencyList, use the field TracerouteCompletedTime
0 commit comments