Skip to content

Commit eeab8a8

Browse files
authored
Make individual metric examples expandable
Show user list of possible metric examples and allow them to select which ones they wish to view
1 parent ee9bca4 commit eeab8a8

File tree

1 file changed

+22
-7
lines changed

1 file changed

+22
-7
lines changed

articles/load-balancer/load-balancer-standard-diagnostics.md

Lines changed: 22 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -80,8 +80,10 @@ To configure alerts:
8080

8181
>[!NOTE]
8282
>Alert condition configuration window will show time series for signal history. There is an option to filter this time series by dimensions such as Backend IP. This will filter the time series graph but **not** the alert itself. You cannot configure alerts for specific Backend IP addresses.
83-
83+
8484
#### Is the data path up and available for my load balancer VIP?
85+
<details>
86+
<summary>Click to expand!</summary>
8587

8688
The VIP availability metric describes the health of the data path within the region to the compute host where your VMs are located. The metric is a reflection of the health of the Azure infrastructure. You can use the metric to:
8789
- Monitor the external availability of your service
@@ -109,9 +111,11 @@ VIP availability fails for the following reasons:
109111
For diagnostic purposes, you can use the [Data Path Availability metric together with the health probe status](#vipavailabilityandhealthprobes).
110112

111113
Use **Average** as the aggregation for most scenarios.
114+
</details>
112115

113116
#### Are the back-end instances for my VIP responding to probes?
114-
117+
<details>
118+
<summary>Click to expand!</summary>
115119
The health probe status metric describes the health of your application deployment as configured by you when you configure the health probe of your load balancer. The load balancer uses the status of the health probe to determine where to send new flows. Health probes originate from an Azure infrastructure address and are visible within the guest OS of the VM.
116120

117121
To get the health probe status for your Standard Load Balancer resources:
@@ -123,9 +127,11 @@ Health probes fail for the following reasons:
123127
- Your probe is not permitted by the Network Security Group, the VM's guest OS firewall, or the application layer filters.
124128

125129
Use **Average** as the aggregation for most scenarios.
130+
</details>
126131

127132
#### How do I check my outbound connection statistics?
128-
133+
<details>
134+
<summary>Click to expand!</summary>
129135
The SNAT connections metric describes the volume of successful and failed connections for [outbound flows](https://aka.ms/lboutbound).
130136

131137
A failed connections volume of greater than zero indicates SNAT port exhaustion. You must investigate further to determine what may be causing these failures. SNAT port exhaustion manifests as a failure to establish an [outbound flow](https://aka.ms/lboutbound). Review the article about outbound connections to understand the scenarios and mechanisms at work, and to learn how to mitigate and design to avoid SNAT port exhaustion.
@@ -137,10 +143,12 @@ To get SNAT connection statistics:
137143
![SNAT connection](./media/load-balancer-standard-diagnostics/LBMetrics-SNATConnection.png)
138144

139145
*Figure: Load Balancer SNAT connection count*
146+
</details>
140147

141148

142149
#### How do I check my SNAT port usage and allocation?
143-
150+
<details>
151+
<summary>Click to expand!</summary>
144152
The SNAT Usage metric indicates how many unique flows are established between an internet source and a backend VM or virtual machine scale set that is behind a load balancer and does not have a public IP address. By comparing this with the SNAT Allocation metric, you can determine if your service is experiencing or at risk of SNAT exhaustion and resulting outbound flow failure.
145153

146154
If your metrics indicate risk of [outbound flow](https://aka.ms/lboutbound) failure, reference the article and take steps to mitigate this to ensure service health.
@@ -162,20 +170,24 @@ To view SNAT port usage and allocation:
162170
![SNAT usage by backend instance](./media/load-balancer-standard-diagnostics/snat-usage-split.png)
163171

164172
*Figure: TCP SNAT port usage per backend instance*
173+
</details>
165174

166175
#### How do I check inbound/outbound connection attempts for my service?
167-
176+
<details>
177+
<summary>Click to expand!</summary>
168178
A SYN packets metric describes the volume of TCP SYN packets, which have arrived or were sent (for [outbound flows](https://aka.ms/lboutbound)) that are associated with a specific front end. You can use this metric to understand TCP connection attempts to your service.
169179

170180
Use **Total** as the aggregation for most scenarios.
171181

172182
![SYN connection](./media/load-balancer-standard-diagnostics/LBMetrics-SYNCount.png)
173183

174184
*Figure: Load Balancer SYN count*
185+
</details>
175186

176187

177188
#### How do I check my network bandwidth consumption?
178-
189+
<details>
190+
<summary>Click to expand!</summary>
179191
The bytes and packet counters metric describes the volume of bytes and packets that are sent or received by your service on a per-front-end basis.
180192

181193
Use **Total** as the aggregation for most scenarios.
@@ -189,9 +201,11 @@ To get byte or packet count statistics:
189201
![Byte count](./media/load-balancer-standard-diagnostics/LBMetrics-ByteCount.png)
190202

191203
*Figure: Load Balancer byte count*
204+
</details>
192205

193206
#### <a name = "vipavailabilityandhealthprobes"></a>How do I diagnose my load balancer deployment?
194-
207+
<details>
208+
<summary>Click to expand!</summary>
195209
By using a combination of the VIP availability and health probe metrics on a single chart you can identify where to look for the problem and resolve the problem. You can gain assurance that Azure is working correctly and use this knowledge to conclusively determine that the configuration or application is the root cause.
196210

197211
You can use health probe metrics to understand how Azure views the health of your deployment as per the configuration you have provided. Looking at health probes is always a great first step in monitoring or determining a cause.
@@ -207,6 +221,7 @@ The chart displays the following information:
207221
- The health probe status (DIP availability), indicated by the purple trace, is at 0 percent at the beginning of the chart. The circled area in green highlights where the health probe status (DIP availability) became healthy, and at which point the customer's deployment was able to accept new flows.
208222

209223
The chart allows customers to troubleshoot the deployment on their own without having to guess or ask support whether other issues are occurring. The service was unavailable because health probes were failing due to either a misconfiguration or a failed application.
224+
</details>
210225

211226
## <a name = "ResourceHealth"></a>Resource health status
212227

0 commit comments

Comments
 (0)