Skip to content

Commit fbf5007

Browse files
SUMO-251500: Adding monitor's information to OTEL Apps Set2
1 parent 3fce183 commit fbf5007

File tree

5 files changed

+80
-0
lines changed

5 files changed

+80
-0
lines changed

docs/integrations/databases/opentelemetry/couchbase-opentelemetry.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -226,3 +226,20 @@ Use this dashboard to:
226226
- To understand user behavior accessing clusters and servers through Rest API.
227227

228228
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Couchbase-OpenTelemetry/Couchbase-HTTP-Access.png' alt="Access" />
229+
230+
## Create monitors for Couchbase app
231+
232+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
233+
234+
<CreateMonitors/>
235+
236+
### Couchbase alerts
237+
238+
| Name | Description | Alert Condition | Recover Condition |
239+
|:--|:--|:--|:--|
240+
| `Couchbase - Bucket Not Ready` | This alert is triggered when a bucket in the Couchbase cluster is not ready. | Count `>` 0 | Count `<=` 0 |
241+
| `Couchbase - High Latency HTTP Requests` | This alert is triggered on high average latency for HTTP requests to Couchbase | Count `>` 1000 | Count `<=` 1000 |
242+
| `Couchbase - Node Down` | This alert is triggered when a node in the Couchbase cluster is down. | Count `>` 0 | Count `<=` 0 |
243+
| `Couchbase - Node Not Respond` | This alert is triggered when a node in the Couchbase cluster does not respond too many times. | Count `>=` 10 | Count `<` 10 |
244+
| `Couchbase - Too Many Error Queries on Buckets` | This alert is triggered when there are too many error queries on a bucket in a Couchbase cluster. | Count `>=` 1000 | Count `<` 1000 |
245+
| `Couchbase - Too Many Login Failures` | This alert is triggered when there are too many login failures to a node in a Couchbase cluster. | Count `>=` 1000 | Count `<` 1000 |

docs/integrations/databases/opentelemetry/mariadb-opentelemetry.md

Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -248,3 +248,19 @@ Use this dashboard to:
248248
- Examine slow query trends to determine if there are periodic performance bottlenecks in your database clusters.
249249

250250
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/MariaDB-OpenTelemetry/MariaDB-Slow-Queries.png' alt="Slow Queries" />
251+
252+
## Create monitors for MariaDB app
253+
254+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
255+
256+
<CreateMonitors/>
257+
258+
### MariaDB alerts
259+
260+
| Name | Description | Alert Condition | Recover Condition |
261+
|:--|:--|:--|:--|
262+
| `MariaDB - Critical Errors` | This alert is triggered when there are critical database errors. | Count `>` 10 | Count `<=` 10 |
263+
| `MariaDB - Excessive Slow Query Detected` | This alert is triggered when the average time to execute a query is more than 15 seconds for a 5 minute time interval. | Count `>=` 1 | Count `<` 1 |
264+
| `MariaDB - Failed Login Attempts` | This alert is triggered when there are excessive failed login attempts in a short period. | Count `>=` 1 | Count `<` 1 |
265+
| `MariaDB - Instance down` | This alert is triggered when we detect that a MariaDB instance is down | Count `>=` 1 | Count `<` 1 |
266+
| `MariaDB - Replication Failure` | This alert is triggered when there are replication failures. | Count `>=` 1 | Count `<` 1 |

docs/integrations/microsoft-azure/opentelemetry/sql-server-linux-opentelemetry.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -184,3 +184,21 @@ Use this dashboard to:
184184
- Monitor any errors and warnings.
185185

186186
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/SQLServer-Linux-OpenTelemetry/SQL-Server-Operations.png' alt="Operations" />
187+
188+
## Create monitors for SQL Server Linux app
189+
190+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
191+
192+
<CreateMonitors/>
193+
194+
### SQL Server Linux alerts
195+
196+
| Name | Description | Alert Condition | Recover Condition |
197+
|:--|:--|:--|:--|
198+
| `SQL Server - AppDomain` | This alert is triggered when we detect AppDomain related issues in your SQL Server instance. | Count `>=` 1 | Count `<` 1 |
199+
| `SQL Server - Backup Fail` | This alert is triggered when we detect that the SQL Server backup failed. | Count `>=` 1 | Count `<` 1 |
200+
| `SQL Server - Deadlock` | This alert is triggered when we detect deadlocks in a SQL Server instance. | Count `>` 5 | Count `<=` 5 |
201+
| `SQL Server - Instance Down` | This alert is triggered when we detect that the SQL Server instance is down for 5 minutes. | Count `>` 0 | Count `<=` 0 |
202+
| `SQL Server - Insufficient Space` | This alert is triggered when SQL Server instance could not allocate a new page for database because of insufficient disk space in filegroup. | Count `>` 0 | Count `<=` 0 |
203+
| `SQL Server - Login Fail` | This alert is triggered when we detect that the user cannot login to SQL Server. | Count `>=` 1 | Count `<` 1 |
204+
| `SQL Server - Mirroring Error` | This alert is triggered when we detect that the SQL Server mirroring has error. | Count `>=` 1 | Count `<` 1 |

docs/integrations/web-servers/opentelemetry/squid-proxy-opentelemetry.md

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -194,3 +194,18 @@ The **The Squid Proxy - HTTP Response Analysis** dashboard provides insights int
194194
The **Squid Proxy - Quality of Service** dashboard provides insights into latency, the response time of requests according to HTTP action, and the response time according to location.
195195

196196
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Squid-Proxy-OpenTelemetry/Squid-Proxy-Quality-of-Service.png' alt="Quality of Service" />
197+
198+
## Create monitors for SquidProxy app
199+
200+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
201+
202+
<CreateMonitors/>
203+
204+
### SquidProxy alerts
205+
206+
| Name | Description | Alert Condition | Recover Condition |
207+
|:--|:--|:--|:--|
208+
| `Squid Proxy - High Client (HTTP 4xx) Error Rate` | This alert is triggered when there are too many HTTP requests (>5%) with a response status of 4xx. | Count `>` 0 | Count `<=` 0 |
209+
| `Squid Proxy - High Denied Request` | This alert is triggered when there are too many HTTP denied requests (>5%) | Count `>` 0 | Count `<=` 0 |
210+
| `Squid Proxy - High Response Time` | This alert is triggered when requests are taking too long to process. | Count `>` 20 | Count `<=` 20 |
211+
| `Squid Proxy - High Server (HTTP 5xx) Error Rate` | This alert is triggered when there are too many HTTP requests (>5%) with a response status of 5xx. | Count `>` 0 | Count `<=` 0 |

docs/integrations/web-servers/opentelemetry/varnish-opentelemetry.md

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -184,3 +184,17 @@ The **Varnish - Visitor Traffic Insight** dashboard provides detailed informatio
184184
The **Varnish - Web Server Operations** dashboard provides a high-level view combined with detailed information on the top ten bots, geographic locations and data for clients with high error rates, server errors over time, and non 200 response code status codes. Dashboard panels also show information on server error logs, error log levels, error responses by server, and the top URIs responsible for 404 responses.
185185
186186
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Varnish-OpenTelemetry/Varnish-Web-Server-Operations.png' alt="Web Server Operations" />
187+
188+
## Create monitors for Varnish app
189+
190+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
191+
192+
<CreateMonitors/>
193+
194+
### Varnish alerts
195+
196+
| Name | Description | Alert Condition | Recover Condition |
197+
|:--|:--|:--|:--|
198+
| `Varnish - Access from Highly Malicious Sources` | This alert is triggered when Varnish is accessed from highly malicious IP addresses. | Count `>` 0 | Count `<=` 0 |
199+
| `Varnish - High 4XX Error Rate` | This alert is triggered when there are too many HTTP requests (>5%) with a response status of 4xx. | Count `>` 5 | Count `<=` 5 |
200+
| `Varnish - High 5XX Error Rate` | This alert is triggered when there are too many HTTP requests (>5%) with a response status of 5xx. | Count `>` 5 | Count `<=` 5 |

0 commit comments

Comments
 (0)