Skip to content

Commit 4cfd694

Browse files
chetanchoudhary-sumoJV0812amee-sumo
authored
SUMO-251500: Adding monitor's information to OTEL Apps Set2 (#4876)
* SUMO-251500: Adding monitor's information to OTEL Apps Set2 * Adding Oracle and IIS10 Otel monitor info * Update docs/integrations/databases/opentelemetry/mariadb-opentelemetry.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/integrations/microsoft-azure/opentelemetry/sql-server-linux-opentelemetry.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/integrations/databases/opentelemetry/couchbase-opentelemetry.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/integrations/web-servers/opentelemetry/squid-proxy-opentelemetry.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/integrations/microsoft-azure/opentelemetry/sql-server-linux-opentelemetry.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/integrations/microsoft-azure/opentelemetry/sql-server-linux-opentelemetry.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/integrations/microsoft-azure/opentelemetry/sql-server-linux-opentelemetry.md Co-authored-by: Jagadisha V <[email protected]> * Update docs/integrations/databases/opentelemetry/oracle-opentelemetry.md Co-authored-by: Amee Lepcha <[email protected]> * Update docs/integrations/databases/opentelemetry/oracle-opentelemetry.md Co-authored-by: Amee Lepcha <[email protected]> * Update docs/integrations/databases/opentelemetry/oracle-opentelemetry.md Co-authored-by: Amee Lepcha <[email protected]> * Update docs/integrations/microsoft-azure/opentelemetry/sql-server-linux-opentelemetry.md Co-authored-by: Amee Lepcha <[email protected]> * Update docs/integrations/databases/opentelemetry/oracle-opentelemetry.md Co-authored-by: Amee Lepcha <[email protected]> * Update docs/integrations/databases/opentelemetry/oracle-opentelemetry.md Co-authored-by: Amee Lepcha <[email protected]> * Update docs/integrations/databases/opentelemetry/oracle-opentelemetry.md Co-authored-by: Amee Lepcha <[email protected]> * Update docs/integrations/databases/opentelemetry/oracle-opentelemetry.md Co-authored-by: Amee Lepcha <[email protected]> * Update docs/integrations/databases/opentelemetry/oracle-opentelemetry.md Co-authored-by: Amee Lepcha <[email protected]> * Update docs/integrations/databases/opentelemetry/oracle-opentelemetry.md Co-authored-by: Amee Lepcha <[email protected]> * Update sql-server-linux-opentelemetry.md * Update iis-10-opentelemetry.md --------- Co-authored-by: Jagadisha V <[email protected]> Co-authored-by: Amee Lepcha <[email protected]>
1 parent 8ab62fd commit 4cfd694

File tree

7 files changed

+123
-0
lines changed

7 files changed

+123
-0
lines changed

docs/integrations/databases/opentelemetry/couchbase-opentelemetry.md

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -226,3 +226,20 @@ Use this dashboard to:
226226
- To understand user behavior accessing clusters and servers through Rest API.
227227

228228
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Couchbase-OpenTelemetry/Couchbase-HTTP-Access.png' alt="Access" />
229+
230+
## Create monitors for Couchbase app
231+
232+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
233+
234+
<CreateMonitors/>
235+
236+
### Couchbase alerts
237+
238+
| Name | Description | Alert Condition | Recover Condition |
239+
|:--|:--|:--|:--|
240+
| `Couchbase - Bucket Not Ready` | This alert is triggered when a bucket in the Couchbase cluster is not ready. | Count `>` 0 | Count `<=` 0 |
241+
| `Couchbase - High Latency HTTP Requests` | This alert is triggered on high average latency for HTTP requests to the Couchbase. | Count `>` 1000 | Count `<=` 1000 |
242+
| `Couchbase - Node Down` | This alert is triggered when a node in the Couchbase cluster is down. | Count `>` 0 | Count `<=` 0 |
243+
| `Couchbase - Node Not Respond` | This alert is triggered when a node in the Couchbase cluster does not respond too many times. | Count `>=` 10 | Count `<` 10 |
244+
| `Couchbase - Too Many Error Queries on Buckets` | This alert is triggered when there are too many error queries on a bucket in a Couchbase cluster. | Count `>=` 1000 | Count `<` 1000 |
245+
| `Couchbase - Too Many Login Failures` | This alert is triggered when there are too many login failures to a node in a Couchbase cluster. | Count `>=` 1000 | Count `<` 1000 |

docs/integrations/databases/opentelemetry/mariadb-opentelemetry.md

Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -248,3 +248,19 @@ Use this dashboard to:
248248
- Examine slow query trends to determine if there are periodic performance bottlenecks in your database clusters.
249249

250250
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/MariaDB-OpenTelemetry/MariaDB-Slow-Queries.png' alt="Slow Queries" />
251+
252+
## Create monitors for MariaDB app
253+
254+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
255+
256+
<CreateMonitors/>
257+
258+
### MariaDB alerts
259+
260+
| Name | Description | Alert Condition | Recover Condition |
261+
|:--|:--|:--|:--|
262+
| `MariaDB - Critical Errors` | This alert is triggered when there are critical database errors. | Count `>` 10 | Count `<=` 10 |
263+
| `MariaDB - Excessive Slow Query Detected` | This alert is triggered when the average time to execute a query is more than 15 seconds for a 5 minute time interval. | Count `>=` 1 | Count `<` 1 |
264+
| `MariaDB - Failed Login Attempts` | This alert is triggered when there are excessive failed login attempts in a short period. | Count `>=` 1 | Count `<` 1 |
265+
| `MariaDB - Instance down` | This alert is triggered when the MariaDB instance is down. | Count `>=` 1 | Count `<` 1 |
266+
| `MariaDB - Replication Failure` | This alert is triggered when there are replication failures. | Count `>=` 1 | Count `<` 1 |

docs/integrations/databases/opentelemetry/oracle-opentelemetry.md

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -559,3 +559,26 @@ See information derived from the syslog audit trail, including successful and fa
559559
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Oracle-OpenTelemetry/Oracle-Performance-Details.png' alt="Monitor Performance by DB Script" />
560560
The Oracle - Performance Details dashboard gives insight about - count of rollback, commits, transaction, process, session.
561561
In addition to this it helps monitoring physical and logical reads, PGA allocated. This dashboard is based on the [metrics collected by Oracle DB opentelemetry receiver](https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/main/receiver/oracledbreceiver/documentation.md).
562+
563+
## Create monitors for Oracle app
564+
565+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
566+
567+
<CreateMonitors/>
568+
569+
### Oracle alerts
570+
571+
| Name | Description | Alert Condition | Recover Condition |
572+
|:--|:--|:--|:--|
573+
| `Oracle - Admin Restricted Command Execution` | This alert is triggered when the Listener cannot resolve a command. | Count `>` 0 | Count `<=` 0 |
574+
| `Oracle - Archival Log Creation` | This alert is triggered when an archive log creation error occurs. | Count `>` 0 | Count `<=` 0 |
575+
| `Oracle - Block Corruption` | This alert is triggered when corrupt data blocks are detected. | Count `>` 0 | Count `<=` 0 |
576+
| `Oracle - Database Crash` | This alert is triggered when the database crashes. | Count `>` 0 | Count `<=` 0 |
577+
| `Oracle - Deadlock` | This alert is triggered when deadlocks are detected. | Count `>` 5 | Count `<=` 5 |
578+
| `Oracle - Fatal NI Connect Error` | This alert is triggered when a "Fatal NI connect error" is detected. | Count `>` 0 | Count `<=` 0 |
579+
| `Oracle - Internal Errors` | This alert is triggered when internal errors are detected. | Count `>` 0 | Count `<=` 0 |
580+
| `Oracle - Login Fail` | This alert is triggered when a user login failure is detected. | Count `>` 0 | Count `<=` 0 |
581+
| `Oracle - Possible Inappropriate Activity` | This alert is triggered when possible inappropriate activity is detected. | Count `>` 0 | Count `<=` 0 |
582+
| `Oracle - TNS Error` | This alert is triggered when TNS operation errors are detected. | Count `>` 0 | Count `<=` 0 |
583+
| `Oracle - Unable To Extend Tablespace` | This alert is triggered when tablespace extension failures are detected. | Count `>` 0 | Count `<=` 0 |
584+
| `Oracle - Unauthorized Command Execution` | This alert is triggered when a user is not authorized to execute a requested listener command in an Oracle instance. | Count `>` 0 | Count `<=` 0 |

docs/integrations/microsoft-azure/opentelemetry/sql-server-linux-opentelemetry.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -184,3 +184,21 @@ Use this dashboard to:
184184
- Monitor any errors and warnings.
185185

186186
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/SQLServer-Linux-OpenTelemetry/SQL-Server-Operations.png' alt="Operations" />
187+
188+
## Create monitors for SQL Server Linux app
189+
190+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
191+
192+
<CreateMonitors/>
193+
194+
### SQL Server Linux alerts
195+
196+
| Name | Description | Alert Condition | Recover Condition |
197+
|:--|:--|:--|:--|
198+
| `SQL Server - AppDomain` | This alert is triggered when AppDomain-related issues are detected in your SQL Server instance. | Count `>=` 1 | Count `<` 1 |
199+
| `SQL Server - Backup Fail` | This alert is triggered when the SQL Server backup fails. | Count `>=` 1 | Count `<` 1 |
200+
| `SQL Server - Deadlock` | This alert is triggered when deadlocks are detected in a SQL Server instance. | Count `>` 5 | Count `<=` 5 |
201+
| `SQL Server - Instance Down` | This alert is triggered when the SQL Server instance is down for 5 minutes. | Count `>` 0 | Count `<=` 0 |
202+
| `SQL Server - Insufficient Space` | This alert is triggered when the SQL Server instance cannot allocate a new page for the database due to insufficient disk space in the filegroup. | Count `>` 0 | Count `<=` 0 |
203+
| `SQL Server - Login Fail` | This alert is triggered when the user is unable to login to the SQL Server. | Count `>=` 1 | Count `<` 1 |
204+
| `SQL Server - Mirroring Error` | This alert is triggered when an error occurs in SQL Server mirroring. | Count `>=` 1 | Count `<` 1 |

docs/integrations/web-servers/opentelemetry/iis-10-opentelemetry.md

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -318,3 +318,23 @@ The **IIS - Web Service** dashboard provides a high-level view of the Web Servic
318318

319319
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/IIS-OpenTelemetry/IIS-Web-Service.png' alt="IIS-Web-Service" />
320320

321+
## Create monitors for IIS app
322+
323+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
324+
325+
<CreateMonitors/>
326+
327+
### IIS alerts
328+
329+
| Name | Description | Alert Condition | Recover Condition |
330+
|:--|:--|:--|:--|
331+
| `IIS - Access from Highly Malicious Sources` | This alert is triggered when an IIS server is accessed from highly malicious IP addresses. | Count `>` 0 | Count `<=` 0 |
332+
| `IIS - ASP.NET Application Errors` | This alert is triggered when an error is detected in the ASP.NET applications running on an IIS server. | Count `>` 0 | Count `<=` 0 |
333+
| `IIS - Blocked Async IO Requests` | This alert is triggered when blocked async I/O requests are detected on an IIS server. | Count `>` 0 | Count `<=` 0 |
334+
| `IIS - Error Events` | This alert is triggered when an error is detected in the IIS logs. | Count `>` 0 | Count `<=` 0 |
335+
| `IIS - High ASP.NET Current Requests` | This alert is triggered when the current ASP.NET request count exceeds the given value (Default 500). | Count `>` 500 | Count `<=` 500 |
336+
| `IIS - High Client (HTTP 4xx) Error Rate (Copy)` | This alert is triggered when more than 5% of HTTP requests result in a 4xx response code. | Count `>` 0 | Count `<=` 0 |
337+
| `IIS - High Current Connections` | This alert is triggered when the current connections exceed the given value (Default 1000), indicating potential capacity issues. | Count `>` 1000 | Count `<=` 1000 |
338+
| `IIS - High Server (HTTP 5xx) Error Rate` | This alert is triggered when more than 5% of HTTP requests result in a 5xx response code. | Count `>` 0 | Count `<=` 0 |
339+
| `IIS - No Worker Processes` | This alert is triggered when the worker process count drops to zero, indicating potential application pool issues. | Count `<` 1 | Count `>=` 1 |
340+
| `IIS - Slow Response Time` | This alert is triggered when the response time for a given IIS server exceeds one second. | Count `>` 0 | Count `<=` 0 |

docs/integrations/web-servers/opentelemetry/squid-proxy-opentelemetry.md

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -194,3 +194,18 @@ The **The Squid Proxy - HTTP Response Analysis** dashboard provides insights int
194194
The **Squid Proxy - Quality of Service** dashboard provides insights into latency, the response time of requests according to HTTP action, and the response time according to location.
195195

196196
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Squid-Proxy-OpenTelemetry/Squid-Proxy-Quality-of-Service.png' alt="Quality of Service" />
197+
198+
## Create monitors for SquidProxy app
199+
200+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
201+
202+
<CreateMonitors/>
203+
204+
### SquidProxy alerts
205+
206+
| Name | Description | Alert Condition | Recover Condition |
207+
|:--|:--|:--|:--|
208+
| `Squid Proxy - High Client (HTTP 4xx) Error Rate` | This alert is triggered when there are too many HTTP requests (>5%) with a response status of 4xx. | Count `>` 0 | Count `<=` 0 |
209+
| `Squid Proxy - High Denied Request` | This alert is triggered when there are too many HTTP denied requests (>5%). | Count `>` 0 | Count `<=` 0 |
210+
| `Squid Proxy - High Response Time` | This alert is triggered when requests are taking too long to process. | Count `>` 20 | Count `<=` 20 |
211+
| `Squid Proxy - High Server (HTTP 5xx) Error Rate` | This alert is triggered when there are too many HTTP requests (>5%) with a response status of 5xx. | Count `>` 0 | Count `<=` 0 |

docs/integrations/web-servers/opentelemetry/varnish-opentelemetry.md

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -184,3 +184,17 @@ The **Varnish - Visitor Traffic Insight** dashboard provides detailed informatio
184184
The **Varnish - Web Server Operations** dashboard provides a high-level view combined with detailed information on the top ten bots, geographic locations and data for clients with high error rates, server errors over time, and non 200 response code status codes. Dashboard panels also show information on server error logs, error log levels, error responses by server, and the top URIs responsible for 404 responses.
185185
186186
<img src='https://sumologic-app-data-v2.s3.amazonaws.com/dashboards/Varnish-OpenTelemetry/Varnish-Web-Server-Operations.png' alt="Web Server Operations" />
187+
188+
## Create monitors for Varnish app
189+
190+
import CreateMonitors from '../../../reuse/apps/create-monitors.md';
191+
192+
<CreateMonitors/>
193+
194+
### Varnish alerts
195+
196+
| Name | Description | Alert Condition | Recover Condition |
197+
|:--|:--|:--|:--|
198+
| `Varnish - Access from Highly Malicious Sources` | This alert is triggered when Varnish is accessed from highly malicious IP addresses. | Count `>` 0 | Count `<=` 0 |
199+
| `Varnish - High 4XX Error Rate` | This alert is triggered when there are too many HTTP requests (>5%) with a response status of 4xx. | Count `>` 5 | Count `<=` 5 |
200+
| `Varnish - High 5XX Error Rate` | This alert is triggered when there are too many HTTP requests (>5%) with a response status of 5xx. | Count `>` 5 | Count `<=` 5 |

0 commit comments

Comments
 (0)