
Commit 0fc350b

Add new resources for Istio
1 parent 3a4cde9 commit 0fc350b

6 files changed: +5895 -42 lines changed


resources/istio/ALERTS.md

Lines changed: 12 additions & 13 deletions
@@ -1,19 +1,18 @@
 # Alerts
+## [Istio-Citadel] CSR without success
+Some of the Certificate Signing Request (CSR) were not correctly requested
 
-## No Output Records From Plugin
-No Output Records From Plugin.
+## [Istio-Pilot] Inbound listener rules conflicts
+There are some conflict with inbound listener rules
 
-## Percentage of instances low
-Percentage of instances low
+## [Istio-Pilot] Endpoint found in unready state
+Endpoint found in unready state
 
-## Recently reboot
-Recently reboot
+## [Istio] Unstable requests for sidecar injections
+Sidecar injections requests has no been successfully resolved
 
-## Limit of child proccess exceeded
-Limit of child proccess exceeded
+## [Istio-Envoy] High 4xx RequestError Rate
+High 4xx RequestError Rate
 
-## Reaching limit of queue process
-Reaching limit of queue process
-
-## Requests processed reached timeout limit
-Requests processed reached timeout limit
+## [Istio-Envoy] High Request Latency
+Envoy High Request Latency

resources/istio/alerts.yaml

Lines changed: 28 additions & 20 deletions
@@ -9,45 +9,53 @@ configurations:
 - kind: Prometheus
   data: |-
     groups:
-    - name: Php-Fpm
+    - name: Istio
       rules:
-      - alert: '[Php-Fpm] Percentage of instances low'
+      - alert: '[Istio-Citadel] CSR without success'
         expr: |
-          sum (phpfpm_up==1) / sum (phpfpm_up) < 0.75
+          rate(citadel_server_csr_count - citadel_server_success_cert_issuance_count[5m]) > 0
         for: 5m
         labels:
-          severity: critical
+          severity: warning
         annotations:
-          description: Less than 75% of instances are down
-      - alert: '[Php-Fpm] Recently reboot'
+          description: Some of the Certificate Signing Request (CSR) were not correctly requested
+      - alert: '[Istio-Pilot] Inbound listener rules conflicts'
         expr: |
-          (count (phpfpm_start_since < 1800) or vector (0))/sum (phpfpm_up) > 0.4
+          pilot_conflict_inbound_listener > 0
         for: 5m
         labels:
-          severity: info
+          severity: warning
         annotations:
-          description: Instances have been recently reboot
-      - alert: '[Php-Fpm] Limit of child proccess exceeded'
+          description: There are some conflict with inbound listener rules
+      - alert: '[Istio-Pilot] Endpoint found in unready state'
         expr: |
-          sum (rate (phpfpm_max_children_reached[5m])) > 0
+          pilot_endpoint_not_ready > 0
         for: 5m
         labels:
-          severity: critical
+          severity: warning
         annotations:
-          description: Number of childs process have been exceeded
-      - alert: '[Php-Fpm] Reaching limit of queue process'
+          description: Endpoint found in unready state
+      - alert: '[Istio] Unstable requests for sidecar injections'
         expr: |
-          (phpfpm_listen_queue / phpfpm_listen_queue_length) > 0.8
+          sidecar_injection_requests_total - sidecar_injection_success_total > 0
         for: 5m
         labels:
           severity: warning
         annotations:
-          description: Buffer of queue requests reaching its limit
-      - alert: '[Php-Fpm] Too slow requests processing'
+          description: Sidecar injections requests has no been successfully resolved
+      - alert: '[Istio-Envoy] High 4xx RequestError Rate'
         expr: |
-          rate (phpfpm_slow_requests[5m]) / rate (phpfpm_process_requests[5m]) > 0.1
+          sum (rate(istio_requests_total{ response_code=~"4..", reporter="source"}[5m]))/ sum (rate(istio_requests_total{ reporter="source"}[5m]))> 0.05
         for: 5m
         labels:
-          severity: warning
+          severity: critical
+        annotations:
+          description: High 4xx RequestError Rate
+      - alert: '[Istio-Envoy] High Request Latency'
+        expr: |
+          histogram_quantile(0.95,sum(rate(envoy_cluster_upstream_cx_connect_ms_bucket[5m])) by (le)) > 100
+        for: 5m
+        labels:
+          severity: critical
         annotations:
-          description: Requests have taking too much time to be processed
+          description: Envoy High Request Latency
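
Review note on the two difference-based expressions in `resources/istio/alerts.yaml`. In the `[Istio-Citadel] CSR without success` rule, `rate()` wraps a subtraction in which only the second operand carries the `[5m]` range selector. PromQL's `rate()` takes a range vector, so that expression would not parse. The `[Istio] Unstable requests for sidecar injections` rule subtracts two raw values; if both metrics are monotonic counters, as their `_total` suffix suggests, the difference stays above zero after a single unsuccessful injection until the counters reset. The fragment below is a minimal sketch of one possible rewrite, not the committed rules. It assumes all four metrics are counters and that comparing their growth over the same 5-minute window matches the intent; the `sum()` wrappers are an added assumption to avoid label-matching mismatches between the two sides.

```yaml
groups:
- name: Istio
  rules:
  - alert: '[Istio-Citadel] CSR without success'
    # Assumption: both metrics are counters, so each side gets its own
    # range selector and rate() before the subtraction.
    expr: |
      sum(rate(citadel_server_csr_count[5m])) - sum(rate(citadel_server_success_cert_issuance_count[5m])) > 0
    for: 5m
    labels:
      severity: warning
    annotations:
      description: Some of the Certificate Signing Request (CSR) were not correctly requested
  - alert: '[Istio] Unstable requests for sidecar injections'
    # Assumption: compare increases over the window rather than lifetime
    # totals, so the alert can resolve once injections succeed again.
    expr: |
      sum(increase(sidecar_injection_requests_total[5m])) - sum(increase(sidecar_injection_success_total[5m])) > 0
    for: 5m
    labels:
      severity: warning
    annotations:
      description: Sidecar injections requests has no been successfully resolved
```

With windowed increases, a past failure ages out of the comparison after 5 minutes, at the cost of missing failures that occur while Prometheus is not scraping. Requests that are neither successes nor failures would still count toward the difference, so the threshold may need adjusting.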

resources/istio/dashboards.v1.14.yaml

Lines changed: 9 additions & 9 deletions
@@ -7,22 +7,22 @@ appVersion:
 configurations:
 - name: Istiod Service
   kind: Sysdig
-  image: istio/images/workload_sysdig.png
+  image: istio/images/servicev1.14.png
   description: |
     This dashboard offers information on:
-    * Metrics for each workload and also inbound workloads
-  file: include/dashboard-Sysdig-Workload-1.5.json
+    *
+  file: include/servicev1.14.json
 - name: Istio Control Plane
   kind: Sysdig
-  image: istio/images/service_sysdig.png
+  image: istio/images/control_planev1.14.png
   description: |
     This dashboard offers information on:
-    * Metrics for the service and client workload
-  file: include/dashboard-Sysdig-Service-1.5.json
+    *
+  file: include/control_planev1.14.json
 - name: Istio Envoy
   kind: Sysdig
-  image: istio/images/service_sysdig.png
+  image: istio/images/istio_envoyv1.14.png
   description: |
     This dashboard offers information on:
-    * Metrics for the service and client workload
-  file: include/dashboard-Sysdig-Service-1.5.json
+    *
+  file: include/istio_envoyv1.14.json
