Skip to content

Commit 5e076a3

Browse files
Merge pull request #234 from sysdiglabs/openshift-state-metrics
Openshift-state-metrics
2 parents 8f0498e + 3dbb73e commit 5e076a3

File tree

10 files changed

+1175
-0
lines changed

10 files changed

+1175
-0
lines changed

apps/openshift-state-metrics.yaml

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
---
2+
apiVersion: v1
3+
kind: App
4+
name: "openshift-state-metrics"
5+
keywords:
6+
- Platform
7+
- OpenShift
8+
- Kubernetes
9+
- Available
10+
availableVersions:
11+
- '4.7'
12+
shortDescription: "Specific metrics for OpenShift"
13+
description: |
14+
openshift-state-metrics expands upon kube-state-metrics by adding metrics for OpenShift specific resources
15+
icon: https://raw.githubusercontent.com/sysdiglabs/promcat-resources/master/apps/images/openshift.png
16+
website: https://github.com/openshift/openshift-state-metrics
17+
available: true
Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
# Alerts
2+
## [OpenShift-state-metrics] CPU Resource Request Quota Usage
3+
Resource request CPU usage is over 90% resource quota.
4+
5+
## [OpenShift-state-metrics] CPU Resource Limit Quota Usage
6+
Resource limit CPU usage is over 90% resource limit quota.
7+
8+
## [OpenShift-state-metrics] Memory Resource Request Quota Usage
9+
Resource request memory usage is over 90% resource quota.
10+
11+
## [OpenShift-state-metrics] Memory Resource Limit Quota Usage
12+
Resource limit memory usage is over 90% resource limit quota.
13+
14+
## [OpenShift-state-metrics] Routes with issues
15+
A route status is in error and is having issues.
16+
17+
## [OpenShift-state-metrics] Buid Processes with issues
18+
A build process is in error or failed status.
Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,5 @@
1+
# Gather the metrics from the prometheus deployed by Openshift
2+
3+
Metrics are automatically gathered by Prometheus Cluster Monitoring, you can query them in the Prometheus built-in console
4+
5+
Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,18 @@
1+
# Openshift-state-metrics
2+
Red Hat® OpenShift® state metrics is a expansion upon kube-state-metrics adding specific OpenShift® resource metrics
3+
4+
Openshift provides a prometheus with Openshift-state-metrics but doesn't provide any dashboard with this information.
5+
You can gather the metrics with our agent and show all metrics in our dashboards or even in the grafana dashboards with
6+
the Sysdig datasource as a Prometheus datasource.
7+
8+
# Metrics
9+
The metrics gives you the information about the following:
10+
- ClusterResourceQuotas
11+
- BuildConfig
12+
- DeploymentConfig
13+
- Routes
14+
15+
# Attributions
16+
The configuration files and dashboards are maintained by [Sysdig team](https://sysdig.com/).
17+
18+
All the metrics are maintained by [OpenShift-state-metrics](https://github.com/openshift/openshift-state-metrics).
Lines changed: 61 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,61 @@
1+
apiVersion: v1
2+
kind: Alert
3+
app: openshift-state-metrics
4+
version: 1.0.0
5+
appVersion:
6+
- '4.7'
7+
descriptionFile: ALERTS.md
8+
configurations:
9+
- kind: Prometheus
10+
data: |
11+
groups:
12+
- name: openshift-state-metrics
13+
rules:
14+
- alert: '[OpenShift-state-metrics] CPU Resource Request Quota Usage'
15+
expr: |
16+
sum by (name) (openshift_clusterresourcequota_usage{resource="requests.cpu", type="used"})/sum by (name) (openshift_clusterresourcequota_usage{resource="requests.cpu", type="hard"})> 0.90
17+
for: 10m
18+
labels:
19+
severity: critical
20+
annotations:
21+
description: Resource request CPU usage is over 90% resource quota.
22+
- alert: '[OpenShift-state-metrics] CPU Resource Limit Quota Usage'
23+
expr: |
24+
sum by (name) (openshift_clusterresourcequota_usage{resource="limits.cpu", type="used"})/sum by (name) (openshift_clusterresourcequota_usage{resource="limits.cpu", type="hard"})> 0.90
25+
for: 10m
26+
labels:
27+
severity: critical
28+
annotations:
29+
description: Resource limit CPU usage is over 90% resource limit quota.
30+
- alert: '[OpenShift-state-metrics] Memory Resource Request Quota Usage'
31+
expr: |
32+
sum by (name) (openshift_clusterresourcequota_usage{resource="requests.memory", type="used"})/sum by (name) (openshift_clusterresourcequota_usage{resource="requests.memory", type="hard"})> 0.85
33+
for: 10m
34+
labels:
35+
severity: critical
36+
annotations:
37+
description: Resource request memory usage is over 90% resource quota.
38+
- alert: '[OpenShift-state-metrics] Memory Resource Limit Quota Usage'
39+
expr: |
40+
sum by (name) (openshift_clusterresourcequota_usage{resource="limits.memory", type="used"})/sum by (name) (openshift_clusterresourcequota_usage{resource="limits.memory", type="hard"})> 0.85
41+
for: 10m
42+
labels:
43+
severity: critical
44+
annotations:
45+
description: Resource limit memory usage is over 90% resource limit quota.
46+
- alert: '[OpenShift-state-metrics] Routes with issues'
47+
expr: |
48+
openshift_route_status{status != 'True'} > 0
49+
for: 10m
50+
labels:
51+
severity: critical
52+
annotations:
53+
description: A route status is in error and is having issues.
54+
- alert: '[OpenShift-state-metrics] Buid Processes with issues'
55+
expr: |
56+
sum by (build, buildconfig) (openshift_build_status_phase_total{build_phase=~"failed|error") > 0
57+
for: 10m
58+
labels:
59+
severity: critical
60+
annotations:
61+
description: A build process is in error or failed status.
Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
apiVersion: v1
2+
kind: Dashboard
3+
app: openshift-state-metrics
4+
version: 1.0.0
5+
appVersion:
6+
- '4.7'
7+
configurations:
8+
- name: 'OpenShift-state-metrics dashboard'
9+
kind: Sysdig
10+
image: 'openshift-state-metrics/images/openshift-state-metrics-v4.png'
11+
description: |
12+
This dashboard offers information on:
13+
* CPU Used vs Request Quotas
14+
* CPU Used vs Limit Quotas
15+
* Memory Used vs Request Quotas
16+
* Memory Used vs Limit Quotas
17+
* Route list
18+
* Routes with issues
19+
* Builds
20+
* Builds Processes with issues
21+
file: 'include/openshift-state-metrics-v4.json'
Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,7 @@
1+
apiVersion: v1
2+
kind: Description
3+
app: openshift-state-metrics
4+
version: 1.0.0
5+
appVersion:
6+
- '4.7'
7+
descriptionFile: README.md
1.36 MB
Loading

0 commit comments

Comments
 (0)