You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The Special Resource Operator (SRO) exposes the following Prometheus metrics through the `metrics` service:
11
+
12
+
|===
13
+
|Metric Name |Description
14
+
15
+
|`sro_used_nodes`
16
+
|Returns the nodes that are running pods created by a SRO custom resource (CR). This metric is available for `DaemonSet` and `Deployment` objects only.
17
+
18
+
|`sro_kind_completed_info`
19
+
|Represents whether a `kind` of an object defined by the Helm Charts in a SRO CR has been successfully uploaded in the cluster (value `1`) or not (value `0`). Examples of objects are `DaemonSet`, `Deployment` or `BuildConfig`.
20
+
21
+
|`sro_states_completed_info`
22
+
|Represents whether the SRO has finished processing a CR successfully (value `1`) or the SRO has not processed the CR yet (value `0`).
23
+
24
+
|`sro_managed_resources_total`
25
+
|Returns the number of SRO CRs in the cluster, regardless of their state.
= Building and running the simple-kmod SpecialResource by using a config map
7
8
8
-
In this example, the simple-kmod kernel module is used to show how the SRO can manage a driver container which is defined in Helm chart templates stored in a config map.
9
+
In this example, the simple-kmod kernel module shows how the Special Resource Operator (SRO) manages a driver container. The container is defined in the Helm chart templates that are stored in a config map.
9
10
10
11
.Prerequisites
11
12
@@ -14,7 +15,7 @@ In this example, the simple-kmod kernel module is used to show how the SRO can m
14
15
* You installed the OpenShift CLI (`oc`).
15
16
* You are logged into the OpenShift CLI as a user with `cluster-admin` privileges.
16
17
* You installed the Node Feature Discovery (NFD) Operator.
17
-
* You installed the Special Resource Operator.
18
+
* You installed the SRO.
18
19
* You installed the Helm CLI (`helm`).
19
20
20
21
.Procedure
@@ -270,7 +271,15 @@ spec:
270
271
----
271
272
$ oc create -f simple-kmod-configmap.yaml
272
273
----
273
-
+
274
+
275
+
[NOTE]
276
+
====
277
+
To remove the simple-kmod kernel module from the node, delete the simple-kmod `SpecialResource` API object using the `oc delete` command. The kernel module is unloaded when the driver container pod is deleted.
278
+
====
279
+
280
+
281
+
.Verification
282
+
274
283
The `simple-kmod` resources are deployed in the `simple-kmod` namespace as specified in the object manifest. After a short time, the build pod for the `simple-kmod` driver container starts running. The build completes after a few minutes, and then the driver container pods start running.
275
284
276
285
. Use `oc get pods` command to display the status of the build pods:
@@ -310,7 +319,7 @@ simple_procfs_kmod 16384 0
310
319
simple_kmod 16384 0
311
320
----
312
321
313
-
[NOTE]
314
-
====
315
-
If you want to remove the simple-kmod kernel module from the node, delete the simple-kmod `SpecialResource` API object using the `oc delete` command. The kernel module is unloaded when the driver container pod is deleted.
322
+
[TIP]
316
323
====
324
+
The `sro_kind_completed_info` SRO Prometheus metric provides information about the status of the different objects being deployed, which can be useful to troubleshoot SRO CR installations. The SRO also provides other types of metrics that you can use to watch the health of your environment.
= Building and running the simple-kmod SpecialResource by using the templates from the SRO image
7
8
8
-
The SRO image contains a local repository of Helm charts including the templates for deploying the simple-kmod kernel module. In this example, the simple-kmod kernel module is used to show how the SRO can manage a driver container that is defined in the internal SRO repository.
9
+
The Special Resource Operator(SRO) image contains a local repository of Helm charts, including the templates for deploying the simple-kmod kernel module. In this example, the simple-kmod kernel module shows how the SRO can manage a driver container that is defined in the internal SRO repository.
9
10
10
11
.Prerequisites
11
12
@@ -53,10 +54,17 @@ spec:
53
54
----
54
55
$ oc create -f simple-kmod-local.yaml
55
56
----
56
-
+
57
+
58
+
[NOTE]
59
+
====
60
+
To remove the simple-kmod kernel module from the node, delete the simple-kmod `SpecialResource` API object using the `oc delete` command. The kernel module is unloaded when the driver container pod is deleted.
61
+
====
62
+
63
+
.Verification
64
+
65
+
57
66
The `simple-kmod` resources are deployed in the `simple-kmod` namespace as specified in the object manifest. After a short time, the build pod for the `simple-kmod` driver container starts running. The build completes after a few minutes, and then the driver container pods start running.
58
67
59
-
+
60
68
. Use the `oc get pods` command to display the status of the pods:
61
69
62
70
+
@@ -95,7 +103,7 @@ simple_procfs_kmod 16384 0
95
103
simple_kmod 16384 0
96
104
----
97
105
98
-
[NOTE]
106
+
[TIP]
99
107
====
100
-
If you want to remove the simple-kmod kernel module from the node, delete the simple-kmod `SpecialResource` API object using the `oc delete` command. The kernel module is unloaded when the driver container pod is deleted.
108
+
The `sro_kind_completed_info` SRO Prometheus metric provides information about the status of the different objects being deployed, which can be useful to troubleshoot SRO CR installations. The SRO also provides other types of metrics that you can use to watch the health of your environment.
0 commit comments