You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Sep 2, 2025. It is now read-only.
Kubernetes clusters,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes clusters. Note: Cluster CPU and Memory Utilization is the average of all containers in the cluster.
3
+
Kubernetes nodes,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes nodes.
4
+
Kubernetes nodes,Node CPU,"A group of metrics and KPIs related to node CPU troubleshooting. Note: At the node level, CPU Limit and CPU Request is the sum of the respective values for all containers in the node.
5
+
"
6
+
Kubernetes nodes,Node memory,"A group of metrics and KPIs related to node memory troubleshooting. Note: At the node level, Memory Limit and Memory Request is the sum of the respective values for all containers in the node."
7
+
Kubernetes nodes,Node file system,"A group of metrics and KPIs related to node storage troubleshooting. Note: At the node level, Storage Limit and Storage Request is the sum of the respective values for all containers in the node."
8
+
Kubernetes nodes,Node network metrics,A group of metrics and KPIs related to node network troubleshooting.
9
+
Kubernetes pods,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes pods.
10
+
Kubernetes pods,Pod status,A group of metrics and KPIs related to pod status troubleshooting.
11
+
Kubernetes pods,Pod CPU,"A group of metrics and KPIs related to pod CPU troubleshooting. Note: At the pod level, CPU Limit and CPU Request is the sum of the respective values for all containers in the pod. "
12
+
Kubernetes pods,Pod memory,"A group of metrics and KPIs related to pod memory troubleshooting. Note: At the pod level, Memory Limit and Memory Request is the sum of the respective values for all containers in the pod. "
13
+
Kubernetes pods,Pod file system,A group of metrics and KPIs related to pod storage troubleshooting.
14
+
Kubernetes containers,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes containers.
15
+
Kubernetes containers,Container CPU,A group of metrics and KPIs related to container CPU troubleshooting.
16
+
Kubernetes containers,Container memory,A group of metrics and KPIs related to container memory troubleshooting.
17
+
Kubernetes containers,Container file system,A group of metrics and KPIs related to container storage troubleshooting.
18
+
Kubernetes containers,Container network metrics ,A group of metrics and KPIs related to container network troubleshooting.
19
+
Kubernetes workloads,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes workloads.
20
+
Kubernetes workloads,Workload pods,A group of metrics and KPIs related to the status of the pods in a workload.
21
+
Kubernetes workloads,Workload churn,A group of metrics and KPIs related to the churn rate of the pods in a workload.
22
+
Kubernetes workloads,Workload scaling,A group of metrics and KPIs related to scaling up/down the pods in a workload.
23
+
Kubernetes workloads,Workload attributes,A group of metrics and KPIs related to workload attributes.
24
+
Kubernetes deployments,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes deployments.
25
+
Kubernetes deployments,Deployment pods,A group of metrics and KPIs related to the status of the pods in a deployment.
26
+
Kubernetes deployments,Deployment churn,A group of metrics and KPIs related to the churn rate of the pods in a deployment.
27
+
Kubernetes deployments,Deployment scaling,A group of metrics and KPIs related to scaling up/down the pods in a deployment.
28
+
Kubernetes deployments,Deployment attributes,A group of metrics and KPIs related to deployment attributes.
29
+
Kubernetes ReplicaSets,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes ReplicaSets.
30
+
Kubernetes ReplicaSets,ReplicaSet pods,A group of metrics and KPIs related to the status of the pods in a ReplicaSet.
31
+
Kubernetes ReplicaSets,ReplicaSet churn,A group of metrics and KPIs related to the churn rate of the pods in a ReplicaSet.
32
+
Kubernetes ReplicaSets,ReplicaSet scaling,A group of metrics and KPIs related to scaling up/down the pods in a ReplicaSet.
33
+
Kubernetes ReplicaSets,ReplicaSet attributes,A group of metrics and KPIs related to ReplicaSet attributes.
34
+
Kubernetes DaemonSets,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes DaemonSets.
35
+
Kubernetes DaemonSets,DaemonSet nodes,A group of metrics and KPIs related to troubleshooting the nodes in a DaemonSet.
36
+
Kubernetes DaemonSets,DaemonSet churn,A group of metrics and KPIs related to the churn rate of the pods in a DaemonSet.
37
+
Kubernetes DaemonSets,DaemonSet attributes,A group of metrics and KPIs related to DaemonSet attributes.
38
+
Kubernetes StatefulSets,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes StatefulSets.
39
+
Kubernetes StatefulSets,StatefulSet pods,A group of metrics and KPIs related to troubleshooting the pods in a StatefulSet.
40
+
Kubernetes StatefulSets,StatefulSet churn,A group of metrics and KPIs related to the churn rate of the pods in a StatefulSet.
41
+
Kubernetes StatefulSets,StatefulSet attributes,A group of metrics and KPIs related to StatefulSet attributes.
42
+
Kubernetes namespaces,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes namespaces.
43
+
Kubernetes namespaces,Namespace workloads,A group of metrics and KPIs related to troubleshooting the workloads in a namespace.
44
+
Kubernetes namespaces,Namespace resources,A group of metrics and KPIs related to troubleshooting namespace resources.
45
+
Kubernetes CronJobs,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes CronJobs.
46
+
Kubernetes CronJobs,CronJob jobs,A group of metrics and KPIs related to troubleshooting CronJobs.
47
+
Kubernetes CronJobs,CronJob pods,A group of metrics and KPIs related to troubleshooting pods in a CronJob.
48
+
Kubernetes CronJobs,CronJob attributes,A group of metrics and KPIs related to CronJob attributes.
49
+
Kubernetes jobs,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes jobs.
50
+
Kubernetes jobs,Job attributes,A group of metrics and KPIs related to job attributes.
Copy file name to clipboardExpand all lines: infrastructure/infrastructure.rst
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -77,7 +77,7 @@ Splunk Observability Cloud also provides metrics and other data :ref:`for the fo
77
77
- View key metrics and visualize incidents for every supported cloud service. The Infrastructure Overview provides default dashboards for each cloud service. For example, there are separate dashboards for AWS EC2 instances and AWS EBS instances.
78
78
79
79
* - Containers
80
-
- :ref:`infrastructure-k8s`
80
+
- :ref:`infrastructure-k8s-nav`
81
81
- View key metrics and visualize incidents for your Kubernetes infrastructure at the cluster, node, pod, and container level.
Copy file name to clipboardExpand all lines: infrastructure/monitor/gcp-infra-monitor.rst
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -93,7 +93,7 @@ You can also export and monitor data from these sources running in your GCP envi
93
93
- :strong:`Description`
94
94
95
95
* - :ref:`get-started-k8s`
96
-
- :ref:`infrastructure-k8s`
96
+
- :ref:`infrastructure-k8s-nav`
97
97
- Collect metrics and logs from Kubernetes clusters running in Google Compute Engine (GCE) or Google Kubernetes Engine (GKE) instances.
98
98
99
99
* - - :ref:`get-started-linux`
@@ -139,4 +139,4 @@ In order to populate GCP Kubernetes Engine navigators and dashboards, Splunk Obs
139
139
140
140
If you have a private GKE cluster, refer to the :new-page:`official Google documentation on setting up limited access to the public control plane endpoint<https://cloud.google.com/kubernetes-engine/docs/how-to/private-clusters#public_cp>`, and reach out to Support to obtain the IP ranges that you need to authorize.
141
141
142
-
Alternatively, you can collect Kubernetes data using the OpenTelemetry Collector for Kubernetes and use the provided dashboards and navigators. Learn more at :ref:`get-started-k8s` and :ref:`infrastructure-k8s`.
142
+
Alternatively, you can collect Kubernetes data using the OpenTelemetry Collector for Kubernetes and use the provided dashboards and navigators. Learn more at :ref:`get-started-k8s` and :ref:`infrastructure-k8s-nav`.
Copy file name to clipboardExpand all lines: infrastructure/monitor/k8s-nav.rst
+25-51Lines changed: 25 additions & 51 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,14 +4,9 @@
4
4
Monitor Kubernetes
5
5
**********************************
6
6
7
-
8
7
.. meta::
9
8
:description: Learn how to monitor Kubernetes resources with Splunk Observability Cloud.
10
9
11
-
12
-
.. note:: The following topic describes the new Kubernetes navigator. See :ref:`infrastructure-k8s` for documentation on the classic Kubernetes navigator.
13
-
14
-
15
10
You can monitor Kubernetes metrics with Splunk Observability Cloud. Splunk Observability Cloud uses the Splunk Distribution of OpenTelemetry Collector for Kubernetes to provide robust infrastructure monitoring capabilities. To learn more, see :ref:`otel-intro`.
16
11
17
12
Prerequisites
@@ -25,7 +20,7 @@ To start monitoring Kubernetes resources, you must:
25
20
26
21
.. _use-k8s-nav:
27
22
28
-
Kubernetes navigators
23
+
About Kubernetes navigators
29
24
===============================
30
25
31
26
.. note:: The following sections describe components specific to the Kubernetes navigators. For information on components shared by all navigators, see :ref:`use-navigators-imm`.
@@ -46,13 +41,14 @@ The following table describes the Kubernetes navigators:
46
41
* Pods
47
42
* Containers
48
43
- * Provides a :ref:`table, heat map <navigator-views>`, and :ref:`hierarchy map <k8s-nav-hierarchy-map>` view of your Kubernetes infrastructure
49
-
* Provides a :ref:`Kubernetes Analyzer<k8s-nav-analyzer-overview>` that helps you troubleshoot Kubernetes problems at scale
44
+
* Provides a :ref:`Kubernetes analyzer<k8s-nav-analyzer-overview>` that helps you troubleshoot Kubernetes problems at scale
50
45
- * Get an overview of your Kubernetes infrastructure
51
46
* Monitor the health of part of your Kubernetes infrastructure
52
47
* Identify and diagnose an issue with part of your Kubernetes infrastructure
53
48
* View services and hosts running on Kubernetes
54
49
55
-
* - * Workloads
50
+
* - * Clusters
51
+
* Workloads
56
52
* Deployments
57
53
* ReplicaSets
58
54
* StatefulSets
@@ -77,13 +73,10 @@ Monitor your Kubernetes infrastructure with an interactive hierarchical map that
77
73
78
74
To navigate to the hierarchy map:
79
75
80
-
1. On the :guilabel:`Infrastructure` landing page, select the Kubernetes nodes, pods, or containers navigator.
81
-
2. The table view displays by default. Select an instance from the table.
:alt:How to select a different node to investigate and jump to the cluster level.
102
-
103
-
104
91
* Hover: Get more information about an instance, including its status or phase, by hovering over that instance.
92
+
* Select and zoom: Drill down into an instance and change the zoom level of the map, if applicable, by selecting the instance.
93
+
* Filter: Filter the map by any available metadata in your Kubernetes data, such as a namespace, a workload, or any other key-value pair. When you apply a filter, the map highlights instances that match the filter. You can still hover over the dimmed instances to view details about them.
:alt:Hovering over a pod shows its information and phase.
108
-
:width:50%
95
+
.. _k8s-nav-left-nav:
109
96
110
-
* Select and zoom: Drill down into an instance and change the zoom level of the map, if applicable, by selecting the instance. Details about the instance display in the sidebar, in the :guilabel:`About this <Entity_Type>` panel.
:alt:From the node-level hierarchy map, selecting a pod zooms the view to the pod level. Details about the selected pod display in the sidebar in the :guilabel:`About this pod` panel. From the pod level, selecting a container zooms the view to the container level.
97
+
Left navigation panel
98
+
============================
115
99
116
-
* Filter: Filter the map by any available metadata in your Kubernetes data, such as a namespace, a workload, or any other key-value pair. When you apply a filter, the map highlights instances that match the filter. You can still hover over the dimmed instances to view details about them.
100
+
Use the left navigation panel in the table or heat map view to quickly switch between Kubernetes entity types, search for filters, and access predefined filters.
:alt:Filtering ``k8s.container.name`` to ``config-reloader`` and ``core-metrics-exporter`` highlights matching pods and dims pods that don't match.
102
+
To use the left navigation panel:
103
+
104
+
* :guilabel:`Select entity type`: Use this drop-down menu to switch between Kubernetes entity types.
105
+
* :guilabel:`Refine by`: Use this panel to search for filters or access a list of predefined filters. The list of predefined filters is searchable and organized by :guilabel:`Relationship` and :guilabel:`Attribute`.
.. note:: The Analyzer is only available on the Kubernetes nodes, pods, and containers navigators.
112
+
.. note:: The analyzer is only available on the Kubernetes nodes, pods, and containers navigators.
128
113
129
-
The Analyzer, accessed through the :guilabel:`K8s analyzer` tab, helps you troubleshoot Kubernetes problems at scale by highlighting Kubernetes instances that are in a bad state, such as nodes that are not ready. The Analyzer produces theories about what those instances might have in common, such as that all of the instances are running the same workload or all instances are located in the same AWS region. Select a finding in the Analyzer panel to filter the map.
114
+
You can access the Kubernetes analyzer through the :guilabel:`K8s analyzer` tab. The analyzer helps you troubleshoot Kubernetes problems at scale by highlighting Kubernetes instances that are in a bad state, such as nodes that are not ready. The analyzer produces theories about what those instances might have in common, such as that all of the instances are running the same workload or all instances are located in the same AWS region. Select a finding in the analyzer to filter the map.
130
115
131
-
The Analyzer panel displays suggested filters for the elements selected in the table or heat map view. Select links in the Analyzer panel to add filters to the table or heat map view and explore conditions across your entire Kubernetes environment.
116
+
The analyzer displays suggested filters for the elements selected in the table or heat map view. Select links in the analyzer to add filters to the table or heat map view and explore conditions across your entire Kubernetes environment.
132
117
133
-
The Analyzer uses AI-driven insights to examine patterns that nodes, pods, or containers could have in common. The trouble indicators are:
118
+
The analyzer uses AI-driven insights to examine potential patterns between nodes, pods, or containers. The trouble indicators are:
134
119
135
120
- Pods that are in pending status
136
121
- Pods that are in failed status
@@ -141,18 +126,7 @@ The Analyzer uses AI-driven insights to examine patterns that nodes, pods, or co
141
126
- Nodes experiencing high CPU
142
127
- Nodes experiencing high memory
143
128
144
-
The Analyzer displays overrepresented metrics properties for known conditions, such as pods in pending status, pods in failed status, and so on. You can use properties that are highly correlated with these conditions to filter the table or heat map. You can explore data about each of those elements in the navigator using context-sensitive dashboards. This enables you to identify the underlying patterns noticeable on the filtered map that might be correlated with Kubernetes issues. For example, if all failed pods are in certain types of clusters, the Analyzer provides suggested paths to follow to troubleshoot such issues.
Apart from monitoring your Kubernetes infrastructure, you can also track services and hosts running on Kubernetes in the navigator sidebar. When you select a service or host from the sidebar, you are switching to the navigator for that service or host instance.
:alt:Starting on the Kubernetes pod navigator, switching to the MySQL host navigator, and then switching back to the Kubernetes pod navigator.
129
+
The analyzer displays overrepresented metrics properties for known conditions, such as pods in pending status, pods in failed status, and so on. You can use properties that are highly correlated with these conditions to filter the table or heat map. You can explore data about each of those elements in the navigator using context-sensitive dashboards. This enables you to identify the underlying patterns noticeable on the filtered map that might be correlated with Kubernetes issues. For example, if all failed pods are in certain types of clusters, the analyzer provides suggested paths to follow to troubleshoot such issues.
0 commit comments