Skip to content
This repository was archived by the owner on Sep 2, 2025. It is now read-only.

Commit 4f3291d

Browse files
authored
Merge pull request #2561 from splunk/stran-O11YDOCS-6547-k8s-nav-updates
[O11YDOCS-6547] CSFY25PI2 Kubernetes navigator updates
2 parents f73b667 + 2673e5e commit 4f3291d

File tree

13 files changed

+140
-390
lines changed

13 files changed

+140
-390
lines changed
Lines changed: 50 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,50 @@
1+
Navigator,Metric class name,Description
2+
Kubernetes clusters,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes clusters. Note: Cluster CPU and Memory Utilization is the average of all containers in the cluster.
3+
Kubernetes nodes,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes nodes.
4+
Kubernetes nodes,Node CPU,"A group of metrics and KPIs related to node CPU troubleshooting. Note: At the node level, CPU Limit and CPU Request is the sum of the respective values for all containers in the node.
5+
"
6+
Kubernetes nodes,Node memory,"A group of metrics and KPIs related to node memory troubleshooting. Note: At the node level, Memory Limit and Memory Request is the sum of the respective values for all containers in the node."
7+
Kubernetes nodes,Node file system,"A group of metrics and KPIs related to node storage troubleshooting. Note: At the node level, Storage Limit and Storage Request is the sum of the respective values for all containers in the node."
8+
Kubernetes nodes,Node network metrics,A group of metrics and KPIs related to node network troubleshooting.
9+
Kubernetes pods,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes pods.
10+
Kubernetes pods,Pod status,A group of metrics and KPIs related to pod status troubleshooting.
11+
Kubernetes pods,Pod CPU,"A group of metrics and KPIs related to pod CPU troubleshooting. Note: At the pod level, CPU Limit and CPU Request is the sum of the respective values for all containers in the pod. "
12+
Kubernetes pods,Pod memory,"A group of metrics and KPIs related to pod memory troubleshooting. Note: At the pod level, Memory Limit and Memory Request is the sum of the respective values for all containers in the pod. "
13+
Kubernetes pods,Pod file system,A group of metrics and KPIs related to pod storage troubleshooting.
14+
Kubernetes containers,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes containers.
15+
Kubernetes containers,Container CPU,A group of metrics and KPIs related to container CPU troubleshooting.
16+
Kubernetes containers,Container memory,A group of metrics and KPIs related to container memory troubleshooting.
17+
Kubernetes containers,Container file system,A group of metrics and KPIs related to container storage troubleshooting.
18+
Kubernetes containers,Container network metrics ,A group of metrics and KPIs related to container network troubleshooting.
19+
Kubernetes workloads,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes workloads.
20+
Kubernetes workloads,Workload pods,A group of metrics and KPIs related to the status of the pods in a workload.
21+
Kubernetes workloads,Workload churn,A group of metrics and KPIs related to the churn rate of the pods in a workload.
22+
Kubernetes workloads,Workload scaling,A group of metrics and KPIs related to scaling up/down the pods in a workload.
23+
Kubernetes workloads,Workload attributes,A group of metrics and KPIs related to workload attributes.
24+
Kubernetes deployments,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes deployments.
25+
Kubernetes deployments,Deployment pods,A group of metrics and KPIs related to the status of the pods in a deployment.
26+
Kubernetes deployments,Deployment churn,A group of metrics and KPIs related to the churn rate of the pods in a deployment.
27+
Kubernetes deployments,Deployment scaling,A group of metrics and KPIs related to scaling up/down the pods in a deployment.
28+
Kubernetes deployments,Deployment attributes,A group of metrics and KPIs related to deployment attributes.
29+
Kubernetes ReplicaSets,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes ReplicaSets.
30+
Kubernetes ReplicaSets,ReplicaSet pods,A group of metrics and KPIs related to the status of the pods in a ReplicaSet.
31+
Kubernetes ReplicaSets,ReplicaSet churn,A group of metrics and KPIs related to the churn rate of the pods in a ReplicaSet.
32+
Kubernetes ReplicaSets,ReplicaSet scaling,A group of metrics and KPIs related to scaling up/down the pods in a ReplicaSet.
33+
Kubernetes ReplicaSets,ReplicaSet attributes,A group of metrics and KPIs related to ReplicaSet attributes.
34+
Kubernetes DaemonSets,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes DaemonSets.
35+
Kubernetes DaemonSets,DaemonSet nodes,A group of metrics and KPIs related to troubleshooting the nodes in a DaemonSet.
36+
Kubernetes DaemonSets,DaemonSet churn,A group of metrics and KPIs related to the churn rate of the pods in a DaemonSet.
37+
Kubernetes DaemonSets,DaemonSet attributes,A group of metrics and KPIs related to DaemonSet attributes.
38+
Kubernetes StatefulSets,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes StatefulSets.
39+
Kubernetes StatefulSets,StatefulSet pods,A group of metrics and KPIs related to troubleshooting the pods in a StatefulSet.
40+
Kubernetes StatefulSets,StatefulSet churn,A group of metrics and KPIs related to the churn rate of the pods in a StatefulSet.
41+
Kubernetes StatefulSets,StatefulSet attributes,A group of metrics and KPIs related to StatefulSet attributes.
42+
Kubernetes namespaces,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes namespaces.
43+
Kubernetes namespaces,Namespace workloads,A group of metrics and KPIs related to troubleshooting the workloads in a namespace.
44+
Kubernetes namespaces,Namespace resources,A group of metrics and KPIs related to troubleshooting namespace resources.
45+
Kubernetes CronJobs,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes CronJobs.
46+
Kubernetes CronJobs,CronJob jobs,A group of metrics and KPIs related to troubleshooting CronJobs.
47+
Kubernetes CronJobs,CronJob pods,A group of metrics and KPIs related to troubleshooting pods in a CronJob.
48+
Kubernetes CronJobs,CronJob attributes,A group of metrics and KPIs related to CronJob attributes.
49+
Kubernetes jobs,Default,A default group of metrics and KPIs that are essential for troubleshooting Kubernetes jobs.
50+
Kubernetes jobs,Job attributes,A group of metrics and KPIs related to job attributes.

gdi/get-data-in/gdi-guide/infrastructure/send-server-cluster-data.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -69,7 +69,7 @@ For example, the host navigator shows charts and other visualizations of host da
6969
To learn how to use Infrastructure Monitoring navigators, see the following:
7070

7171
* :ref:`monitor-hosts`
72-
* :ref:`use-the-k8s-navigator`
72+
* :ref:`infrastructure-k8s-nav`
7373

7474
.. _view-metrics-inframon:
7575

gdi/opentelemetry/collector-kubernetes/k8s-infrastructure-tutorial/k8s-monitor-with-navigators.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -79,5 +79,5 @@ Learn more
7979
----------
8080

8181
* For more details about using navigators in general, see :ref:`Use navigators in Splunk Infrastructure Monitoring <use-navigators-imm>`.
82-
* For more details about the data displayed in the Kubernetes navigator, see :ref:`Use the Kubernetes navigator <use-the-k8s-navigator>`.
82+
* For more details about the data displayed in the Kubernetes navigator, see :ref:`Use the Kubernetes navigator <infrastructure-k8s-nav>`.
8383
* For more details about working with charts, see :ref:`Charts in Splunk Observability Cloud <data-visualization-charts>`.

infrastructure/infrastructure.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -77,7 +77,7 @@ Splunk Observability Cloud also provides metrics and other data :ref:`for the fo
7777
- View key metrics and visualize incidents for every supported cloud service. The Infrastructure Overview provides default dashboards for each cloud service. For example, there are separate dashboards for AWS EC2 instances and AWS EBS instances.
7878

7979
* - Containers
80-
- :ref:`infrastructure-k8s`
80+
- :ref:`infrastructure-k8s-nav`
8181
- View key metrics and visualize incidents for your Kubernetes infrastructure at the cluster, node, pod, and container level.
8282

8383
* - My Data Center

infrastructure/monitor/aws-infra-import.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -119,7 +119,7 @@ Splunk Observability Cloud also imports metrics, metadata, and logs for some of
119119
- :strong:`Description`
120120

121121
* - :ref:`get-started-k8s`
122-
- :ref:`infrastructure-k8s`
122+
- :ref:`infrastructure-k8s-nav`
123123
- Import metrics and logs from Kubernetes clusters running in EC2 instances or EKS.
124124

125125
* - - :ref:`get-started-linux`

infrastructure/monitor/azure.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ You can also export and monitor data from sources running in your Azure environm
2525
- :strong:`Description`
2626

2727
* - :ref:`get-started-k8s`
28-
- :ref:`infrastructure-k8s`
28+
- :ref:`infrastructure-k8s-nav`
2929
- Collect metrics and logs from Kubernetes clusters running in Azure Kubernetes Service.
3030

3131
* - - :ref:`get-started-linux`

infrastructure/monitor/gcp-infra-monitor.rst

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -93,7 +93,7 @@ You can also export and monitor data from these sources running in your GCP envi
9393
- :strong:`Description`
9494

9595
* - :ref:`get-started-k8s`
96-
- :ref:`infrastructure-k8s`
96+
- :ref:`infrastructure-k8s-nav`
9797
- Collect metrics and logs from Kubernetes clusters running in Google Compute Engine (GCE) or Google Kubernetes Engine (GKE) instances.
9898

9999
* - - :ref:`get-started-linux`
@@ -139,4 +139,4 @@ In order to populate GCP Kubernetes Engine navigators and dashboards, Splunk Obs
139139

140140
If you have a private GKE cluster, refer to the :new-page:`official Google documentation on setting up limited access to the public control plane endpoint<https://cloud.google.com/kubernetes-engine/docs/how-to/private-clusters#public_cp>`, and reach out to Support to obtain the IP ranges that you need to authorize.
141141

142-
Alternatively, you can collect Kubernetes data using the OpenTelemetry Collector for Kubernetes and use the provided dashboards and navigators. Learn more at :ref:`get-started-k8s` and :ref:`infrastructure-k8s`.
142+
Alternatively, you can collect Kubernetes data using the OpenTelemetry Collector for Kubernetes and use the provided dashboards and navigators. Learn more at :ref:`get-started-k8s` and :ref:`infrastructure-k8s-nav`.

infrastructure/monitor/k8s-nav.rst

Lines changed: 25 additions & 51 deletions
Original file line numberDiff line numberDiff line change
@@ -4,14 +4,9 @@
44
Monitor Kubernetes
55
**********************************
66

7-
87
.. meta::
98
:description: Learn how to monitor Kubernetes resources with Splunk Observability Cloud.
109

11-
12-
.. note:: The following topic describes the new Kubernetes navigator. See :ref:`infrastructure-k8s` for documentation on the classic Kubernetes navigator.
13-
14-
1510
You can monitor Kubernetes metrics with Splunk Observability Cloud. Splunk Observability Cloud uses the Splunk Distribution of OpenTelemetry Collector for Kubernetes to provide robust infrastructure monitoring capabilities. To learn more, see :ref:`otel-intro`.
1611

1712
Prerequisites
@@ -25,7 +20,7 @@ To start monitoring Kubernetes resources, you must:
2520

2621
.. _use-k8s-nav:
2722

28-
Kubernetes navigators
23+
About Kubernetes navigators
2924
===============================
3025

3126
.. note:: The following sections describe components specific to the Kubernetes navigators. For information on components shared by all navigators, see :ref:`use-navigators-imm`.
@@ -46,13 +41,14 @@ The following table describes the Kubernetes navigators:
4641
* Pods
4742
* Containers
4843
- * Provides a :ref:`table, heat map <navigator-views>`, and :ref:`hierarchy map <k8s-nav-hierarchy-map>` view of your Kubernetes infrastructure
49-
* Provides a :ref:`Kubernetes Analyzer<k8s-nav-analyzer-overview>` that helps you troubleshoot Kubernetes problems at scale
44+
* Provides a :ref:`Kubernetes analyzer<k8s-nav-analyzer-overview>` that helps you troubleshoot Kubernetes problems at scale
5045
- * Get an overview of your Kubernetes infrastructure
5146
* Monitor the health of part of your Kubernetes infrastructure
5247
* Identify and diagnose an issue with part of your Kubernetes infrastructure
5348
* View services and hosts running on Kubernetes
5449

55-
* - * Workloads
50+
* - * Clusters
51+
* Workloads
5652
* Deployments
5753
* ReplicaSets
5854
* StatefulSets
@@ -77,13 +73,10 @@ Monitor your Kubernetes infrastructure with an interactive hierarchical map that
7773

7874
To navigate to the hierarchy map:
7975

80-
1. On the :guilabel:`Infrastructure` landing page, select the Kubernetes nodes, pods, or containers navigator.
81-
2. The table view displays by default. Select an instance from the table.
82-
3. Expand the hierarchy map.
83-
84-
.. image:: /_images/infrastructure/k8s-nav/k8s-nav-hierarchy-map.png
85-
:alt: Hierarchy map view in the Kubernetes nodes navigator.
86-
:width: 90%
76+
#. From the Splunk Observability Cloud main menu, select :guilabel:`Infrastructure`, then :guilabel:`Kubernetes`.
77+
#. Select the Kubernetes nodes, pods, or containers navigator.
78+
#. The table view displays by default. Select an instance from the table.
79+
#. Expand the :guilabel:`Hierarchy Map`.
8780

8881
Nodes, pods, and containers are colored by health and status, as reported by Kubernetes:
8982

@@ -95,42 +88,34 @@ Investigate instances in the hierarchy map
9588
---------------------------------------------
9689

9790
* Breadcrumb navigation: Switch to different instances and jump across entity levels using the breadcrumb navigation bar.
98-
99-
.. image:: /_images/infrastructure/k8s-nav/k8s-nav-breadcrumb.gif
100-
:width: 100%
101-
:alt: How to select a different node to investigate and jump to the cluster level.
102-
103-
10491
* Hover: Get more information about an instance, including its status or phase, by hovering over that instance.
92+
* Select and zoom: Drill down into an instance and change the zoom level of the map, if applicable, by selecting the instance.
93+
* Filter: Filter the map by any available metadata in your Kubernetes data, such as a namespace, a workload, or any other key-value pair. When you apply a filter, the map highlights instances that match the filter. You can still hover over the dimmed instances to view details about them.
10594

106-
.. image:: /_images/infrastructure/k8s-nav/k8s-nav-hover.png
107-
:alt: Hovering over a pod shows its information and phase.
108-
:width: 50%
95+
.. _k8s-nav-left-nav:
10996

110-
* Select and zoom: Drill down into an instance and change the zoom level of the map, if applicable, by selecting the instance. Details about the instance display in the sidebar, in the :guilabel:`About this <Entity_Type>` panel.
111-
112-
.. image:: /_images/infrastructure/k8s-nav/k8s-nav-zoom.gif
113-
:width: 100%
114-
:alt: From the node-level hierarchy map, selecting a pod zooms the view to the pod level. Details about the selected pod display in the sidebar in the :guilabel:`About this pod` panel. From the pod level, selecting a container zooms the view to the container level.
97+
Left navigation panel
98+
============================
11599

116-
* Filter: Filter the map by any available metadata in your Kubernetes data, such as a namespace, a workload, or any other key-value pair. When you apply a filter, the map highlights instances that match the filter. You can still hover over the dimmed instances to view details about them.
100+
Use the left navigation panel in the table or heat map view to quickly switch between Kubernetes entity types, search for filters, and access predefined filters.
117101

118-
.. image:: /_images/infrastructure/k8s-nav/k8s-nav-filter.gif
119-
:width: 100%
120-
:alt: Filtering ``k8s.container.name`` to ``config-reloader`` and ``core-metrics-exporter`` highlights matching pods and dims pods that don't match.
102+
To use the left navigation panel:
103+
104+
* :guilabel:`Select entity type`: Use this drop-down menu to switch between Kubernetes entity types.
105+
* :guilabel:`Refine by`: Use this panel to search for filters or access a list of predefined filters. The list of predefined filters is searchable and organized by :guilabel:`Relationship` and :guilabel:`Attribute`.
121106

122107
.. _k8s-nav-analyzer-overview:
123108

124-
Analyzer
125-
==============
109+
Troubleshoot performance with the analyzer
110+
======================================================
126111

127-
.. note:: The Analyzer is only available on the Kubernetes nodes, pods, and containers navigators.
112+
.. note:: The analyzer is only available on the Kubernetes nodes, pods, and containers navigators.
128113

129-
The Analyzer, accessed through the :guilabel:`K8s analyzer` tab, helps you troubleshoot Kubernetes problems at scale by highlighting Kubernetes instances that are in a bad state, such as nodes that are not ready. The Analyzer produces theories about what those instances might have in common, such as that all of the instances are running the same workload or all instances are located in the same AWS region. Select a finding in the Analyzer panel to filter the map.
114+
You can access the Kubernetes analyzer through the :guilabel:`K8s analyzer` tab. The analyzer helps you troubleshoot Kubernetes problems at scale by highlighting Kubernetes instances that are in a bad state, such as nodes that are not ready. The analyzer produces theories about what those instances might have in common, such as that all of the instances are running the same workload or all instances are located in the same AWS region. Select a finding in the analyzer to filter the map.
130115

131-
The Analyzer panel displays suggested filters for the elements selected in the table or heat map view. Select links in the Analyzer panel to add filters to the table or heat map view and explore conditions across your entire Kubernetes environment.
116+
The analyzer displays suggested filters for the elements selected in the table or heat map view. Select links in the analyzer to add filters to the table or heat map view and explore conditions across your entire Kubernetes environment.
132117

133-
The Analyzer uses AI-driven insights to examine patterns that nodes, pods, or containers could have in common. The trouble indicators are:
118+
The analyzer uses AI-driven insights to examine potential patterns between nodes, pods, or containers. The trouble indicators are:
134119

135120
- Pods that are in pending status
136121
- Pods that are in failed status
@@ -141,18 +126,7 @@ The Analyzer uses AI-driven insights to examine patterns that nodes, pods, or co
141126
- Nodes experiencing high CPU
142127
- Nodes experiencing high memory
143128

144-
The Analyzer displays overrepresented metrics properties for known conditions, such as pods in pending status, pods in failed status, and so on. You can use properties that are highly correlated with these conditions to filter the table or heat map. You can explore data about each of those elements in the navigator using context-sensitive dashboards. This enables you to identify the underlying patterns noticeable on the filtered map that might be correlated with Kubernetes issues. For example, if all failed pods are in certain types of clusters, the Analyzer provides suggested paths to follow to troubleshoot such issues.
145-
146-
.. _k8s-nav-view-services:
147-
148-
View services and hosts running on Kubernetes
149-
=======================================================
150-
151-
Apart from monitoring your Kubernetes infrastructure, you can also track services and hosts running on Kubernetes in the navigator sidebar. When you select a service or host from the sidebar, you are switching to the navigator for that service or host instance.
152-
153-
.. image:: /_images/infrastructure/k8s-nav/k8s-nav-dependencies.gif
154-
:width: 100%
155-
:alt: Starting on the Kubernetes pod navigator, switching to the MySQL host navigator, and then switching back to the Kubernetes pod navigator.
129+
The analyzer displays overrepresented metrics properties for known conditions, such as pods in pending status, pods in failed status, and so on. You can use properties that are highly correlated with these conditions to filter the table or heat map. You can explore data about each of those elements in the navigator using context-sensitive dashboards. This enables you to identify the underlying patterns noticeable on the filtered map that might be correlated with Kubernetes issues. For example, if all failed pods are in certain types of clusters, the analyzer provides suggested paths to follow to troubleshoot such issues.
156130

157131
Next steps
158132
=====================

0 commit comments

Comments
 (0)