|
| 1 | +:_mod-docs-content-type: ASSEMBLY |
| 2 | +include::_attributes/common-attributes.adoc[] |
| 3 | +[id="key-concepts"] |
| 4 | += Understanding the monitoring stack - key concepts |
| 5 | +:context: key-concepts |
| 6 | + |
| 7 | +toc::[] |
| 8 | + |
| 9 | +Get familiar with the {product-title} monitoring concepts and terms. Learn about how you can improve performance and scale of your cluster, store and record data, manage metrics and alerts, and more. |
| 10 | + |
| 11 | +[id="about-performance-and-scalability_{context}"] |
| 12 | +== About performance and scalability |
| 13 | + |
| 14 | +You can optimize the performance and scale of your clusters. |
| 15 | +You can configure the monitoring stack by performing any of the following actions: |
| 16 | + |
| 17 | +* Control the placement and distribution of monitoring components: |
| 18 | +** Use node selectors to move components to specific nodes. |
| 19 | +** Assign tolerations to enable moving components to tainted nodes. |
| 20 | +* Use pod topology spread constraints. |
| 21 | +* Manage CPU and memory resources. |
| 22 | +ifndef::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 23 | +* Set the body size limit for metrics scraping. |
| 24 | +* Use metrics collection profiles. |
| 25 | +endif::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 26 | + |
| 27 | +[role="_additional-resources"] |
| 28 | +.Additional resources |
| 29 | + |
| 30 | +ifndef::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 31 | +* xref:../configuring-core-platform-monitoring/configuring-performance-and-scalability.adoc#configuring-performance-and-scalability[Configuring performance and scalability for core platform monitoring] |
| 32 | +endif::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 33 | +* xref:../configuring-user-workload-monitoring/configuring-performance-and-scalability-uwm.adoc#configuring-performance-and-scalability-uwm[Configuring performance and scalability for user workload monitoring] |
| 34 | + |
| 35 | +include::modules/monitoring-using-node-selectors-to-move-monitoring-components.adoc[leveloffset=+2] |
| 36 | + |
| 37 | +include::modules/monitoring-using-pod-topology-spread-constraints-for-monitoring.adoc[leveloffset=+2] |
| 38 | + |
| 39 | +include::modules/monitoring-about-specifying-limits-and-requests-for-monitoring-components.adoc[leveloffset=+2] |
| 40 | + |
| 41 | +ifndef::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 42 | +include::modules/monitoring-configuring-metrics-collection-profiles.adoc[leveloffset=+2] |
| 43 | +endif::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 44 | + |
| 45 | +[id="about-storing-and-recording-data_{context}"] |
| 46 | +== About storing and recording data |
| 47 | + |
| 48 | +You can store and record data to help you protect the data and use them for troubleshooting. |
| 49 | +You can configure the monitoring stack by performing any of the following actions: |
| 50 | + |
| 51 | +* Configure persistent storage: |
| 52 | +** Protect your metrics and alerting data from data loss by storing them in a persistent volume (PV). As a result, they can survive pods being restarted or recreated. |
| 53 | +** Avoid getting duplicate notifications and losing silences for alerts when the Alertmanager pods are restarted. |
| 54 | +* Modify the retention time and size for Prometheus and Thanos Ruler metrics data. |
| 55 | +* Configure logging to help you troubleshoot issues with your cluster: |
| 56 | +ifndef::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 57 | +** Configure audit logs for Metrics Server. |
| 58 | +endif::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 59 | +** Set log levels for monitoring. |
| 60 | +** Enable the query logging for Prometheus and Thanos Querier. |
| 61 | + |
| 62 | +[role="_additional-resources"] |
| 63 | +.Additional resources |
| 64 | + |
| 65 | +ifndef::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 66 | +* xref:../configuring-core-platform-monitoring/storing-and-recording-data.adoc#storing-and-recording-data[Storing and recording data for core platform monitoring] |
| 67 | +endif::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 68 | +* xref:../configuring-user-workload-monitoring/storing-and-recording-data-uwm.adoc#storing-and-recording-data-uwm[Storing and recording data for user workload monitoring] |
| 69 | + |
| 70 | +include::modules/monitoring-retention-time-and-size-for-prometheus-metrics-data.adoc[leveloffset=+2] |
| 71 | + |
| 72 | +// Understanding metrics |
| 73 | +include::modules/monitoring-understanding-metrics.adoc[leveloffset=+1] |
| 74 | + |
| 75 | +[role="_additional-resources"] |
| 76 | +.Additional resources |
| 77 | + |
| 78 | +ifndef::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 79 | +* xref:../configuring-core-platform-monitoring/configuring-metrics.adoc#configuring-metrics[Configuring metrics for core platform monitoring] |
| 80 | +endif::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 81 | +* xref:../configuring-user-workload-monitoring/configuring-metrics-uwm.adoc#configuring-metrics-uwm[Configuring metrics for user workload monitoring] |
| 82 | +* xref:../accessing-metrics/accessing-metrics-as-an-administrator.adoc#accessing-metrics-as-an-administrator[Accessing metrics as an administrator] |
| 83 | +* xref:../accessing-metrics/accessing-metrics-as-a-developer.adoc#accessing-metrics-as-a-developer[Accessing metrics as a developer] |
| 84 | + |
| 85 | +include::modules/monitoring-controlling-the-impact-of-unbound-attributes-in-user-defined-projects.adoc[leveloffset=+2] |
| 86 | + |
| 87 | +include::modules/monitoring-adding-cluster-id-labels-to-metrics.adoc[leveloffset=+2] |
| 88 | + |
| 89 | +//About monitoring dashboards |
| 90 | + |
| 91 | +include::modules/monitoring-about-monitoring-dashboards.adoc[leveloffset=+1] |
| 92 | + |
| 93 | +[role="_additional-resources"] |
| 94 | +.Additional resources |
| 95 | + |
| 96 | +* xref:../accessing-metrics/accessing-metrics-as-an-administrator.adoc#reviewing-monitoring-dashboards-admin_accessing-metrics-as-an-administrator[Reviewing monitoring dashboards as a cluster administrator] |
| 97 | +* xref:../accessing-metrics/accessing-metrics-as-a-developer.adoc#reviewing-monitoring-dashboards-developer_accessing-metrics-as-a-developer[Reviewing monitoring dashboards as a developer] |
| 98 | + |
| 99 | +//Managing alerts |
| 100 | +include::modules/monitoring-about-managing-alerts.adoc[leveloffset=+1] |
| 101 | + |
| 102 | +[role="_additional-resources"] |
| 103 | +.Additional resources |
| 104 | + |
| 105 | +ifndef::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 106 | +* xref:../configuring-core-platform-monitoring/configuring-alerts-and-notifications.adoc#configuring-alerts-and-notifications[Configuring alerts and notifications for core platform monitoring] |
| 107 | +endif::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 108 | +* xref:../configuring-user-workload-monitoring/configuring-alerts-and-notifications-uwm.adoc#configuring-alerts-and-notifications-uwm[Configuring alerts and notifications for user workload monitoring] |
| 109 | +* xref:../managing-alerts/managing-alerts-as-an-administrator.adoc#managing-alerts-as-an-administrator[Managing alerts as an Administrator] |
| 110 | +* xref:../managing-alerts/managing-alerts-as-a-developer.adoc#managing-alerts-as-a-developer[Managing alerts as a Developer] |
| 111 | + |
| 112 | +include::modules/monitoring-managing-silences.adoc[leveloffset=+2] |
| 113 | + |
| 114 | +ifndef::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 115 | +include::modules/monitoring-managing-core-platform-alerting-rules.adoc[leveloffset=+2] |
| 116 | + |
| 117 | +include::modules/monitoring-tips-for-optimizing-alerting-rules-for-core-platform-monitoring.adoc[leveloffset=+2] |
| 118 | +endif::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 119 | + |
| 120 | +include::modules/monitoring-about-creating-alerting-rules-for-user-defined-projects.adoc[leveloffset=+2] |
| 121 | + |
| 122 | +include::modules/monitoring-managing-alerting-rules-for-user-defined-projects.adoc[leveloffset=+2] |
| 123 | + |
| 124 | +include::modules/monitoring-optimizing-alerting-for-user-defined-projects.adoc[leveloffset=+2] |
| 125 | + |
| 126 | +include::modules/monitoring-searching-alerts-silences-and-alerting-rules.adoc[leveloffset=+2] |
| 127 | + |
| 128 | +// Overview of setting up alert routing for user-defined projects |
| 129 | +include::modules/monitoring-understanding-alert-routing-for-user-defined-projects.adoc[leveloffset=+1] |
| 130 | + |
| 131 | +[role="_additional-resources"] |
| 132 | +.Additional resources |
| 133 | + |
| 134 | +* xref:../configuring-user-workload-monitoring/preparing-to-configure-the-monitoring-stack-uwm.adoc#enabling-alert-routing-for-user-defined-projects_preparing-to-configure-the-monitoring-stack-uwm[Enabling alert routing for user-defined projects] |
| 135 | + |
| 136 | +// Sending notifications to external systems |
| 137 | +include::modules/monitoring-sending-notifications-to-external-systems.adoc[leveloffset=+1] |
| 138 | + |
| 139 | +[role="_additional-resources"] |
| 140 | +.Additional resources |
| 141 | + |
| 142 | +ifndef::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 143 | +* xref:../configuring-core-platform-monitoring/configuring-alerts-and-notifications.adoc#configuring-alert-notifications_configuring-alerts-and-notifications[Configuring alert notifications for core platform monitoring] |
| 144 | +endif::openshift-dedicated,openshift-rosa,openshift-rosa-hcp[] |
| 145 | +* xref:../configuring-user-workload-monitoring/configuring-alerts-and-notifications-uwm.adoc#configuring-alert-notifications_configuring-alerts-and-notifications-uwm[Configuring alert notifications for user workload monitoring] |
0 commit comments