Commit d732f9a

Merge pull request #264369 from bragi92/main
Add default targets for Control Plane to minimal ingestion profile
2 parents cf5e732 + 76f57a9 commit d732f9a

2 files changed: +91 -0 lines changed

articles/azure-monitor/containers/prometheus-metrics-scrape-configuration-minimal.md

Lines changed: 62 additions & 0 deletions
@@ -20,15 +20,21 @@ Following targets are **enabled/ON** by default - meaning you don't have to prov
 - `nodeexporter` (`job=node`)
 - `kubelet` (`job=kubelet`)
 - `kube-state-metrics` (`job=kube-state-metrics`)
+- `controlplane-apiserver` (`job=controlplane-apiserver`)
+- `controlplane-etcd` (`job=controlplane-etcd`)
 
 Following targets are available to scrape, but scraping isn't enabled (**disabled/OFF**) by default - meaning you don't have to provide any scrape job configuration for scraping these targets but they're disabled/OFF by default and you need to turn ON/enable scraping for these targets using [ama-metrics-settings-configmap](https://aka.ms/azureprometheus-addon-settings-configmap) under `default-scrape-settings-enabled` section
 
 - `core-dns` (`job=kube-dns`)
 - `kube-proxy` (`job=kube-proxy`)
 - `api-server` (`job=kube-apiserver`)
+- `controlplane-cluster-autoscaler` (`job=controlplane-cluster-autoscaler`)
+- `controlplane-kube-scheduler` (`job=controlplane-kube-scheduler`)
+- `controlplane-kube-controller-manager` (`job=controlplane-kube-controller-manager`)
 
 > [!NOTE]
 > The default scrape frequency for all default targets and scrapes is `30 seconds`. You can override it per target using the [ama-metrics-settings-configmap](https://aka.ms/azureprometheus-addon-settings-configmap) under `default-targets-scrape-interval-settings` section.
+> The control plane targets have a fixed scrape interval of `30 seconds` and cannot be overwritten.
 > You can read more about four different configmaps used by metrics addon [here](prometheus-metrics-scrape-configuration.md)
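
The per-target override described in the note might look like the fragment below. This is a minimal sketch, not the addon's published configmap: the `kube-system` namespace, the `<target> = "<duration>"` entry layout, and the `"60s"` value are assumptions chosen for illustration. Control plane targets are left out because the note says their interval is fixed.

```yaml
# Sketch only: namespace, entry layout, and the duration value are assumptions.
apiVersion: v1
kind: ConfigMap
metadata:
  name: ama-metrics-settings-configmap
  namespace: kube-system          # assumed namespace
data:
  default-targets-scrape-interval-settings: |-
    kubelet = "60s"               # hypothetical override of the 30-second default
```

In practice such an entry would be merged into the existing configmap rather than applied on its own, since applying a ConfigMap that contains only this section could drop the other sections.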
 
 ## Configuration setting
@@ -188,6 +194,33 @@ The following metrics are allow-listed with `minimalingestionprofile=true` for d
 - `node_time_seconds`
 - `node_uname_info"`
 
+**controlplane-apiserver**<br>
+- `apiserver_request_total`
+- `apiserver_cache_list_fetched_objects_total`
+- `apiserver_cache_list_returned_objects_total`
+- `apiserver_flowcontrol_demand_seats_average`
+- `apiserver_flowcontrol_current_limit_seats`
+- `apiserver_request_sli_duration_seconds_bucket`
+- `apiserver_request_sli_duration_seconds_count`
+- `apiserver_request_sli_duration_seconds_sum`
+- `process_start_time_seconds`
+- `apiserver_request_duration_seconds_bucket`
+- `apiserver_request_duration_seconds_count`
+- `apiserver_request_duration_seconds_sum`
+- `apiserver_storage_list_fetched_objects_total`
+- `apiserver_storage_list_returned_objects_total`
+- `apiserver_current_inflight_requests`
+
+**controlplane-etcd**<br>
+- `etcd_server_has_leader`
+- `rest_client_requests_total`
+- `etcd_mvcc_db_total_size_in_bytes`
+- `etcd_mvcc_db_total_size_in_use_in_bytes`
+- `etcd_server_slow_read_indexes_total`
+- `etcd_server_slow_apply_total`
+- `etcd_network_client_grpc_sent_bytes_total`
+- `etcd_server_heartbeat_send_failures_total`
+
 ### Minimal ingestion for default OFF targets
 The following are metrics that are allow-listed with `minimalingestionprofile=true` for default OFF targets. These metrics are not collected by default as these targets are not scraped by default (due to being OFF by default). You can turn ON scraping for these targets using `default-scrape-settings-enabled.<target-name>=true`' using [ama-metrics-settings-configmap](https://aka.ms/azureprometheus-addon-settings-configmap) under `default-scrape-settings-enabled` section.
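
As an illustration of the `default-scrape-settings-enabled.<target-name>=true` setting, the fragment below turns on the three control plane targets that this commit lists as OFF by default. It is a minimal sketch: the `kube-system` namespace, the one-entry-per-line layout, and the exact key spellings (taken here from the target names in the lists) are assumptions to verify against the published ama-metrics-settings-configmap.

```yaml
# Sketch only: namespace, layout, and key spellings are assumptions, not copied from this commit.
apiVersion: v1
kind: ConfigMap
metadata:
  name: ama-metrics-settings-configmap
  namespace: kube-system          # assumed namespace
data:
  default-scrape-settings-enabled: |-
    controlplane-cluster-autoscaler = true
    controlplane-kube-scheduler = true
    controlplane-kube-controller-manager = true
```

With `minimalingestionprofile=true`, only the allow-listed metrics for each of these targets would be ingested once the target is enabled.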
 
@@ -270,6 +303,35 @@ The following are metrics that are allow-listed with `minimalingestionprofile=tr
 - `process_cpu_seconds_total`
 - `go_goroutines`
 
+**controlplane-cluster-autoscaler**<br>
+- `rest_client_requests_total`
+- `cluster_autoscaler_last_activity`
+- `cluster_autoscaler_cluster_safe_to_autoscale`
+- `cluster_autoscaler_scale_down_in_cooldown`
+- `cluster_autoscaler_scaled_up_nodes_total`
+- `cluster_autoscaler_unneeded_nodes_count`
+- `cluster_autoscaler_unschedulable_pods_count`
+- `cluster_autoscaler_nodes_count`
+- `cloudprovider_azure_api_request_errors`
+- `cloudprovider_azure_api_request_duration_seconds_bucket`
+- `cloudprovider_azure_api_request_duration_seconds_count`
+
+**controlplane-kube-scheduler**<br>
+- `scheduler_pending_pods`
+- `scheduler_unschedulable_pods`
+- `scheduler_pod_scheduling_attempts`
+- `scheduler_queue_incoming_pods_total`
+- `scheduler_preemption_attempts_total`
+- `scheduler_preemption_victims`
+- `scheduler_scheduling_attempt_duration_seconds`
+- `scheduler_schedule_attempts_total`
+- `scheduler_pod_scheduling_duration_seconds`
+
+**controlplane-kube-controller-manager**<br>
+- `rest_client_request_duration_seconds`
+- `rest_client_requests_total`
+- `workqueue_depth`
+
 ## Next steps
 
 - [Learn more about customizing Prometheus metric scraping in Container insights](prometheus-metrics-scrape-configuration.md).

articles/azure-monitor/containers/prometheus-metrics-scrape-default.md

Lines changed: 29 additions & 0 deletions
@@ -24,6 +24,8 @@ Following targets are **enabled/ON** by default - meaning you don't have to prov
 - `nodeexporter` (`job=node`)
 - `kubelet` (`job=kubelet`)
 - `kube-state-metrics` (`job=kube-state-metrics`)
+- `controlplane-apiserver` (`job=controlplane-apiserver`)
+- `controlplane-etcd` (`job=controlplane-etcd`)
 
 ## Metrics collected from default targets
 
@@ -151,6 +153,33 @@ The following metrics are collected by default from each default target. All oth
 - `kube_resource_labels` (ex - kube_pod_labels, kube_deployment_labels)
 - `kube_resource_annotations` (ex - kube_pod_annotations, kube_deployment_annotations)
 
+**controlplane-apiserver (job=controlplane-apiserver)**<br>
+- `apiserver_request_total`
+- `apiserver_cache_list_fetched_objects_total`
+- `apiserver_cache_list_returned_objects_total`
+- `apiserver_flowcontrol_demand_seats_average`
+- `apiserver_flowcontrol_current_limit_seats`
+- `apiserver_request_sli_duration_seconds_bucket`
+- `apiserver_request_sli_duration_seconds_count`
+- `apiserver_request_sli_duration_seconds_sum`
+- `process_start_time_seconds`
+- `apiserver_request_duration_seconds_bucket`
+- `apiserver_request_duration_seconds_count`
+- `apiserver_request_duration_seconds_sum`
+- `apiserver_storage_list_fetched_objects_total`
+- `apiserver_storage_list_returned_objects_total`
+- `apiserver_current_inflight_requests`
+
+**controlplane-etcd (job=controlplane-etcd)**<br>
+- `etcd_server_has_leader`
+- `rest_client_requests_total`
+- `etcd_mvcc_db_total_size_in_bytes`
+- `etcd_mvcc_db_total_size_in_use_in_bytes`
+- `etcd_server_slow_read_indexes_total`
+- `etcd_server_slow_apply_total`
+- `etcd_network_client_grpc_sent_bytes_total`
+- `etcd_server_heartbeat_send_failures_total`
+
 ## Default targets scraped for Windows
 Following Windows targets are configured to scrape, but scraping is not enabled (**disabled/OFF**) by default - meaning you don't have to provide any scrape job configuration for scraping these targets but they are disabled/OFF by default and you need to turn ON/enable scraping for these targets using [ama-metrics-settings-configmap](https://aka.ms/azureprometheus-addon-settings-configmap) under `default-scrape-settings-enabled` section
 