@@ -489,13 +489,42 @@ is defined for pods without volumes and work in progress for pods with
489
489
volumes.
490
490
491
491
For kube-controller-manager, a metric that exposes the usual work
492
- queue metrics data (like queue length) will be made available.
493
- Furthermore, a count of PVC creation attempts will be added, labeled
494
- with the result (successful vs. error code). A non-zero count of attempts
495
- with "already exists" will indicate that there were conflicts with
496
- manually created PVCs.
492
+ queue metrics data (like queue length) will be made available with
493
+ "ephemeral_volume" as name. Here is one example after processing a
494
+ single pod with a generic ephemeral volume:
497
495
498
- TODO: list metrics names here and in kep.yaml
496
+ ```
497
+ workqueue_adds_total{name="ephemeral_volume"} 1
498
+ workqueue_depth{name="ephemeral_volume"} 0
499
+ workqueue_longest_running_processor_seconds{name="ephemeral_volume"} 0
500
+ workqueue_queue_duration_seconds_bucket{name="ephemeral_volume",le="1e-08"} 0
501
+ ...
502
+ workqueue_queue_duration_seconds_bucket{name="ephemeral_volume",le="9.999999999999999e-05"} 1
503
+ workqueue_queue_duration_seconds_bucket{name="ephemeral_volume",le="0.001"} 1
504
+ ...
505
+ workqueue_queue_duration_seconds_bucket{name="ephemeral_volume",le="+Inf"} 1
506
+ workqueue_queue_duration_seconds_sum{name="ephemeral_volume"} 4.8201e-05
507
+ workqueue_queue_duration_seconds_count{name="ephemeral_volume"} 1
508
+ workqueue_retries_total{name="ephemeral_volume"} 0
509
+ workqueue_unfinished_work_seconds{name="ephemeral_volume"} 0
510
+ workqueue_work_duration_seconds_bucket{name="ephemeral_volume",le="1e-08"} 0
511
+ ...
512
+ workqueue_work_duration_seconds_bucket{name="ephemeral_volume",le="0.1"} 1
513
+ ...
514
+ workqueue_work_duration_seconds_bucket{name="ephemeral_volume",le="+Inf"} 1
515
+ workqueue_work_duration_seconds_sum{name="ephemeral_volume"} 0.035308659
516
+ workqueue_work_duration_seconds_count{name="ephemeral_volume"} 1
517
+ ```
518
+
519
+ Furthermore, counters of PVC creation attempts and failed attempts
520
+ will be added. There should be no failures. If there are any, analyzing
521
+ the logs of kube-controller manager will provide further insights into
522
+ the reason why they occurred.
523
+
524
+ ```
525
+ ephemeral_volume_controller_create_total 1
526
+ ephemeral_volume_controller_create_failures_total 0
527
+ ```
499
528
500
529
* ** What are the reasonable SLOs (Service Level Objectives) for the above SLIs?**
501
530
0 commit comments