-
Notifications
You must be signed in to change notification settings - Fork 302
KFLUXINFRA-2067: Push Kueue metrics to RHOBS #7563
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Code Review by Gemini--- a/components/monitoring/prometheus/base/monitoringstack/monitoringstack.yaml
+++ b/components/monitoring/prometheus/base/monitoringstack/monitoringstack.yaml
@@ -147,7 +147,7 @@
# Kueue
- '{__name__="tekton_kueue_cel_evaluations_total"}'
- '{__name__="kube_pod_container_status_restarts_total", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
- - '{__name__="kube_pod_container_status_ready", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator}'
+ - '{__name__="kube_pod_container_status_ready", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
- '{__name__="kueue_cluster_queue_status"}'
- '{__name__="kueue_admission_wait_time_seconds_bucket"}'
- '{__name__="apiserver_admission_webhook_request_total", name="pipelinerun-kueue-defaulter.tekton-kueue.io"}' Issue: Suggested change: - '{__name__="kube_pod_container_status_restarts_total", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
- - '{__name__="kube_pod_container_status_ready", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator}'
+ - '{__name__="kube_pod_container_status_ready", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
- '{__name__="kueue_cluster_queue_status"}' |
Code Review by Gemini--- a/components/monitoring/prometheus/base/monitoringstack/monitoringstack.yaml
+++ b/components/monitoring/prometheus/base/monitoringstack/monitoringstack.yaml
@@ -148,9 +148,9 @@ spec:
- '{__name__="kube_deployment_spec_replicas", namespace=~"gitops-service-argocd"}'
# Kueue
- - '{__name__="tekton_kueue_cel_evaluations_total"}'
+ - '{__name__="tekton_kueue_cel_evaluations_total", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
- '{__name__="kube_pod_container_status_restarts_total", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
- '{__name__="kube_pod_container_status_ready", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
- - '{__name__="kueue_cluster_queue_status"}'
- - '{__name__="kueue_admission_wait_time_seconds_bucket"}'
+ - '{__name__="kueue_cluster_queue_status", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
+ - '{__name__="kueue_admission_wait_time_seconds_bucket", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
- '{__name__="apiserver_admission_webhook_request_total", name="pipelinerun-kueue-defaulter.tekton-kueue.io"}'
# Namespace (expression): "build-service" |
Code Review by Gemini### Suggested Improvements
For consistency with other `kube_pod_container_status` entries in the file (e.g., for `release-service`), it is recommended to list `kube_pod_container_status_ready` before `kube_pod_container_status_restarts_total`.
**File: `components/monitoring/prometheus/base/monitoringstack/monitoringstack.yaml`**
```diff
--- a/components/monitoring/prometheus/base/monitoringstack/monitoringstack.yaml
+++ b/components/monitoring/prometheus/base/monitoringstack/monitoringstack.yaml
@@ -148,8 +148,8 @@
# Kueue
- '{__name__="tekton_kueue_cel_evaluations_total"}'
- - '{__name__="kube_pod_container_status_restarts_total", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
- - '{__name__="kube_pod_container_status_ready", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
+ - '{__name__="kube_pod_container_status_ready", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
+ - '{__name__="kube_pod_container_status_restarts_total", namespace=~"tekton-kueue|kueue-external-admission|openshift-kueue-operator"}'
- '{__name__="kueue_cluster_queue_status"}'
- '{__name__="kueue_admission_wait_time_seconds_bucket"}'
- '{__name__="apiserver_admission_webhook_request_total", name="pipelinerun-kueue-defaulter.tekton-kueue.io"}'
|
@FaisalAl-Rayes @mike-kingsbury can you ptal? |
/test appstudio-e2e-tests |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM....
@gbenhaim: The following test failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
Those metrics will be used for creating SLO alerts. Signed-off-by: Gal Ben Haim <[email protected]>
Code Review by GeminiNo bugs or issues were found in the provided code changes. The additions are appropriate for pushing Kueue metrics and correctly ensure that necessary labels are preserved.
|
@mike-kingsbury I rebased the PR. I also modified the |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: gbenhaim, mike-kingsbury The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
88cea78
into
redhat-appstudio:main
Those metrics will be used for creating SLO alerts.