- 
                Notifications
    
You must be signed in to change notification settings  - Fork 381
 
OCPBUGS-34568,OCPBUGS-35095,OCPBUGS-60689,OCPBUGS-60691,OCPBUGS-60692: non-HA alert cases #2630
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OCPBUGS-34568,OCPBUGS-35095,OCPBUGS-60689,OCPBUGS-60691,OCPBUGS-60692: non-HA alert cases #2630
Conversation
| 
           @rexagod: This pull request references Jira Issue OCPBUGS-34568, which is invalid: 
 Comment  The bug has been updated to refer to the pull request using the external bug tracker. This pull request references Jira Issue OCPBUGS-35095, which is invalid: 
 Comment  The bug has been updated to refer to the pull request using the external bug tracker. In response to this: 
 Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.  | 
    
| 
           /jira refresh  | 
    
| 
           @rexagod: This pull request references Jira Issue OCPBUGS-34568, which is valid. 3 validation(s) were run on this bug
 Requesting review from QA contact: This pull request references Jira Issue OCPBUGS-35095, which is valid. 3 validation(s) were run on this bug
 Requesting review from QA contact: In response to this: 
 Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.  | 
    
682569b    to
    f93c51c      
    Compare
  
    | 
           /retest-required  | 
    
4c7aca4    to
    8d1fadc      
    Compare
  
    | 
           @rexagod could you rebase on main to get rid of the version changes which make it harder to review?  | 
    
Pulls in changes from [1], which refactors alerts to accomodate for non-HA cases. [1]: kubernetes-monitoring/kubernetes-mixin#1010 Signed-off-by: Pranshu Srivastava <[email protected]>
a34b00d    to
    49eed1d      
    Compare
  
    There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
/hold
I see that the bump adds a couple of alerting rules (most of them at info level). Maybe it's worth listing them in the CHANGELOG? Also how confident are we that these new rules won't interfere with the CI (e.g. the origin e2e tests will fail if they detect firing alerts)?
| pvExcludedSelector: 'label_alerts_k8s_io_kube_persistent_volume_filling_up="disabled"', | ||
| containerfsSelector: 'id!=""', | ||
| clusterLabel: $.values.common.dashboardClusterLabel, | ||
| showMultiCluster: false, // Opt-out of multi-cluster dashboards (opted-in by midstream kube-prometheus) | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍
| 
           /payload-aggregate periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-aws-ovn 3  | 
    
| 
           @rexagod: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command 
 See details on https://pr-payload-tests.ci.openshift.org/runs/ci/c82b9a40-7821-11f0-9bd2-835c3a7ddd94-0  | 
    
| 
           /payload-job periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-aws-ovn /payload-aggregate periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-aws-ovn 6  | 
    
| 
           @rexagod: trigger 2 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command 
 See details on https://pr-payload-tests.ci.openshift.org/runs/ci/4f47f430-7839-11f0-849e-5581308be2cc-0  | 
    
| 
           /payload-job periodic-ci-openshift-release-master-ci-4.20-e2e-aws-ovn  | 
    
| 
           @rexagod: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command 
 See details on https://pr-payload-tests.ci.openshift.org/runs/ci/9966f4c0-7baf-11f0-8d35-5666afe3a7b2-0  | 
    
          
 @simonpasquier All origin monitoring tests are passing (such as   | 
    
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The jsonnet bump also brings a few (useful!) fixes:
- kubernetes-monitoring/kubernetes-mixin#1026
 - kubernetes-monitoring/kubernetes-mixin#991
 - kubernetes-monitoring/kubernetes-mixin#1012
 
Do you know if we have OCPBUGS tickets opened for them? If not, should we create them and link to this PR for traceability?
/lgtm
/hold
| 
           [APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: rexagod, simonpasquier The full list of commands accepted by this bot can be found here. The pull request process is described here 
Needs approval from an approver in each of these files:
 
      
 Approvers can indicate their approval by writing   | 
    
| 
           /retest-required  | 
    
| 
           @rexagod: This pull request references Jira Issue OCPBUGS-34568, which is valid. 3 validation(s) were run on this bug
 Requesting review from QA contact: The bug has been updated to refer to the pull request using the external bug tracker. This pull request references Jira Issue OCPBUGS-35095, which is valid. 3 validation(s) were run on this bug
 Requesting review from QA contact: The bug has been updated to refer to the pull request using the external bug tracker. This pull request references Jira Issue OCPBUGS-60689, which is valid. 3 validation(s) were run on this bug
 Requesting review from QA contact: The bug has been updated to refer to the pull request using the external bug tracker. This pull request references Jira Issue OCPBUGS-60691, which is valid. 3 validation(s) were run on this bug
 Requesting review from QA contact: The bug has been updated to refer to the pull request using the external bug tracker. This pull request references Jira Issue OCPBUGS-60692, which is valid. 3 validation(s) were run on this bug
 Requesting review from QA contact: The bug has been updated to refer to the pull request using the external bug tracker. In response to this: 
 Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.  | 
    
| 
           Created tickets for the aforementioned PRs (since there were no existing ones that tracked them), PTAL.  | 
    
| 
           /hold cancel  | 
    
    
      
        2 similar comments
      
    
  
    | 
           @rexagod: The following tests failed, say  
 Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.  | 
    
| 
           @rexagod: Jira Issue OCPBUGS-34568: All pull requests linked via external trackers have merged: Jira Issue OCPBUGS-34568 has been moved to the MODIFIED state. Jira Issue OCPBUGS-35095: All pull requests linked via external trackers have merged: Jira Issue OCPBUGS-35095 has been moved to the MODIFIED state. Jira Issue OCPBUGS-60689: All pull requests linked via external trackers have merged: Jira Issue OCPBUGS-60689 has been moved to the MODIFIED state. Jira Issue OCPBUGS-60691: All pull requests linked via external trackers have merged: Jira Issue OCPBUGS-60691 has been moved to the MODIFIED state. Jira Issue OCPBUGS-60692: All pull requests linked via external trackers have merged: Jira Issue OCPBUGS-60692 has been moved to the MODIFIED state. In response to this: 
 Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.  | 
    
| 
           [ART PR BUILD NOTIFIER] Distgit: cluster-monitoring-operator  | 
    
| 
           /cherrypick release-4.19,release-4.18,release-4.17,release-4.16  | 
    
| 
           @rexagod: cannot checkout  In response to this: 
 Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.  | 
    
| 
           /cherrypick release-4.19 release-4.18 release-4.17 release-4.16  | 
    
| 
           @rexagod: #2630 failed to apply on top of branch "release-4.19": In response to this: 
 Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.  | 
    
Pulls in changes from 1, which refactors alerts to accomodate for non-HA cases.