docs( cluster ): Improved Runbooks and monitoring by philippemnoel · Pull Request #774 · cloudnative-pg/charts

philippemnoel · 2026-01-02T17:12:45Z

Here @paradedb we've been building our Helm Chart based off of the Cluster chart for the past year. We run them in production on clusters in the 10TBs+ range, many of which ingest via logical replication. While doing this, we've developed improved runbooks and Prometheus rules to help with operating the database. I figured they could benefit the wider CNPG community, and so here they are. They are primarily rules and runbooks around physical and logical replication.

Here is the original commit in our fork, in case I missed anything: 917bb7f

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

Signed-off-by: Itay Grudev <itay@verito.digital>

Signed-off-by: Zied ABID <zied.abid@jobteaser.com> Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com> Co-authored-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com> Signed-off-by: Itay Grudev <itay@verito.digital>

…loudnative-pg#650) Signed-off-by: Ryan Yin <xiaoyin_c@qq.com> Signed-off-by: Itay Grudev <itay@verito.digital>

Signed-off-by: Itay Grudev <itay@verito.digital>

This PR contains the following updates: | Package | Update | Change | |---|---|---| | [plugin-barman-cloud](https://cloudnative-pg.io) ([source](https://github.com/cloudnative-pg/charts)) | minor | `0.4.0` → `0.5.0` | --- ### Release Notes <details> <summary>cloudnative-pg/charts (plugin-barman-cloud)</summary> ### [`v0.5.0`](https://github.com/cloudnative-pg/charts/releases/tag/plugin-barman-cloud-v0.5.0) [Compare Source](cloudnative-pg/charts@plugin-barman-cloud-v0.4.0...plugin-barman-cloud-v0.5.0) Helm Chart for CloudNativePG's CNPG-I backup plugin using Barman Cloud #### What's Changed - docs: add security policy by [@zied-jt](https://github.com/zied-jt) in [#621](cloudnative-pg/charts#621) - fix(cloudnative-pg): queries identation in cnpg-default-monitoring by [@ryan4yin](https://github.com/ryan4yin) in [#650](cloudnative-pg/charts#650) - docs( cluster ): Improved Runbooks and monitoring by [@philippemnoel](https://github.com/philippemnoel) in [#774](cloudnative-pg/charts#774) - feat(plugin-barman-cloud): Add imagePullSecrets to deployment.yaml by [@faymard](https://github.com/faymard) in [#704](cloudnative-pg/charts#704) - feat([#727](cloudnative-pg/charts#727)): add container and pod security context configuration via values by [@Fuochi](https://github.com/Fuochi) in [#728](cloudnative-pg/charts#728) - chore(deps): update Grafana Helm Chart dependency version by [@justsomescripts](https://github.com/justsomescripts) in [#784](cloudnative-pg/charts#784) - feat( cluster ): Add support for console stateful set by [@philippemnoel](https://github.com/philippemnoel) in [#782](cloudnative-pg/charts#782) - Release plugin-barman-cloud-v0.5.0 by [@cnpg-bot](https://github.com/cnpg-bot) in [#803](cloudnative-pg/charts#803) #### New Contributors - [@zied-jt](https://github.com/zied-jt) made their first contribution in [#621](cloudnative-pg/charts#621) - [@ryan4yin](https://github.com/ryan4yin) made their first contribution in [#650](cloudnative-pg/charts#650) - [@faymard](https://github.com/faymard) made their first contribution in [#704](cloudnative-pg/charts#704) - [@Fuochi](https://github.com/Fuochi) made their first contribution in [#728](cloudnative-pg/charts#728) - [@justsomescripts](https://github.com/justsomescripts) made their first contribution in [#784](cloudnative-pg/charts#784) **Full Changelog**: <cloudnative-pg/charts@plugin-barman-cloud-v0.4.0...plugin-barman-cloud-v0.5.0> </details> --- ### Configuration 📅 **Schedule**: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined). 🚦 **Automerge**: Disabled by config. Please merge this manually once you are satisfied. ♻ **Rebasing**: Whenever PR is behind base branch, or you tick the rebase/retry checkbox. 🔕 **Ignore**: Close this PR and you won't be reminded about this update again. --- - [ ] If you want to rebase/retry this PR, check this box --- This PR has been generated by [Renovate Bot](https://github.com/renovatebot/renovate).  Reviewed-on: https://gitea.alexlebens.dev/alexlebens/infrastructure/pulls/3716 Co-authored-by: Renovate Bot <renovate-bot@alexlebens.net> Co-committed-by: Renovate Bot <renovate-bot@alexlebens.net>

* fix(ci): use PAT instead of GITHUB_TOKEN in release-pr workflow (cloudnative-pg#754) PRs created with GITHUB_TOKEN don't trigger other workflows. Using REPO_GHA_PAT instead allows the continuous-delivery and lint workflows to run properly on release PRs. Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> * docs: add plugin-barman-cloud chart release instructions (cloudnative-pg#717) Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> * chore(cloudnative-pg): use port names rather than numbers (cloudnative-pg#765) Signed-off-by: Pat Riehecky <riehecky@fnal.gov> Co-authored-by: Itay Grudev <itay@verito.digital> * Release plugin-barman-cloud-v0.4.0 (cloudnative-pg#767) Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> Co-authored-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> * docs: add security policy (cloudnative-pg#621) Signed-off-by: Zied ABID <zied.abid@jobteaser.com> Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com> Co-authored-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com> * fix(cloudnative-pg): queries identation in `cnpg-default-monitoring` (cloudnative-pg#650) Signed-off-by: Ryan Yin <xiaoyin_c@qq.com> * docs( cluster ): Improved Runbooks and monitoring (cloudnative-pg#774) Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital> * feat(plugin-barman-cloud): Add imagePullSecrets to deployment.yaml (cloudnative-pg#704) Signed-off-by: Florian Aymard <florian.aymard@batch.com> Co-authored-by: Itay Grudev <itay@verito.digital> * feat(cloudnative-pg#727): add container and pod security context configuration via values (cloudnative-pg#728) Signed-off-by: Fuochi <ffuochi@hotmail.com> Signed-off-by: Itay Grudev <itay@verito.digital> Co-authored-by: Itay Grudev <itay@verito.digital> * chore(deps): update Grafana Helm Chart dependency version (cloudnative-pg#784) Signed-off-by: Itay Grudev <itay@verito.digital> Co-authored-by: Itay Grudev <itay@verito.digital> * feat( cluster ): Add support for console stateful set (cloudnative-pg#782) Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital> Co-authored-by: Itay Grudev <itay@verito.digital> * Release plugin-barman-cloud-v0.5.0 (cloudnative-pg#803) Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> Co-authored-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> * Release cloudnative-pg-v0.27.1 (cloudnative-pg#804) Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> Co-authored-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> * fix(cluster): document plugins and fix replica secret indentation Add documentation for the plugins configuration in values.yaml. Correct the indentation of the replica password secret key to ensure it is correctly nested within the passwordSecret object. Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com> * feat(crds): add selectorLabels to enable VPA support Add the selectorLabels field to the Cluster status schema and configure the labelSelectorPath in the scale subresource. This enables support for Vertical Pod Autoscaler (VPA) by allowing it to query pods managed by the operator. Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com> * build: switch cloudnative-pg image to maarlab-rethinking fork Update the image repository to ghcr.io/maarlab-rethinking/cloudnative-pg across Renovate configuration, Helm chart values, and documentation. Additionally, bump the appVersion to 1.28.1-maarlab4 to track the forked version. Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com> * docs(helm): update repository URL in documentation Update the CloudNativePG Helm repository URL from the official GitHub Pages location to the maarlab-rethinking repository across all README files, release guides, and example commands. Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com> * test(console-statefulset): wait for PostgreSQL connections in chainsaw test Add a retry loop to ensure PostgreSQL is accepting connections before executing the long-running query in the console-statefulset chainsaw test. This prevents race conditions where the pod is ready but the database service is not yet fully initialized. Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com> * Release cloudnative-pg-v0.27.1-maarlab1 Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com> --------- Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> Signed-off-by: Pat Riehecky <riehecky@fnal.gov> Signed-off-by: Zied ABID <zied.abid@jobteaser.com> Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com> Signed-off-by: Ryan Yin <xiaoyin_c@qq.com> Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital> Signed-off-by: Florian Aymard <florian.aymard@batch.com> Signed-off-by: Fuochi <ffuochi@hotmail.com> Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com> Co-authored-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com> Co-authored-by: Pat Riehecky <3534830+jcpunk@users.noreply.github.com> Co-authored-by: Itay Grudev <itay@verito.digital> Co-authored-by: Peggie <info@cloudnative-pg.io> Co-authored-by: Zied ABID <58431623+zied-jt@users.noreply.github.com> Co-authored-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com> Co-authored-by: Ryan Yin <xiaoyin_c@qq.com> Co-authored-by: Philippe Noël <21990816+philippemnoel@users.noreply.github.com> Co-authored-by: Florian Aymard <florian.aymard@gmail.com> Co-authored-by: Fuochi <ffuochi@hotmail.com> Co-authored-by: David Gries <mail@dgries.de> Co-authored-by: Enrique Hernández Bello <ehernandez@maarlab.com>

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

philippemnoel force-pushed the phil/improved-runbooks branch from dd9e58d to 142e5b3 Compare January 2, 2026 17:33

philippemnoel marked this pull request as ready for review January 2, 2026 17:38

philippemnoel requested a review from itay-grudev as a code owner January 2, 2026 17:38

dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. chart( cluster ) Related to the cluster chart labels Jan 2, 2026

philippemnoel and others added 15 commits January 15, 2026 13:50

Improved runbooks

f10d57f

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

DSstore

e25599f

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

More files

5029542

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

DS Store

0cabd19

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

Rm paradedb dashboard

5e1902d

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

Rm ParadeDB

03520a4

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

Rm more ParadeDB

43ccc99

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

Rm more paradedb

a8cc64a

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

Nits

b675b1c

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

Remove BYOC references

84048de

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

no paradedb left

9f61e91

Signed-off-by: Philippe Noël <philippemnoel@gmail.com> Signed-off-by: Itay Grudev <itay@verito.digital>

fix: prometheus rule description unit ms -> s

dcfc8ea

Signed-off-by: Itay Grudev <itay@verito.digital>

fix(cloudnative-pg): queries identation in cnpg-default-monitoring (c…

1a4651d

…loudnative-pg#650) Signed-off-by: Ryan Yin <xiaoyin_c@qq.com> Signed-off-by: Itay Grudev <itay@verito.digital>

added missing instrumentation option

adf4b51

Signed-off-by: Itay Grudev <itay@verito.digital>

itay-grudev force-pushed the phil/improved-runbooks branch from 8c1ac35 to adf4b51 Compare January 15, 2026 11:51

itay-grudev requested review from fcanovai, gbartolini, leonardoce, mnencia, phisco and sxd as code owners January 15, 2026 11:51

Merge branch 'main' into phil/improved-runbooks

c5a0dc9

itay-grudev approved these changes Jan 15, 2026

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Jan 15, 2026

itay-grudev merged commit 2c85b15 into cloudnative-pg:main Jan 15, 2026
18 checks passed

itay-grudev deleted the phil/improved-runbooks branch January 15, 2026 12:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs( cluster ): Improved Runbooks and monitoring#774

docs( cluster ): Improved Runbooks and monitoring#774
itay-grudev merged 16 commits intocloudnative-pg:mainfrom
paradedb:phil/improved-runbooks

philippemnoel commented Jan 2, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

philippemnoel commented Jan 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

philippemnoel commented Jan 2, 2026 •

edited

Loading