Skip to content

docs( cluster ): Improved Runbooks and monitoring#774

Merged
itay-grudev merged 16 commits intocloudnative-pg:mainfrom
paradedb:phil/improved-runbooks
Jan 15, 2026
Merged

docs( cluster ): Improved Runbooks and monitoring#774
itay-grudev merged 16 commits intocloudnative-pg:mainfrom
paradedb:phil/improved-runbooks

Conversation

@philippemnoel
Copy link
Copy Markdown
Contributor

@philippemnoel philippemnoel commented Jan 2, 2026

Here @paradedb we've been building our Helm Chart based off of the Cluster chart for the past year. We run them in production on clusters in the 10TBs+ range, many of which ingest via logical replication. While doing this, we've developed improved runbooks and Prometheus rules to help with operating the database. I figured they could benefit the wider CNPG community, and so here they are. They are primarily rules and runbooks around physical and logical replication.

Here is the original commit in our fork, in case I missed anything: 917bb7f

@philippemnoel philippemnoel force-pushed the phil/improved-runbooks branch from dd9e58d to 142e5b3 Compare January 2, 2026 17:33
@philippemnoel philippemnoel marked this pull request as ready for review January 2, 2026 17:38
@dosubot dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. chart( cluster ) Related to the cluster chart labels Jan 2, 2026
philippemnoel and others added 15 commits January 15, 2026 13:50
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Zied ABID <zied.abid@jobteaser.com>
Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
Co-authored-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
…loudnative-pg#650)

Signed-off-by: Ryan Yin <xiaoyin_c@qq.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Itay Grudev <itay@verito.digital>
@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Jan 15, 2026
@itay-grudev itay-grudev merged commit 2c85b15 into cloudnative-pg:main Jan 15, 2026
18 checks passed
@itay-grudev itay-grudev deleted the phil/improved-runbooks branch January 15, 2026 12:06
alexlebens pushed a commit to alexlebens/infrastructure that referenced this pull request Feb 4, 2026
This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [plugin-barman-cloud](https://cloudnative-pg.io) ([source](https://github.com/cloudnative-pg/charts)) | minor | `0.4.0` → `0.5.0` |

---

### Release Notes

<details>
<summary>cloudnative-pg/charts (plugin-barman-cloud)</summary>

### [`v0.5.0`](https://github.com/cloudnative-pg/charts/releases/tag/plugin-barman-cloud-v0.5.0)

[Compare Source](cloudnative-pg/charts@plugin-barman-cloud-v0.4.0...plugin-barman-cloud-v0.5.0)

Helm Chart for CloudNativePG's CNPG-I backup plugin using Barman Cloud

#### What's Changed

- docs: add security policy by [@&#8203;zied-jt](https://github.com/zied-jt) in [#&#8203;621](cloudnative-pg/charts#621)
- fix(cloudnative-pg): queries identation in cnpg-default-monitoring by [@&#8203;ryan4yin](https://github.com/ryan4yin) in [#&#8203;650](cloudnative-pg/charts#650)
- docs( cluster ): Improved Runbooks and monitoring by [@&#8203;philippemnoel](https://github.com/philippemnoel) in [#&#8203;774](cloudnative-pg/charts#774)
- feat(plugin-barman-cloud): Add imagePullSecrets to deployment.yaml by [@&#8203;faymard](https://github.com/faymard) in [#&#8203;704](cloudnative-pg/charts#704)
- feat([#&#8203;727](cloudnative-pg/charts#727)): add container and pod security context configuration via values by [@&#8203;Fuochi](https://github.com/Fuochi) in [#&#8203;728](cloudnative-pg/charts#728)
- chore(deps): update Grafana Helm Chart dependency version by [@&#8203;justsomescripts](https://github.com/justsomescripts) in [#&#8203;784](cloudnative-pg/charts#784)
- feat( cluster ): Add support for console stateful set by [@&#8203;philippemnoel](https://github.com/philippemnoel) in [#&#8203;782](cloudnative-pg/charts#782)
- Release plugin-barman-cloud-v0.5.0 by [@&#8203;cnpg-bot](https://github.com/cnpg-bot) in [#&#8203;803](cloudnative-pg/charts#803)

#### New Contributors

- [@&#8203;zied-jt](https://github.com/zied-jt) made their first contribution in [#&#8203;621](cloudnative-pg/charts#621)
- [@&#8203;ryan4yin](https://github.com/ryan4yin) made their first contribution in [#&#8203;650](cloudnative-pg/charts#650)
- [@&#8203;faymard](https://github.com/faymard) made their first contribution in [#&#8203;704](cloudnative-pg/charts#704)
- [@&#8203;Fuochi](https://github.com/Fuochi) made their first contribution in [#&#8203;728](cloudnative-pg/charts#728)
- [@&#8203;justsomescripts](https://github.com/justsomescripts) made their first contribution in [#&#8203;784](cloudnative-pg/charts#784)

**Full Changelog**: <cloudnative-pg/charts@plugin-barman-cloud-v0.4.0...plugin-barman-cloud-v0.5.0>

</details>

---

### Configuration

📅 **Schedule**: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 **Automerge**: Disabled by config. Please merge this manually once you are satisfied.

♻ **Rebasing**: Whenever PR is behind base branch, or you tick the rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about this update again.

---

 - [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check this box

---

This PR has been generated by [Renovate Bot](https://github.com/renovatebot/renovate).
<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiI0My4wLjIiLCJ1cGRhdGVkSW5WZXIiOiI0My4wLjIiLCJ0YXJnZXRCcmFuY2giOiJtYWluIiwibGFiZWxzIjpbImNoYXJ0Il19-->

Reviewed-on: https://gitea.alexlebens.dev/alexlebens/infrastructure/pulls/3716
Co-authored-by: Renovate Bot <renovate-bot@alexlebens.net>
Co-committed-by: Renovate Bot <renovate-bot@alexlebens.net>
ehbello added a commit to maarlab-rethinking/cloudnative-pg-charts that referenced this pull request Feb 12, 2026
* fix(ci): use PAT instead of GITHUB_TOKEN in release-pr workflow (cloudnative-pg#754)

PRs created with GITHUB_TOKEN don't trigger other workflows. Using
REPO_GHA_PAT instead allows the continuous-delivery and lint workflows
to run properly on release PRs.

Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>

* docs: add plugin-barman-cloud chart release instructions (cloudnative-pg#717)

Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>

* chore(cloudnative-pg): use port names rather than numbers (cloudnative-pg#765)

Signed-off-by: Pat Riehecky <riehecky@fnal.gov>
Co-authored-by: Itay Grudev <itay@verito.digital>

* Release plugin-barman-cloud-v0.4.0 (cloudnative-pg#767)

Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>
Co-authored-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>

* docs: add security policy (cloudnative-pg#621)

Signed-off-by: Zied ABID <zied.abid@jobteaser.com>
Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
Co-authored-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>

* fix(cloudnative-pg): queries identation in `cnpg-default-monitoring` (cloudnative-pg#650)

Signed-off-by: Ryan Yin <xiaoyin_c@qq.com>

* docs( cluster ): Improved Runbooks and monitoring (cloudnative-pg#774)

Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>

* feat(plugin-barman-cloud): Add imagePullSecrets to deployment.yaml (cloudnative-pg#704)

Signed-off-by: Florian Aymard <florian.aymard@batch.com>
Co-authored-by: Itay Grudev <itay@verito.digital>

* feat(cloudnative-pg#727): add container and pod security context configuration via values (cloudnative-pg#728)

Signed-off-by: Fuochi <ffuochi@hotmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Co-authored-by: Itay Grudev <itay@verito.digital>

* chore(deps): update Grafana Helm Chart dependency version (cloudnative-pg#784)

Signed-off-by: Itay Grudev <itay@verito.digital>
Co-authored-by: Itay Grudev <itay@verito.digital>

* feat( cluster ): Add support for console stateful set (cloudnative-pg#782)

Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Co-authored-by: Itay Grudev <itay@verito.digital>

* Release plugin-barman-cloud-v0.5.0 (cloudnative-pg#803)

Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>
Co-authored-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>

* Release cloudnative-pg-v0.27.1 (cloudnative-pg#804)

Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>
Co-authored-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>

* fix(cluster): document plugins and fix replica secret indentation

Add documentation for the plugins configuration in values.yaml.
Correct the indentation of the replica password secret key to ensure
it is correctly nested within the passwordSecret object.

Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com>

* feat(crds): add selectorLabels to enable VPA support

Add the selectorLabels field to the Cluster status schema and configure
the labelSelectorPath in the scale subresource. This enables support
for Vertical Pod Autoscaler (VPA) by allowing it to query pods managed
by the operator.

Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com>

* build: switch cloudnative-pg image to maarlab-rethinking fork

Update the image repository to ghcr.io/maarlab-rethinking/cloudnative-pg
across Renovate configuration, Helm chart values, and documentation.
Additionally, bump the appVersion to 1.28.1-maarlab4 to track the
forked version.

Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com>

* docs(helm): update repository URL in documentation

Update the CloudNativePG Helm repository URL from the official
GitHub Pages location to the maarlab-rethinking repository across all
README files, release guides, and example commands.

Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com>

* test(console-statefulset): wait for PostgreSQL connections in chainsaw test

Add a retry loop to ensure PostgreSQL is accepting connections before
executing the long-running query in the console-statefulset chainsaw
test. This prevents race conditions where the pod is ready but the
database service is not yet fully initialized.

Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com>

* Release cloudnative-pg-v0.27.1-maarlab1

Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com>

---------

Signed-off-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>
Signed-off-by: Pat Riehecky <riehecky@fnal.gov>
Signed-off-by: Zied ABID <zied.abid@jobteaser.com>
Signed-off-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
Signed-off-by: Ryan Yin <xiaoyin_c@qq.com>
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Signed-off-by: Florian Aymard <florian.aymard@batch.com>
Signed-off-by: Fuochi <ffuochi@hotmail.com>
Signed-off-by: Enrique Hernández Bello <ehernandez@maarlab.com>
Co-authored-by: Marco Nenciarini <marco.nenciarini@enterprisedb.com>
Co-authored-by: Pat Riehecky <3534830+jcpunk@users.noreply.github.com>
Co-authored-by: Itay Grudev <itay@verito.digital>
Co-authored-by: Peggie <info@cloudnative-pg.io>
Co-authored-by: Zied ABID <58431623+zied-jt@users.noreply.github.com>
Co-authored-by: Gabriele Bartolini <gabriele.bartolini@enterprisedb.com>
Co-authored-by: Ryan Yin <xiaoyin_c@qq.com>
Co-authored-by: Philippe Noël <21990816+philippemnoel@users.noreply.github.com>
Co-authored-by: Florian Aymard <florian.aymard@gmail.com>
Co-authored-by: Fuochi <ffuochi@hotmail.com>
Co-authored-by: David Gries <mail@dgries.de>
Co-authored-by: Enrique Hernández Bello <ehernandez@maarlab.com>
ehbello pushed a commit to maarlab-rethinking/cloudnative-pg-charts that referenced this pull request Feb 13, 2026
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
maarlab pushed a commit to maarlab-rethinking/cloudnative-pg-charts that referenced this pull request Feb 13, 2026
Signed-off-by: Philippe Noël <philippemnoel@gmail.com>
Signed-off-by: Itay Grudev <itay@verito.digital>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

chart( cluster ) Related to the cluster chart lgtm This PR has been approved by a maintainer size:XXL This PR changes 1000+ lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants