Skip to content

Conversation

@dtapiacl
Copy link
Contributor

@dtapiacl dtapiacl commented May 7, 2025

  • Install openvpn db instance.
  • Performs authenticated pulls to help mitigate Docker Hub rate limits by associating the image pulls with a Docker Hub account.
  • I currently have a working, tested instance running on Ruka with authenticated Docker pull.

@dtapiacl dtapiacl force-pushed the IT-6153_openvpn_db branch 7 times, most recently from 805dd60 to 06b3810 Compare May 7, 2025 18:14
@dtapiacl dtapiacl self-assigned this May 7, 2025
@dtapiacl dtapiacl added the enhancement New feature or request label May 7, 2025
@dtapiacl dtapiacl force-pushed the IT-6153_openvpn_db branch 5 times, most recently from ffcd9ae to 02f9cf5 Compare May 9, 2025 17:14
@dtapiacl dtapiacl force-pushed the IT-6153_openvpn_db branch 2 times, most recently from 74f6a28 to 31dc4d2 Compare May 9, 2025 18:07
@dtapiacl dtapiacl marked this pull request as ready for review May 9, 2025 18:56
@dtapiacl dtapiacl force-pushed the IT-6153_openvpn_db branch 5 times, most recently from 3d6e6c9 to 250961d Compare May 12, 2025 20:19
Copy link
Member

@jhoblitt jhoblitt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is the bitnami chart being used instead of mariadb-operator per https://rubin-obs.slack.com/archives/C07RXFUPVAA/p1745964809373419?thread_ts=1745857564.357769&cid=C07RXFUPVAA ?

This is deploying a single pod, which will go down when a node is drained, crashes, etc., which isn't acceptable for a production deployment. https://rubin-obs.slack.com/archives/C07RXFUPVAA/p1745966019358389?thread_ts=1745857564.357769&cid=C07RXFUPVAA

What is the backup mechanism for the database?

What is the relationship between this PR and lsst-it/lsst-control#1707 ?

@csilva-cl
Copy link
Contributor

Why is the bitnami chart being used instead of mariadb-operator per https://rubin-obs.slack.com/archives/C07RXFUPVAA/p1745964809373419?thread_ts=1745857564.357769&cid=C07RXFUPVAA ?

It doesnt look there's an official mariadb-operator. Bitnami is the chart with more stars and it's being actively developed, there's also plenty of testing with this chart. I would leave it as it is.

@dtapiacl
Copy link
Contributor Author

dtapiacl commented May 13, 2025

Why is the bitnami chart being used instead of mariadb-operator per https://rubin-obs.slack.com/archives/C07RXFUPVAA/p1745964809373419?thread_ts=1745857564.357769&cid=C07RXFUPVAA ?

I have a tested working instance in Ruka of the current helm chart. I'm sure there is stuff that needs to be ironed out, which I'm working on based on the feedback provided, but I would like to continue working with what I have at the moment, try to get this deployed as soon as possible, as we are running out of time. I

This is deploying a single pod, which will go down when a node is drained, crashes, etc., which isn't acceptable for a production deployment. https://rubin-obs.slack.com/archives/C07RXFUPVAA/p1745966019358389?thread_ts=1745857564.357769&cid=C07RXFUPVAA

I will set up a secondary replica count for this deployment based on values provided by the chart so a single pod is not deployed.

What is the backup mechanism for the database?

I'm going to use S3 backup method similar to what Carlos does with CNPG, I will be working on this today.

What is the relationship between this PR and lsst-it/lsst-control#1707 ?

This PR is needed as it deploys the DB in wich PR lsst-it/lsst-control#1707 depends on for the OVPN cluster setup. The cluster setup is done through the GUI of OVPN admin portal once VPN server is installed.

@jhoblitt
Copy link
Member

Why is the bitnami chart being used instead of mariadb-operator per https://rubin-obs.slack.com/archives/C07RXFUPVAA/p1745964809373419?thread_ts=1745857564.357769&cid=C07RXFUPVAA ?

It doesnt look there's an official mariadb-operator. Bitnami is the chart with more stars and it's being actively developed, there's also plenty of testing with this chart. I would leave it as it is.

There is an official operator.

https://github.com/mariadb-operator/mariadb-operator

The main concern is that there are no replicas, failover, or backups.

Copy link
Member

@jhoblitt jhoblitt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are 3 things required for a production database.

  • online replicas
  • automated failover between those replicas and/or automation such that failover of any 2 k8s nodes doesn't take the database offline
  • automated backups (and a tested and known working db restore procedure)

@dtapiacl dtapiacl force-pushed the IT-6153_openvpn_db branch from 250961d to f3c8941 Compare May 13, 2025 18:23
@dtapiacl dtapiacl force-pushed the IT-6153_openvpn_db branch from f3c8941 to 53edb89 Compare May 13, 2025 18:37
@dtapiacl
Copy link
Contributor Author

Ok i have tested the replication offered by the chart and it is not what we need as it only offers replicas for the secondary pods in a master slave setup Primary still could fail. I will put this activity on hold for now this week and resume next week using the mariadb-operator suggested by @jhoblitt.

cc: @csilva-cl

@dtapiacl dtapiacl closed this May 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants