-
Notifications
You must be signed in to change notification settings - Fork 6
Introduce Logging Stack: Add fluentd, add loki (🚧🚧 DEVOPS 🚧🚧) #1058
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Introduce Logging Stack: Add fluentd, add loki (🚧🚧 DEVOPS 🚧🚧) #1058
Conversation
…oundation#979) * Introduce longhorn chart * Further longhorn configuration * Longhorn: further settings configuration * Fix longhorn configuration bugs Extra: introduce longhorn pv vales for portainer * Add comment for deletion longhorn * Further longhorn configuration * Add README.md for Longhorn wit FAQ * Update Longhorn readme * Update readme * Futher LH configuration * Update LH's Readme * Update Longhorn Readme * Improve LH's Readme * LH: Reduce reserved default disk space to 5% Since we use a dedicated disk for LH, we can go ahead with 5% * Use values to set Longhorn storage class * Update LH's Readme * LH Readme: add requirements reference * PR Review: bring back portainer s3 pv * LH: decrease portinaer volume size
bisgaard-itis
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks a lot for the effort.
YuryHrytsuk
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks!
matusdrobuliak66
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry, I haven’t gone through it thoroughly, to unblock you a approve
|
|
@mrnicegyu11 not entirely sure here but from computational clusters end what is important is:
|
sanderegg
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
commented
YuryHrytsuk
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is the state of unchecked items? Could you add a note next to them what is the state?
Also do I get this correctly, that once this is merged and deployed to master, all logging workflows would be broken until machines are reprovisioned? The same with AMIs I presume. So, probably, once merged, DevOps must be on top of it.
All the infra changes would need to be applied on PROD, so we need clear and easy to do (difficult to make mistakes) instructions to ensure stag / prod release are done smoothly (if it gets to this stage ✊ ) @mrnicegyu11
Thanks for all the effort!
|
@mrnicegyu11 if is safer if it only gets to master and does not block future staging / prod releases (these stages will not be affected). Thanks 🙏 |
|
|
re |
|
re |
|
Required fix: bf6719e |
What do these changes do?
Logs will now be sent from the docker daemon to fluend, with
fluentdrunning as a docker swarm service inside theloggingstack, via a TCP port that is host-exposed.Fluentdwill send the logs to graylog and the newly introduced grafana loki.Syslogs from the Linux Kernel will for now only be available in graylog.
Computational clusters and the impact of this PR on them have not been fully evaluated, please comment on this quickly @sanderegg . If it doesnt impact them at all, even better.
This unblocks grafana alerting.
🚧🚧 DevOps 🚧🚧:
repo.config.template/etc/docker/daemon.jsonmust be updated.Related issue/s
Related PR/s
Checklist