@JakeSCahill JakeSCahill commented Jan 27, 2025

@JakeSCahill JakeSCahill requested a review from a team as a code owner January 27, 2025 11:27
@JakeSCahill JakeSCahill requested a review from pgellert January 27, 2025 11:27
netlify bot commented Jan 27, 2025

Deploy Preview for redpanda-docs-preview ready!

Name Link
🔨 Latest commit 0e371ef
🔍 Latest deploy log https://app.netlify.com/sites/redpanda-docs-preview/deploys/6799c467f172b50008cd9d84
😎 Deploy Preview https://deploy-preview-966--redpanda-docs-preview.netlify.app

@pgellert pgellert left a comment

Thanks, the content looks good, I just left some minor suggestions.

To prevent infinite crash loops, the Redpanda Helm chart sets the `crash_loop_limit` node property to 5. The crash loop limit is the number of consecutive crashes that can happen within one hour of each other. After Redpanda reaches this limit, it will not start until its internal consecutive crash counter is reset to zero. In Kubernetes, the Pod running Redpanda remains in a `CrashLoopBackoff` state until its internal consecutive crash counter is reset to zero.
To prevent infinite crash loops, the Redpanda Helm chart sets the xref:reference:properties/broker-properties.adoc#crash_loop_limit[`crash_loop_limit`] broker configuration property to `5`. The crash loop limit is the number of consecutive crashes that can happen within one hour of each other. By default, the broker terminates immediately after hitting the `crash_loop_limit`. The Pod running Redpanda remains in a `CrashLoopBackoff` state until its internal consecutive crash counter is reset to zero.

To facilitate debugging in environments where a broker is stuck in a crash loop, you can also set the xref:reference:properties/broker-properties.adoc#crash_loop_sleep_sec[`crash_loop_sleep_sec`] broker configuration property. This setting determines how long the broker sleeps before terminating the process after reaching the crash loop limit. The sleep provides a window during which the Pod remains in a paused state, so that you can SSH into the Pod and troubleshoot the issue.

@chrisseto fyi when you implement K8S-491 to set crash_loop_sleep_sec to be on by default in the Redpanda Helm chart, then this paragraph will probably need updating.
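
For illustration, here is a minimal sketch of how the two properties discussed in this thread might be set together in a Helm values override. The `config.node` placement of `crash_loop_sleep_sec` is an assumption based on both settings being broker properties, and the `60` second sleep is an illustrative value; neither is confirmed by this PR.

```yaml
# Hypothetical Helm values override; the key placement and the sleep value are assumptions.
config:
  node:
    # The value the paragraph above says the chart sets.
    crash_loop_limit: 5
    # Assumed to sit next to crash_loop_limit because both are broker properties.
    # 60 is an illustrative value, not a recommendation from this PR.
    crash_loop_sleep_sec: 60
```

If the chart later enables `crash_loop_sleep_sec` by default, as the comment above anticipates, an override like this may no longer be needed.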

Co-authored-by: Gellért Peresztegi-Nagy <[email protected]>
@JakeSCahill JakeSCahill requested a review from pgellert January 27, 2025 17:03
@JakeSCahill JakeSCahill commented

Thanks @pgellert

@pgellert pgellert left a comment

lgtm

@Feediver1 Feediver1 left a comment

lgtm

@JakeSCahill JakeSCahill merged commit bfb3e0a into main Jan 29, 2025
6 of 7 checks passed
@JakeSCahill JakeSCahill deleted the DOC-975 branch January 29, 2025 06:03
@kbatuigas kbatuigas mentioned this pull request Apr 7, 2025
4 tasks