refactor(kubernetes): simplify cleanLogsCollector to skip if daemonset is not found by barnabasbusa · Pull Request #2915 · kurtosis-tech/kurtosis

barnabasbusa · 2026-02-25T09:37:05Z

PR Summary: Fix kurtosis clean -a hanging on k8s clusters with tainted/unhealthy nodes

Problem

kurtosis clean -a hangs indefinitely on Kubernetes clusters where some nodes have taints (e.g. DiskPressure, smc). The fluentbit logs collector
Clean method creates remove-dir-pod cleanup pods targeted at each node, but nodes with taints won't schedule these pods. The waitForPodAvailability
function then blocks for 15 minutes per unschedulable pod, and this happens sequentially per node.

Changes (5 files, +46/-19)

kubernetes_manager.go — waitForPodAvailability now:
- Respects context cancellation (was ignoring ctx.Done())
- Detects PodReasonUnschedulable and returns immediately instead of waiting 15 minutes
fluentbit_logs_collector_daemonset.go — Clean method now:
- Returns nil instead of error when zero pods found
- Makes WaitForPodTermination best-effort (warn, don't fail)
- Makes RemoveDirPathFromNode best-effort with 2-minute per-node timeout (skips tainted nodes)
- Makes waitForAtLeastOneActivePodManagedByDaemonSet best-effort
kubernetes_kurtosis_backend_enclave_functions.go — CleanLogsCollector and CleanLogsAggregator errors downgraded from fatal to best-effort
warnings
clean_logs_collector.go — Calls getLogsCollectorKubernetesResourcesForCluster directly, adds nil check for missing DaemonSet
shared_helpers.go — Two fixes:
- namespace.Namespace → namespace.Name (was always empty for k8s Namespace objects, causing cross-namespace service account lookups and "found 2"
errors)
- Zero pods case returns Stopped status with warning instead of error

Result

kurtosis clean -a completes in ~40 seconds even with tainted/unhealthy nodes, instead of hanging indefinitely.

…t is not found refactor(kubernetes): treat logs collector as stopped if no pods are found for daemonset

barnabasbusa added 2 commits February 25, 2026 10:36

refactor(kubernetes): simplify cleanLogsCollector to skip if daemonse…

88bb8cc

…t is not found refactor(kubernetes): treat logs collector as stopped if no pods are found for daemonset

fix

f645490

barnabasbusa requested a review from tedim52 February 25, 2026 13:26

barnabasbusa enabled auto-merge February 25, 2026 13:45

tedim52 approved these changes Feb 27, 2026

View reviewed changes

barnabasbusa added this pull request to the merge queue Feb 27, 2026

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Feb 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor(kubernetes): simplify cleanLogsCollector to skip if daemonset is not found#2915

refactor(kubernetes): simplify cleanLogsCollector to skip if daemonset is not found#2915
barnabasbusa wants to merge 2 commits intomainfrom
bbusa/fix-log-collector

barnabasbusa commented Feb 25, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

barnabasbusa commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

barnabasbusa commented Feb 25, 2026 •

edited

Loading