Skip to content

Commit 58f120b

Browse files
authored
feat(argo-workflows): Support failedPodRestart to controller (#3656)
1 parent a8cc0d1 commit 58f120b

File tree

4 files changed

+15
-2
lines changed

4 files changed

+15
-2
lines changed

charts/argo-workflows/Chart.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ appVersion: v4.0.2
33
name: argo-workflows
44
description: A Helm chart for Argo Workflows
55
type: application
6-
version: 1.0.2
6+
version: 1.0.3
77
icon: https://argo-workflows.readthedocs.io/en/stable/assets/logo.png
88
home: https://github.com/argoproj/argo-helm
99
sources:
@@ -17,4 +17,4 @@ annotations:
1717
url: https://argoproj.github.io/argo-helm/pgp_keys.asc
1818
artifacthub.io/changes: |
1919
- kind: added
20-
description: Add extraEnv to CRD install Job for proxy configuration
20+
description: Support failedPodRestart to controller.

charts/argo-workflows/README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -265,6 +265,8 @@ Fields to note:
265265
| controller.extraContainers | list | `[]` | Extra containers to be added to the controller deployment |
266266
| controller.extraEnv | list | `[]` | Extra environment variables to provide to the controller container |
267267
| controller.extraInitContainers | list | `[]` | Enables init containers to be added to the controller deployment |
268+
| controller.failedPodRestart.enabled | bool | `false` | Enable to restart of pods that fail before entering Running state. |
269+
| controller.failedPodRestart.maxRestarts | int | `3` | Maximum number of automatic restarts per node before giving up. |
268270
| controller.image.registry | string | `"quay.io"` | Registry to use for the controller |
269271
| controller.image.repository | string | `"argoproj/workflow-controller"` | Registry to use for the controller |
270272
| controller.image.tag | string | `""` | Image tag for the workflow controller. Defaults to `.Values.images.tag`. |

charts/argo-workflows/templates/controller/workflow-controller-config-map.yaml

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -218,4 +218,7 @@ data:
218218
{{- with .Values.controller.podGCDeleteDelayDuration }}
219219
podGCDeleteDelayDuration: {{ . }}
220220
{{- end }}
221+
{{- with .Values.controller.failedPodRestart }}
222+
failedPodRestart: {{- toYaml . | nindent 6 }}
223+
{{- end }}
221224
{{- end }}

charts/argo-workflows/values.yaml

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -506,6 +506,14 @@ controller:
506506
# name: argo-mysql-config
507507
# key: password
508508

509+
failedPodRestart:
510+
# -- Enable to restart of pods that fail before entering Running state.
511+
## This is useful for recovering from transient infrastructure issues like node eviction due to DiskPressure or MemoryPressure without requiring a retryStrategy on every template.
512+
## ref: https://argo-workflows.readthedocs.io/en/latest/pod-restarts/
513+
enabled: false
514+
# -- Maximum number of automatic restarts per node before giving up.
515+
maxRestarts: 3
516+
509517
# mainContainer adds default config for main container that could be overriden in workflows template
510518
mainContainer:
511519
# -- imagePullPolicy to apply to Workflow main container. Defaults to `.Values.images.pullPolicy`.

0 commit comments

Comments
 (0)