-
Notifications
You must be signed in to change notification settings - Fork 239
Open
Description
From @mikibonacci and @giovannipizzi in #7069 (comment)
With @giovannipizzi, we were discussing of adding the pause feature also for handled errors that reached the maximum of the allowed restarts (
max_iterations). This because it can happen that we just hit several times the same exit code for the CalcJobs, but actually the handling is not effective (for whatever reason) and so a pause can help and let the user decide how to proceed.
For example, I was running a muon calculation, and the accuracy of the scf was never decreasing (due to a sort of issue in QE). The walltime was not enough and the WorkChain restarted 5 times without solving the issue - the walltime was not the real issue.
Something like
on_handled_but_exceeded_failure?
Metadata
Metadata
Assignees
Labels
No labels
Type
Projects
Status
No status