@@ -7,34 +7,44 @@ stateDiagram-v2
     [*] --> uninitialized
     uninitialized --> ready: initialize_jobs
     uninitialized --> blocked: has dependencies
+    uninitialized --> disabled: job disabled
 
     blocked --> ready: dependencies met
     ready --> pending: runner claims
     pending --> running: execution starts
 
     running --> completed: exit 0
-    running --> failed: exit != 0
+    running --> failed: exit != 0 (handler match + max retries)
+    running --> pending_failed: exit != 0 (no handler match)
+    running --> ready: exit != 0 (failure handler retry)
     running --> canceled: user cancels
     running --> terminated: system terminates
 
+    pending_failed --> failed: AI classifies as permanent
+    pending_failed --> ready: AI classifies as transient
+    pending_failed --> uninitialized: reset-status
+
     completed --> [*]
     failed --> [*]
     canceled --> [*]
     terminated --> [*]
+    disabled --> [*]
 
     classDef waiting fill:#6c757d,color:#fff
     classDef ready fill:#17a2b8,color:#fff
     classDef active fill:#ffc107,color:#000
     classDef success fill:#28a745,color:#fff
     classDef error fill:#dc3545,color:#fff
     classDef stopped fill:#6f42c1,color:#fff
+    classDef classification fill:#fd7e14,color:#fff
 
     class uninitialized,blocked waiting
     class ready ready
     class pending,running active
     class completed success
     class failed error
-    class canceled,terminated stopped
+    class canceled,terminated,disabled stopped
+    class pending_failed classification
 ```
 
 ## State Descriptions
@@ -48,3 +58,7 @@ stateDiagram-v2
 - **failed** (6) - Finished with error (exit code != 0)
 - **canceled** (7) - Explicitly canceled by user or torc. Never executed.
 - **terminated** (8) - Explicitly terminated by system, such as at wall-time timeout
+- **disabled** (9) - Job is disabled and will not run
+- **pending_failed** (10) - Job failed without a matching failure handler. Awaiting AI-assisted
+  classification to determine if the error is transient (retry) or permanent (fail). See
+  [AI-Assisted Recovery](../specialized/fault-tolerance/ai-assisted-recovery.md).
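
For review purposes, the updated diagram can also be read as a transition table. The block below is a minimal Python sketch transcribed by hand from the new side of this diff. It is not taken from torc's code base: the names `TRANSITIONS`, `can_transition`, and `is_terminal` are hypothetical, and the table only reflects the arrows drawn in the diagram, not any guards or side effects the real implementation may have.

```python
# Hypothetical transition table, transcribed from the updated state diagram.
# Keys are source states; values are the states reachable in one step.
# This is an illustration for reviewers, not torc's actual implementation.
TRANSITIONS = {
    "uninitialized": {"ready", "blocked", "disabled"},
    "blocked": {"ready"},
    "ready": {"pending"},
    "pending": {"running"},
    "running": {
        "completed",       # exit 0
        "failed",          # exit != 0, handler match + max retries
        "pending_failed",  # exit != 0, no handler match
        "ready",           # exit != 0, failure handler retry
        "canceled",        # user cancels
        "terminated",      # system terminates
    },
    "pending_failed": {
        "failed",          # classified as permanent
        "ready",           # classified as transient
        "uninitialized",   # reset-status
    },
    # States that lead only to the diagram's end marker.
    "completed": set(),
    "failed": set(),
    "canceled": set(),
    "terminated": set(),
    "disabled": set(),
}


def can_transition(src: str, dst: str) -> bool:
    """Return True if the diagram draws an arrow from src to dst."""
    return dst in TRANSITIONS.get(src, set())


def is_terminal(state: str) -> bool:
    """Return True if the state is known and has no outgoing arrows."""
    return state in TRANSITIONS and not TRANSITIONS[state]
```

A table like this makes the new behavior easy to check: `running` now has three distinct non-zero-exit targets, and `pending_failed` is the only new state that is not terminal.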