
Commit 242d80f

test: addressing flaky spawn "process 0 terminated with signal SIGABRT" (#20933)
1 parent 489bbf7 commit 242d80f

3 files changed: +4 -2 lines changed

.github/workflows/ci-tests-fabric.yml

Lines changed: 1 addition & 1 deletion
@@ -167,7 +167,7 @@ jobs:
         run: |
           echo $GITHUB_RUN_ID
           python -m coverage run --source ${{ env.COVERAGE_SCOPE }} \
-            -m pytest -v --timeout=30 --durations=50 --random-order-seed=$GITHUB_RUN_ID \
+            -m pytest -v --timeout=60 --durations=50 --random-order-seed=$GITHUB_RUN_ID \
             --junitxml=junit.xml -o junit_family=legacy # NOTE: for Codecov's test results
 
       - name: Statistics

.github/workflows/ci-tests-pytorch.yml

Lines changed: 1 addition & 1 deletion
@@ -196,7 +196,7 @@ jobs:
         run: |
           echo $GITHUB_RUN_ID
           python -m coverage run --source ${{ env.COVERAGE_SCOPE }} \
-            -m pytest . -v --timeout=60 --durations=50 --random-order-seed=$GITHUB_RUN_ID \
+            -m pytest . -v --timeout=90 --durations=50 --random-order-seed=$GITHUB_RUN_ID \
             --junitxml=junit.xml -o junit_family=legacy # NOTE: for Codecov's test results
 
       - name: Statistics

tests/tests_pytorch/callbacks/test_stochastic_weight_avg.py

Lines changed: 2 additions & 0 deletions
@@ -354,6 +354,8 @@ def test_swa_resume_training_from_checkpoint_custom_scheduler(tmp_path, crash_on
 
 
 @RunIf(skip_windows=True)
+# flaky with "process 0 terminated with signal SIGABRT" (GLOO)
+@pytest.mark.flaky(reruns=3, only_rerun="torch.multiprocessing.spawn.ProcessExitedException")
 def test_swa_resume_training_from_checkpoint_ddp(tmp_path):
     model = SwaTestModel(crash_on_epoch=3)
     resume_model = SwaTestModel()
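
Note on the `only_rerun` filter added above: the `flaky` marker with `reruns` and `only_rerun` is presumably provided by the pytest-rerunfailures plugin, which treats `only_rerun` as a regular expression searched for in the failure text. The sketch below is a simplified model of that decision, not the plugin's actual code; `should_rerun` and the two sample failure strings are hypothetical names invented for illustration.

```python
import re


def should_rerun(failure_text, only_rerun):
    """Hypothetical helper: True if any pattern is found in the failure text."""
    # Accept either a single pattern or a list of patterns.
    patterns = [only_rerun] if isinstance(only_rerun, str) else list(only_rerun)
    # re.search matches anywhere in the text; the dots in the pattern are
    # regex wildcards, which still match the literal dots of the class path.
    return any(re.search(pattern, failure_text) for pattern in patterns)


# Pattern copied from the decorator in the diff above.
PATTERN = "torch.multiprocessing.spawn.ProcessExitedException"

# Illustrative failure messages (assumed shapes, not captured CI logs).
spawn_failure = (
    "torch.multiprocessing.spawn.ProcessExitedException: "
    "process 0 terminated with signal SIGABRT"
)
assertion_failure = "AssertionError: assert 1 == 2"

print(should_rerun(spawn_failure, PATTERN))
print(should_rerun(assertion_failure, PATTERN))
```

Under this model only the spawn crash qualifies for up to three reruns, while an ordinary assertion failure in the same test would fail immediately, so the marker should not hide genuine regressions.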
