Skip to content

Commit 89d695f

Browse files
committed
suites: check for host thrasher
The last PR modified the suites to only check for host thrasher. This update fixes that issue by implementing different settings with dedicated YAML files for host thrashing Fixes: https://tracker.ceph.com/issues/66657 Signed-off-by: Nitzan Mordechai <[email protected]>
1 parent 66b42c3 commit 89d695f

File tree

7 files changed

+115
-3
lines changed

7 files changed

+115
-3
lines changed

qa/suites/rados/thrash/thrashers/careful.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,7 +21,6 @@ tasks:
2121
- thrashosds:
2222
timeout: 1200
2323
min_in: 2
24-
thrash_hosts: true
2524
chance_pgnum_grow: 1
2625
chance_pgnum_shrink: 1
2726
chance_pgpnum_fix: 1
Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,28 @@
1+
overrides:
2+
ceph:
3+
log-ignorelist:
4+
- but it is still running
5+
- objects unfound and apparently lost
6+
conf:
7+
osd:
8+
osd debug reject backfill probability: .3
9+
osd scrub min interval: 60
10+
osd scrub max interval: 120
11+
osd max backfills: 3
12+
osd snap trim sleep: 2
13+
mon:
14+
mon min osdmap epochs: 50
15+
paxos service trim min: 10
16+
# prune full osdmaps regularly
17+
mon osdmap full prune min: 15
18+
mon osdmap full prune interval: 2
19+
mon osdmap full prune txsize: 2
20+
tasks:
21+
- thrashosds:
22+
timeout: 1200
23+
min_in: 2
24+
thrash_hosts: true
25+
chance_pgnum_grow: 1
26+
chance_pgnum_shrink: 1
27+
chance_pgpnum_fix: 1
28+
aggressive_pg_num_changes: false

qa/suites/rados/thrash/thrashers/default.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,6 @@ tasks:
2222
- thrashosds:
2323
timeout: 1200
2424
min_in: 2
25-
thrash_hosts: true
2625
chance_pgnum_grow: 1
2726
chance_pgnum_shrink: 1
2827
chance_pgpnum_fix: 1
Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
overrides:
2+
ceph:
3+
log-ignorelist:
4+
- but it is still running
5+
- objects unfound and apparently lost
6+
conf:
7+
osd:
8+
osd debug reject backfill probability: .3
9+
osd scrub min interval: 60
10+
osd scrub max interval: 120
11+
osd max backfills: 3
12+
osd snap trim sleep: 2
13+
osd delete sleep: 1
14+
mon:
15+
mon min osdmap epochs: 50
16+
paxos service trim min: 10
17+
# prune full osdmaps regularly
18+
mon osdmap full prune min: 15
19+
mon osdmap full prune interval: 2
20+
mon osdmap full prune txsize: 2
21+
tasks:
22+
- thrashosds:
23+
timeout: 1200
24+
min_in: 2
25+
thrash_hosts: true
26+
chance_pgnum_grow: 1
27+
chance_pgnum_shrink: 1
28+
chance_pgpnum_fix: 1
29+
chance_bluestore_reshard: 1
30+
bluestore_new_sharding: random
Lines changed: 31 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,31 @@
1+
overrides:
2+
ceph:
3+
log-ignorelist:
4+
- but it is still running
5+
- objects unfound and apparently lost
6+
- osd_map_cache_size
7+
conf:
8+
mon:
9+
mon min osdmap epochs: 50
10+
paxos service trim min: 10
11+
# prune full osdmaps regularly
12+
mon osdmap full prune min: 15
13+
mon osdmap full prune interval: 2
14+
mon osdmap full prune txsize: 2
15+
osd:
16+
osd map cache size: 1
17+
osd scrub min interval: 60
18+
osd scrub max interval: 120
19+
osd scrub during recovery: false
20+
osd max backfills: 6
21+
osd beacon report interval: 30
22+
tasks:
23+
- thrashosds:
24+
timeout: 1800
25+
min_in: 2
26+
thrash_hosts: true
27+
chance_pgnum_grow: 0.25
28+
chance_pgnum_shrink: 0.25
29+
chance_pgpnum_fix: 0.25
30+
chance_test_map_discontinuity: 2
31+
map_discontinuity_sleep_time: 200

qa/suites/rados/thrash/thrashers/pggrow.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -21,6 +21,5 @@ tasks:
2121
- thrashosds:
2222
timeout: 1200
2323
min_in: 2
24-
thrash_hosts: true
2524
chance_pgnum_grow: 2
2625
chance_pgpnum_fix: 1
Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,26 @@
1+
overrides:
2+
ceph:
3+
log-ignorelist:
4+
- but it is still running
5+
- objects unfound and apparently lost
6+
conf:
7+
osd:
8+
osd scrub min interval: 60
9+
osd scrub max interval: 120
10+
filestore odsync write: true
11+
osd max backfills: 2
12+
osd snap trim sleep: .5
13+
mon:
14+
mon min osdmap epochs: 50
15+
paxos service trim min: 10
16+
# prune full osdmaps regularly
17+
mon osdmap full prune min: 15
18+
mon osdmap full prune interval: 2
19+
mon osdmap full prune txsize: 2
20+
tasks:
21+
- thrashosds:
22+
timeout: 1200
23+
min_in: 2
24+
thrash_hosts: true
25+
chance_pgnum_grow: 2
26+
chance_pgpnum_fix: 1

0 commit comments

Comments
 (0)