Skip to content

Commit 518ff28

Browse files
committed
tuned: use tree instead of bruck at scale
The switch from tree to bruck between 512 and 1023 processes leads to unexpected latency changes in benchmarks of other collectives. We should be consistent here. There is no good reason for why bruck would perform better in that range but not beyond. Signed-off-by: Joseph Schuchart <[email protected]> (cherry picked from commit 9bd7757)
1 parent 052dca1 commit 518ff28

File tree

1 file changed

+1
-7
lines changed

1 file changed

+1
-7
lines changed

ompi/mca/coll/tuned/coll_tuned_decision_fixed.c

Lines changed: 1 addition & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -490,14 +490,8 @@ int ompi_coll_tuned_barrier_intra_dec_fixed(struct ompi_communicator_t *comm,
490490
alg = 3;
491491
} else if (communicator_size < 256) {
492492
alg = 4;
493-
} else if (communicator_size < 512) {
494-
alg = 6;
495-
} else if (communicator_size < 1024) {
496-
alg = 4;
497-
} else if (communicator_size < 4096) {
498-
alg = 6;
499493
} else {
500-
alg = 4;
494+
alg = 6;
501495
}
502496

503497
return ompi_coll_tuned_barrier_intra_do_this (comm, module,

0 commit comments

Comments
 (0)