v1.10.1: mpirun does not warn about oversubscription #1009

@ammore1

When I run mpirun with more processes than cores under Open MPI v1.8.4, I get the following warning:

$ mpirun -V
mpirun (Open MPI) 1.8.4cisco1.0.1.367.sles12sp0
$ mpirun --host <HOSTNAME1,HOSTNAME2> -np 36 hello_c 
--------------------------------------------------------------------------
A request was made to bind to that would result in binding more
processes than cpus on a resource:

   Bind to:     NONE
   Node:        usnic3
   #processes:  2
   #cpus:       1

You can override this protection by adding the "overload-allowed"
option to your binding directive.
--------------------------------------------------------------------------

But when I run mpirun with more processes than cores under Open MPI v1.10.1, I do not get a warning:

$ mpirun -V
mpirun (Open MPI) 1.10.1cisco1.0.0.387.sles12sp0
$ mpirun --host <HOSTNAME1,HOSTNAME2> -np 36 hello_c 
Hello, world, I am 18 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 19 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 20 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 21 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 22 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 23 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 24 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 27 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 25 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 28 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 26 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 31 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 32 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 34 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 30 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 33 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 35 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 29 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 5 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 6 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 7 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 8 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 9 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 10 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 11 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 12 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 14 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 13 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 15 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 16 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 17 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 0 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 1 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 3 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 4 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)
Hello, world, I am 2 of 36, (Open MPI v1.10.1cisco1.0.0.387.sles12sp0, package: Cisco Open MPI, ident: 1.10.1cisco1.0.0.387, repo rev: git_ad777cc, Sat Sep 19 12:05:03 2015 PDT, 148)

Is this intended behaviour, or is it a regression?
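
To make the expectation concrete: by oversubscription I mean the -np value exceeding the total number of cores across the listed hosts. Below is a minimal sketch of that comparison. The helper name is_oversubscribed is hypothetical, and the 16 cores per host are placeholder values, not measurements from these machines. This only illustrates the condition; it is not how Open MPI itself makes the decision.

```shell
#!/bin/sh
# Hypothetical helper (not part of Open MPI): succeeds (exit status 0) when
# the requested process count exceeds the sum of the per-host core counts.
# Usage: is_oversubscribed <np> <cores_host1> [<cores_host2> ...]
is_oversubscribed() {
    np=$1
    shift
    total=0
    for cores in "$@"; do
        total=$((total + cores))
    done
    [ "$np" -gt "$total" ]
}

# 36 processes on two hosts; 16 cores per host is an assumed placeholder.
if is_oversubscribed 36 16 16; then
    echo "oversubscribed"
else
    echo "fits"
fi
```

With the placeholder counts above, 36 processes against 32 cores takes the "oversubscribed" branch, which is the situation where v1.8.4 printed the warning.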

This was found during Cisco QA testing.

@jsquyres @rhc54
