- 
                Notifications
    You must be signed in to change notification settings 
- Fork 928
WeeklyTelcon_20220301
        Geoffrey Paulsen edited this page Mar 5, 2022 
        ·
        1 revision
      
    - Dialup Info: (Do not post to public mailing list or public wiki)
Not recorded.
- Geoffrey Paulsen (IBM)
- Austen Lauria (IBM)
- Jeff Squyres (Cisco)
- Brendan Cunningham (Cornelis Networks)
- Brian Barrett (AWS)
- Christoph Niethammer (HLRS)
- David Bernhold (ORNL)
- Hessam Mirsadeghi (UCX/nVidia)
- Howard Pritchard (LANL)
- Josh Hursey (IBM)
- Thomas Naughton (ORNL)
- Todd Kordenbrock (Sandia)
- Tomislav Janjusic (nVidia)
- William Zhang (AWS)
- Akshay Venkatesh (NVIDIA)
- Artem Polyakov (nVidia)
- Aurelien Bouteiller (UTK)
- Brandon Yates (Intel)
- Charles Shereda (LLNL)
- Edgar Gabriel (UoH)
- Erik Zeiske
- Geoffroy Vallee (ARM)
- George Bosilca (UTK)
- Harumi Kuno (HPE)
- Joseph Schuchart
- Joshua Ladd (nVidia)
- Marisa Roman (Cornelius)
- Mark Allen (IBM)
- Matias Cabral (Intel)
- Matthew Dosanjh (Sandia)
- Michael Heinz (Cornelis Networks)
- Nathan Hjelm (Google)
- Noah Evans (Sandia)
- Raghu Raja (AWS)
- Ralph Castain (Intel)
- Sam Gutierrez (LLNL)
- Scott Breyer (Sandia?)
- Shintaro iwasaki
- Xin Zhao (nVidia)
- 
Ralph (retired) is stepping back from Open MPI, and will be 100% PMIx and PRRTE. - He's no longer on our slack, or OMPI github repo.
- If you @rhc54in a comment / pull request he won't see, please use PMIx or PRRTE.
 
- 
New Big Payload test in public test suite. - Tries to do collectives up to close to MAX_INT.
- We fixes as much as we found and pushed fixes upstream.
- Josh also used Jeff Hammands's Big Count test suite.
- If there's a way to do exact same tests, but call through Fortran bindings instead, that'd be cool.
- Wouldn't expect this to turn up anything, since bugs haven't been MPI interface. Instead it's been underlying implementations that have been showing bugs.
 
- What do we want MPI_Count to be?
- MPI_Count already exists in Open MPI.  It's a size_t.- We have some MPIX_routines
 
- We have some 
- Size of MPI_Count is implementation dependent.
- Could make the interface size_t easily.
- But then there's the question of the max number.
 
 
- MPI_Count already exists in Open MPI.  It's a 
- Some of the memory pressure does NOT scale.
- Need a design discussion about what limits we have.
- Might be a good topic for a face2face meeting.
 
- We COULD throw an exception if the datatype * count is > some internal threshold.
- Might not be standards compliant, but better than crashing.
- IBM
 
 
- Some pushback about ABI breaking in v5.0
- Fortran mostly care about mpi.h.- Removed some of the deprecated functions because no one wanted to keep them.
- Because we need a stable ABI for ISV codes, otherwise no one will use v5.0 for years.
 
 
- Fortran mostly care about 
- One of the issues is libmpi interface is quite large.
- 3 part proposal
- Put deleted functions back in.
- Put Only MPI C bindings in libmpi, and everything else in another library.
- Split out Fortran
- useMPI and useF08 are "unrecoverable", and need to be fixed.
 
 
- ROMIO issue, because they call the MPI_API functions.
- Some possible ways to fix this.
 
- Shared libraries.
- Could keep cached versions of pre-built OMPIs, and run some tests with it.
 
- 
A question came up for implementations. - Threads that are syncronized.  Multiple threads are sending on same comm + tag.
- Should ordering be across all threads (global ordering)
- Concerns that a single atomic that is contended by one lock.
 
- Some are arguing the text should be ordered per thread.
 
- Should ordering be across all threads (global ordering)
- Don't actually need a global lock today (maybe, in practice probably do).
- From a hardware implementer's point of view per thread would be very expensive.
- Text is not very clear.
- For single threaded, ordering is very well defined.
- Could you make use of this or not?
 
- Threads that are syncronized.  Multiple threads are sending on same comm + tag.
- 
Two HWLOC issues - PRRTE/PMIx hwloc issue: https://github.com/openpmix/prrte/pull/1185 and https://github.com/openpmix/openpmix/pull/2445
- hwloc when built with CUDA support, is hard linking against it.
- This doesn't work in the common case where CUDA isn't installed on login nodes.
 
- hwloc v2.5 - v2.7.0 is putting variables in read-only memory into environ, but prrte is trying to modify these and segvs.
- PMIx and PRRTE has block-listed large hwloc versions 2.5-2.7.0
- putstr(env) is segv-ing.
 
- Discussions about minimizing mpirun/mpicc to only link against subset of opal.
- Makes things slightly better, but not really. Still have cuda on some nodes and not on others.
- Projected solution is to use hwloc plugins (dlopen cuda libs)
- A while back, hwloc changed default to NOT load components as plugins.
- He this this for Open MPI (some cyclic dependencies).
- This is no longer an issue for us.
 
- Now hwloc has reasonable defaults for some things build as plugins (dlopened at runtime).
- Usually customers install in local filesystems.
- This gets us around the dependencies.
- So whenever this is actually fixed, Jeff will write docs, and we can touch on points.
- From JOSH'es HWLOC PR, if there are any other suggestions or modifications, please put this on the hwloc PR.
 
- A while back, hwloc changed default to NOT load components as plugins.
 
- 
Resuming MTT development - send email - Doodle.
- Like to have a monthly call.
- Christopph Niethammer is interested.
- Might need a new cleanup mechanism when rolling out lots of versions.
 
- Find out who's using python client, and what problems.
- IU database plugin (what ends up getting data into MTT viewer) has a number of issues.
 
- Schedule: No schedule for v4.0.8 yet
- Winding down v4.0.x, and after v5.0.x will stop
- Really only want small changes reported by users.
- Otherwise, point users to v4.1.x release.
- Schedule: Shooting for v4.1.3 end of March/Q1.
- RC next weeks or so.
 
- No other update.
- Sessions, just one more thing on to-do list for TAG_UB for communicator created from Session create.
- Howard has a PR ready for this.
 
- Each of these PRs have link to original Sessions PR to help find.
- Will use PRRTE v2.1 for OMPI v5.0
- Includes ompi_schizo for prrterun for command line parsing.
- Changes for this will need to go to PRRTE and then updating in our submodule ptr.
 
- Docs, tremendous amount of work has happened.
- All automated transformations have happened for man-pages.
- A bit of work left to do for docs/configury.
- So once configury is ready, like to merge to master.
- git clone, and want to build html and man-pages, will need sphinx in env before configure.
- git clone without sphinx, just no man-pages or html docs.  Won't be able to make dist.
- requirement for pandoc is going away.
- tarball will include html and man-pages already, so won't need.
 
 
- libompitrace - removing for v5.0, as no one seems to be using / maintaining.
- Docs question
- When we publish docs for v5.0
- There will probably be one that will be at master/HEAD.
- We can also make docs available from a TAG (for release).
- Do we want a v5.0.x?
- If we're going to have latest/masteror whatever, we should NOT make many links to that.
 
- Do we want a 
 
- There will probably be one that will be at 
 
- When we publish docs for v5.0
- NO UPDATE THIS WEEK PR 9996 - bug with current cuda common code.
- Some user fixed a problem in smcuda.
- Ask tommi to
 
- Writing the API, and will try to port over code.
- ported this code to UTIL, to try to fix the bug, but been an ask to do a bit more.
- An accelerator framework,
- Need to figure out how we move forward here.  Moving it into util is not the right place.
- Don't need more things with knarly dependencies in util.
- this makes the mpicc problem worse.
 
 
- Don't need more things with knarly dependencies in util.
- William will take a stab at it, but if it's not a lot of work.
- four to six functions that datatype engine calls.
- Is accellerator?
- data movement functions.
- need to figure out memory hooks stuff.
 
- libfabric has this abstraction, so we could
- No new code, just moving things around.
 
- four to six functions that datatype engine calls.
 
- Some user fixed a problem in smcuda.
- No new Gnus
- A fix pending to workaround the IBM XL MTT build failure (compiler abort)
- Issue 9919 - Thinks this common component should still be built.
- Commons get built when it's likely their is a dependency.
- Commons self-select if they should be built or not.