-
Notifications
You must be signed in to change notification settings - Fork 861
WeeklyTelcon_20180403
Geoffrey Paulsen edited this page Jan 15, 2019
·
1 revision
- Dialup Info: (Do not post to public mailing list or public wiki)
- Geoff Paulsen
- Akvenkatesh
- Artem Polyakov
- David Bernholdt
- Edgar Gabriel
- Howard
- Josh Hursey
- Nathan Hjelm
- Todd Kordenbrock
- Xin Zhao
Review All Open Blockers
Review v2.x Milestones v2.1.3
- v2.1.4 - Targeting Oct 15th,
- Merged in a bunch of stuff.
- One-sided multithreaded bugs that came up.
- Doesn't feel like it's worth it to fix in v2.1.x, so instead pulled configurey changes from v2.0 to v2.1.x
- No new news on v2.1.x
Review v3.0.x Milestones v3.0.2
- v3.0.1 went out the door.
- Oops, Did not get PMIx Compatibility pieces in embedded PMIx
- v3.0.2 open for bugfixes. Probably a quick turnaround on this.
- Will pre-emptively fix PMIx compatibility pieces to pickup PMIx v1.2.5 clients.
- This will bring in PMIx compatibility with OMPI client (mpirun/orted/libmpi) from OMPI v2.1.3
- memkind disable needs to get into v3.0.2
- Josh Ladd should review 4398
Review v3.1.x Milestones v3.1.0
- Brian not here today.
- PR4977 caused corruption. Nathan PRed last week.
- fixed in v3.0.1 and v3.1.0
- Cisco is seeing a number of Spawn issues on v3.1.x in MTT.
- Most of these were oversubscription issues, needed ini changes
- Still seeing another 100 or so more failures, he expects the same. Possibly new regressions.
- Jeff still needs to look at.
Review Master Master Pull Requests
- All 32bit builds failed in CI for all PRs over the weekend. Brian fixed it yesterday.
- All 32bit builds failed in CI for all PRs over the weekend. Brian fixed it yesterday.
- Ending Open MPI Mirrors program
- Used to have website would be "mirror friendly" so others could host it around the world.
- Now, Internet connectivity and bandwidth is much better.
- Moving behind Amazon's Cloud front performs this for our mirror clouds.
- sending out the "So Long and thanks for all the Fish" message to Mirrors.
- nightly and release will be moved to download.open-mpi.org (From Amazon's S3) Fast!
- No longer version controlled.
- Review Open MPI / PMIx embedding / SLURM - client & server version compatibilities.
- Reviewed a google doc spreadsheet, Jeff shared. Sent out in email on discussion list.
- Artem commented on some SLURM compatibilities. SLURM 16.05 - support PMIx v1.x SLURM 17.11 - SLURM can be configured with either PMIx v1.x or 2.x SLURM does not imbed PMIx, must configure against.
- Implications for OpenMPI
- When you have PMIx client v1.2.3 with server v1.2.3 works. (all testing with itself works)
- This graph is coming from a PMIx client / server standpoint, and describes
- Wasn't there some blanket cross-version support statements?
- v1.2.5, v2.0.3, v2.1.1, v3.0.0
- How is PMIx dstore represented in this graph? ORTE MCA parameter needed for client/server missmatch
- There is a 3rd chart to describe what testing should be done.
- This chart does not describe configuring with external PMIx, and compatibility.
- Containers and externals are different, to be discussed later.
- Need to figure out how to discuss this with Users.
- Perhaps discussing compatibilities between user's tools (Orte / slurm / mpirun / Debuggers / etc)
- one of the things good about PMI v1 or v2, is that their interface stayed the same for years.
- Well, also PMIx supporting multiple "levels" the message is no longer "use PMI v1/v2 everywhere... there are various levels of support / compatibility everywhere.
- IBM CI is back up
- Cisco and IBM MTT didn't trigger last night.
Review Master MTT testing
- Mellanox, Sandia, Intel
- LANL, Houston, IBM, Fujitsu
- Amazon,
- Cisco, ORNL, UTK, NVIDIA