LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Brian Barrett (brbarret_at_[hidden])
Date: 2007-07-21 16:58:12


Hi all -

Just wanted to let you know that I posted LAM/MPI 7.1.4b4 this
afternoon. The only change since b3 is the addition of a work around
to allow LAM to work under the BProc Job Scheduler (BJS). LAM still
does not properly read the allocation information from BJS, due to
some issues with getting number of nodes. But it will now prevent
the lamds from being killed when lamboot exits by forking a "starter
lamboot" behind the scenes that lives until the lam daemons have all
exited.

The complete list of changes since LAM/MPI 7.1.3 is:

- Work around some batch schedulers (BJS, LANL's BProc + MOAB)
   from killing the lamds when lamboot exits by keeping a child
   of lamboot around for the life of the lamds.
- Properly escape SSI parameters and pass SSI parameters to
   lamboot when using mpiexec. Also use /tmp or $TMPDIR for
   the app schema. Thanks to Sam Steingold for bringing this
   to our attention.
- Allow user to disable building the TM or SLURM boot ssi
   module, even if the libraries are available on the system.
   Thanks to Jens Klostermann for bringing this to our
   attention.
- Fix compile issue on NetBSD 3.0 and later. Thanks to Aleksey
   Cheusov for the patch.
- Properly handle slurm clusters where all nodes do not have the
   same prefix in a hostname. Thanks to Moe Jette for the patch.

I'm planning on releasing this as the stable 7.1.4 release on Monday,
unless someone finds a regression from 7.1.3 before then.

Thanks,

Brian

-- 
   Brian Barrett
   LAM/MPI Developer
   Make today a LAM/MPI day!