LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Alexander L. Belikoff (ABEL_at_[hidden])
Date: 2006-10-11 11:40:22


Jeff Squyres wrote:
>
> These error messages mean that processes 2-7 tried to do a receive from
> someone who they later found out were dead, so they aborted.
>
What would be a "standard" (that is, a portable) way for one of the peer
processes to get notified about such a death? For example, if one of
processes dies, I'd like the process of rank 0 to know it in order to
change the strategy.

Cheers,
-- Sasha