LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Jeff Squyres (jsquyres_at_[hidden])
Date: 2005-08-30 05:54:49


This is not enough information to accurately figure out what is going
on -- there must be something else going on in your application such
that changing the for to a while changes some logic somewhere such that
someone is [probably] issuing an MPI_Recv after others have already
exited.

On Aug 29, 2005, at 11:26 AM, Oren Shapira wrote:

> Hello!
>
> I have an iterative algorithm on master that sends the data to the
> slaves in the beginning of every iteration, the slaves return the
> results before the master starts the new iteration.
> Currently the code goes as follows:
> for (iter=1;iter<NumIter; iter++)
> {
> if (myid==0)
> //masters part
> if (myid!=0)
> //slaves part
> }
> This code works good, but if I change "for" loop to the "while" loop,
> the algorithm fails. The error msg is following:
>
> MPI_Recv: process in local group is dead (rank 1, MPI_COMM_WORLD)
> Rank (1, MPI_COMM_WORLD): Call stack within LAM:
> Rank (1, MPI_COMM_WORLD): - MPI_Recv()
> Rank (1, MPI_COMM_WORLD): - main()
> ..........................................
> .........................................
> .............
> -----------------------------------------------------------------------
> ------
> One of the processes started by mpirun has exited with a nonzero exit
> code. This typically indicates that the process finished in error.
> If your process did not finish in error, be sure to include a "return
> 0" or "exit(0)" in your C code before exiting the application.
>
> PID 4224 failed on node n2 (10.132.101.3) with exit status 1.
>
> Do you have any idea what it can be?
>
> -----------------------------------------------------------------
> Oren Shapira,
> Research Assistant
> Multimedia and Intelligent Systems Research Team
> Industrial and Management Engineering Department
> Ben Gurion University of the Negev,
> Beer-Sheva, 84105, Israel.
> Office: 972-8-6472240
> Fax: 972-8-6472958
> ------------------------------------------------------------------
>
> _______________________________________________
> This list is archived at http://www.lam-mpi.org/MailArchives/lam/
>

-- 
{+} Jeff Squyres
{+} jsquyres_at_[hidden]
{+} http://www.lam-mpi.org/