LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: bcruchet_at_[hidden]
Date: 2005-02-22 14:34:04


In some cases this happen when you try write (MPI_Send) when there aren't
anybody to receive (MPI_Recv).

in other words you probably is send data to a node that not is waiting for
recieve

-------------------------------------------------
 Boris Cruchet C.
 http://boris.guliv.cl
 http://guliv.cl
-------------------------------------------------

> Hello,
>
> Could anyone explain why I am getting errors like:
>
> MPI_Recv: process in local group is dead (rank 1, MPI_COMM_WORLD)
> Rank (1, MPI_COMM_WORLD): Call stack within LAM:
> Rank (1, MPI_COMM_WORLD): - MPI_Recv()
> Rank (1, MPI_COMM_WORLD): - main()
> ---------------------------------------------------------------------
>
> One of the processes started by mpirun has exited with a nonzero exit
> code. This typically indicates that the process finished in error.
> If your process did not finish in error, be sure to include a "return
> 0" or "exit(0)" in your C code before exiting the application.
>
> PID 22326 failed on node n0 with exit status 1.
> -------------------------------------------------------------------------
>
> I am attaching part of my subroutine which calls MPI_Send/MPI_Recv.
>
> here is the part of my code:
>
> void exchange_interface_data(int rank, int local_Nz, int comm_tag)
> {
>
> MPI_Status status;
>
> if(rank==0)
> MPI_Send(&T[local_Nz-1][0]
> [0],Nx*Ny,MPI_DOUBLE,rank+1,comm_tag,MPI_COMM_WORLD);
>
> if(rank==1)
> MPI_Recv(&T[rows_per_process*rank-1][0][0],Nx*Ny,MPI_DOUBLE,rank-
> 1,comm_tag,MPI_COMM_WORLD,&status);
>
> }
>
> Pls help me out.
>
> Thanks!
> Ravi
>
> _______________________________________________
> This list is archived at http://www.lam-mpi.org/MailArchives/lam/
>