LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Kumar, Ravi Ranjan (rrkuma0_at_[hidden])
Date: 2005-03-20 00:12:25


Hello,

I wrote a code in C++ using MPI. It works fine and gives correct result for
smaller 3D array size case for e.g. T[51][51][51]. However, my code hangs when
I try to run the same for larger size case i.e T[101][101][101] with an error
message as below:

MPI_Recv: message truncated (rank 0, MPI_COMM_WORLD)
Rank (0, MPI_COMM_WORLD): Call stack within LAM:
Rank (0, MPI_COMM_WORLD): - MPI_Recv()
Rank (0, MPI_COMM_WORLD): - main()
-----------------------------------------------------------------------------

One of the processes started by mpirun has exited with a nonzero exit
code. This typically indicates that the process finished in error.
If your process did not finish in error, be sure to include a "return
0" or "exit(0)" in your C code before exiting the application.

PID 5708 failed on node n0 with exit status 1.
-----------------------------------------------------------------------------

I read sometime ago that this may be due to mismatch in number of data sent and
number of data received in MPI_Send/MPI_Recv process. I have checked this thing
many times and found no mismatch in number of data exchanged, still I am
getting this error. What can be the reason for this? Could anyone please
explain?

Thanks a lot!
Ravi R. Kumar