On Dec 19, 2005, at 9:32 AM, mouraud_anthony wrote:
> I've tried to use valgrind in order to debug my system.
> But the only errors i get are from mpi,
>
> then shall i ask you if there're some particular specifications to
> give in
> order to let valgrind signal the errors ?
>
> I'm still blocked on the same error :
>
> In some runs (not all) , one or more processes keeps blocked in
> probing, or in sending .... i though it was a buffer allocation
> error but it seem that it's not ...
Perhaps I misunderstood -- I thought you said you were seg faulting,
not blocking.
If you're simply blocking (i.e., MPI_SEND blocks, or you loop forever
over MPI_IPROBE and it never returns flag=true indicating that the
communication completed), then you may have a communication mismatch
-- where you're sending one thing and receiving another (e.g.,
sending on one communicator or with tag X, and you're receiving on
another communicator or with tag Y). Remember that MPI_SEND is
allowed to block if it wants to. LAM's MPI_SEND will block if the
message is "long" until a corresponding receive is posted on the
receiver.
Does that help?
--
{+} Jeff Squyres
{+} The Open MPI Project
{+} http://www.open-mpi.org/
|