On Tue, 2004-06-22 at 12:10, Thomas Lavergne wrote:
> Dear all,
> I am running into odd behaviour in my code which, from time to time,
> hangs on communication. It does so only when several processes are
> launched by mpirun on each node (we have dual-processor nodes here, so I
> try to use them with cpu=2 in the hosts_list). In order to
> reproduce/locate/understand this behaviour, I would like to access info
> concerning the LAM implementation of the blocking MPI_Send communication
> scheme. In particular, I would like to know how LAM reacts when many
> MPI_Sends (with possibly large messages) are called without any MPI_Recv
> to receive them. For example, are there several possible reactions,
> depending on the number and size of previously buffered messages?
> I have seen the FAQ and documentation pages but could only find that:
> "The LAM Team [...] will more than likely only be able to direct you to
> relevant parts of the LAM source code." So... where should I start, please?
Thomas,
Although this seems to be a question for lam-devel, I'll reply here.
You can find the implementation of MPI_Send at
lam-7.0.6/share/mpi/send.c and lam-7.0.6/share/mpi/lamsend.c, but the
real communication is done by the RPI module. The RPI modules can be
found at lam-7.0.6/share/ssi/rpi, if you want to take a closer look at
the low-level details.
Regards,
-- Ulisses