LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Bogdan Costescu (bogdan.costescu_at_[hidden])
Date: 2004-02-05 06:47:21


[ Pressed the wrong key and the message got sent before being finished and
with some spelling mistakes... ]

On Thu, 5 Feb 2004, Sergei Lisenkov wrote:

> LAM internal GM send: gmID=3 'kappa2' send failed to complete (see kernel log for details): send timed out

That is exactly the error message that I metioned in a previous e-mail
about 2 week ago, also when running with Myrinet. Jeff Squyres said that
yet another person has seen the same message and that there might be some
problem in LAM-MPI.

> LAM internal GM send: gmID=7 'kappa5' send failed to complete (see kernel log for details): send timed out

... but you get this message from all hosts. I only got it from one host
and in all cases that I remember, it was n1 when running on 2 nodes or n2
when running one 3 or more nodes (and I tried on different nodes to rule
out hardware problems).

> After lamboot, I run my code:
> mpirun -np 13 ./test.x input > output &

I usually add "-v" and "-O" (letter o, not zero), which might not be
needed nowadays, but I got used to it.

-- 
Bogdan Costescu
IWR - Interdisziplinaeres Zentrum fuer Wissenschaftliches Rechnen
Universitaet Heidelberg, INF 368, D-69120 Heidelberg, GERMANY
Telephone: +49 6221 54 8869, Telefax: +49 6221 54 8868
E-mail: Bogdan.Costescu_at_[hidden]