LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Gurgul, Dennis J. (DGURGUL_at_[hidden])
Date: 2003-07-08 14:18:23


I have a 5 node cluster with OSCAR 2.2 and Lam 6.5.9. All 4 internal nodes
are identical. But, while 2 of them will work, the other two will not. If
I comment the two that fail out of my lamhosts file, lamboot starts OK. But
if I put thost two in, lamboot fails with no real indication of why.
Etc/hosts has only localhost.localdomain and localhost in the 127.0.0.1
field. Recon says everything's fine. "lamboot -vd lamhosts" looks exactly
the same (except for process ID) if lamboot starts successfully or if it
fails.

The last line in the output before the error message (lamboot encountered
some error.....) is:

topology n3...

The error message says "(see above", however, there is nothing to indicate
what went wrong.

Thanks.

Dennis Gurgul
Massachusetts General Hospital
Research Management
617.724.3169
dgurgul_at_[hidden]