LAM/MPI General User's Mailing List Archives

From: Madhurjya P. Bora (mpbora_at_[hidden])
Date: 2005-06-21 22:02:11


Hi All!

I have lam-7.0.3 running successfully on my standalone Fedora Core 2
machine (localhost), which I use for testing. This lam-7.0.3 was
installed from the RPM that came with the system.

When I built lam-7.1.1 from the .tar.bz2 package for the
Lahey-Fujitsu Fortran 95 compiler, the build completed successfully. But
lamboot from the newly built package fails with a complaint about TCP
random ports. However, recon succeeds!

The only configure option I used was the install prefix, i.e. ./configure
-prefix=/usr/local/lam/lf95.
The old lam still boots! I'm using SSH-2. Kindly help!

The error message I get is:
-----------------------------------------------------------------------------
The lamboot agent failed to read a message over a socket from the
newly-booted process. This should not happen (especially since TCP is
a guaranteed protocol).

*** PLEASE READ THIS ENTIRE MESSAGE, FOLLOW ITS SUGGESTIONS, AND
*** CONSULT THE "BOOTING LAM" SECTION OF THE LAM/MPI FAQ
*** (http://www.lam-mpi.org/faq/) BEFORE POSTING TO THE LAM/MPI USER'S
*** MAILING LIST.

You should probably check the following:

- Network connectivity: Ensure that messages can be passed reliably
  over TCP using random ports.
- Environment / PATH settings: Ensure that you are running the same
  version of LAM/MPI on all nodes. Sometimes premature disconnects
  (and therefore this error message) may be caused if mismatched
  versions of LAM are used on different nodes.
- Node health: Ensure that the host where the newly-booted process was
  launched is healthy and still available on the network.
-----------------------------------------------------------------------------
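
The first two checks in that message can be tried on a single machine with
a rough Python sketch like the one below. It is not part of LAM/MPI; the
function names (resolve_tools, mixed_installs, loopback_echo) are made up
for illustration, and the only assumption about LAM is that lamboot, recon,
hboot and lamd are the executables of interest. It reports which directory
each of them resolves to on PATH (two different directories would suggest
the old and new installs are being mixed), and it passes a few bytes over
TCP on a random loopback port.

```python
import os
import shutil
import socket
import threading

# LAM executables to look up; which ones matter is an assumption.
LAM_TOOLS = ("lamboot", "recon", "hboot", "lamd")


def resolve_tools(tools=LAM_TOOLS, path=None):
    """Map each tool name to the directory it resolves to on PATH, or None."""
    found = {}
    for name in tools:
        exe = shutil.which(name, path=path)
        found[name] = os.path.dirname(os.path.realpath(exe)) if exe else None
    return found


def mixed_installs(found):
    """True if the tools that were found live in more than one directory."""
    dirs = {d for d in found.values() if d is not None}
    return len(dirs) > 1


def loopback_echo(payload=b"ping", timeout=5.0):
    """Echo payload over TCP on an OS-chosen loopback port.

    Returns (port, ok), where ok is True if the reply matched the payload.
    """
    server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    server.bind(("127.0.0.1", 0))  # port 0 lets the OS pick a free port
    server.listen(1)
    server.settimeout(timeout)
    port = server.getsockname()[1]

    def serve():
        conn, _ = server.accept()
        with conn:
            conn.settimeout(timeout)
            conn.sendall(conn.recv(len(payload)))

    worker = threading.Thread(target=serve, daemon=True)
    worker.start()
    with socket.create_connection(("127.0.0.1", port), timeout=timeout) as client:
        client.sendall(payload)
        reply = client.recv(len(payload))
    worker.join(timeout)
    server.close()
    return port, reply == payload


if __name__ == "__main__":
    found = resolve_tools()
    for name, directory in found.items():
        print(f"{name}: {directory or 'not found on PATH'}")
    if mixed_installs(found):
        print("LAM tools resolve to more than one directory")
    port, ok = loopback_echo()
    print(f"TCP echo on random port {port}: {'ok' if ok else 'FAILED'}")
```

This only exercises the loopback interface and the PATH of the current
shell; a non-interactive SSH login may see a different PATH, so the same
lookup would have to be repeated there.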

Thanks in advance,

Madhurjya P. Bora