LAM/MPI General User's Mailing List Archives

From: Castle, Lindsay (lindsay.castle_at_[hidden])
Date: 2003-09-30 21:46:10


Hi all,

I'm new to LAM and am trying to get it running on a RedHat 9 system. I'm
using the standard rsh lamboot method at the moment.

I've upgraded the RPM version of the product to 7.0.2.

Can anyone point out what I'm obviously missing in the problem below? :-)

The problem:

rsh runs a command on the remote host fine, with no password prompt.
recon works fine, with its "woo hoo!" output.
lamboot -d looks like this:

<snip>
hboot: booting...
hboot: fork /usr/bin/lamd
hboot: attempting to execute
lamd kernel: problem with bind(): Address already in use
[1] 1927 lamd -H 134.251.222.200 -P 33502 -n 0 -o 0 -d
n0<1924> ssi:boot:rsh: successfully launched on n0 (local)
n0<1924> ssi:boot:base:server: expecting connection from finite list
n0<1924> ssi:boot:base:server: got connection from 155.163.5.8

<snip>

-----------------------------------------------------------------------------
n0<1924> ssi:boot:base:server: failed to connect to remote lamd!
n0<1924> ssi:boot:base:server: closing server socket
n0<1924> ssi:boot:base:linear: aborted!
-----------------------------------------------------------------------------
lamboot encountered some error (see above) during the boot process,
and will now attempt to kill all nodes that it was previously able to
boot (if any).

Please wait for LAM to finish; if you interrupt this process, you may
have LAM daemons still running on remote nodes.
-----------------------------------------------------------------------------
lamboot: wipe -- nothing to do
lamboot did NOT complete successfully
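
For reference, the "problem with bind(): Address already in use" line in the log reads like the generic EADDRINUSE error that the operating system returns when something tries to bind an address that is already held. I don't know which socket lamd is binding, so the following is only a minimal Python sketch of how that errno arises in general, not LAM code; the function name second_bind_errno is made up for illustration and it uses a local TCP socket purely as an example.

```python
import errno
import socket


def second_bind_errno() -> int:
    """Bind two TCP sockets to the same address.

    Returns the errno raised by the second bind(), or 0 if it succeeded.
    Hypothetical helper for illustration only; it has nothing to do with
    how lamd itself sets up its sockets.
    """
    first = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    second = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        # Port 0 asks the OS for any free port on the loopback interface.
        first.bind(("127.0.0.1", 0))
        first.listen(1)
        addr = first.getsockname()
        try:
            # Same address and port as the first socket, which is still open.
            second.bind(addr)
        except OSError as exc:
            return exc.errno
        return 0
    finally:
        first.close()
        second.close()


if __name__ == "__main__":
    code = second_bind_errno()
    # Print the symbolic name of whatever errno came back.
    print(errno.errorcode.get(code, code))
```

If the same thing is happening here, it would suggest that something on n0 is already holding the address lamd wants, but that is a guess from the error text alone.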

Thanks for your time.

Linz