Hi,
I am using 3 computers running RHEL 3 and same version
of LAM that comes with RHEL 3. I have accounts on all
these 3 computers and have the same default shell
(bash2). I have set passwordless connection.
The name of the nodes (computers) are put in .rhosts
file in the order:
node1.xxx.xxx (Master node)
node2.xxx.xxx
node3.xxx.xxx
With "lamboot -v .rhosts" command form "node1.xxx.xxx"
I got following errors:
==================
LAM 6.5.9/MPI 2 C++ - Indiana University
Executing hboot on n0 (node1.xxx.xxx - 1 CPU)...
Executing hboot on n1 (node2.xxx.xxx - 1 CPU)...
-------------------------------------------------------
lamboot encountered some error (see above) during the
boot process,
and will now attempt to kill all nodes that it was
previously able to
boot (if any).
Please wait for LAM to finish; if you interrupt this
process, you may
have LAM daemons still running on remote nodes.
-------------------------------------------------------
wipe ...
LAM 6.5.9/MPI 2 C++ - Indiana University
Executing tkill on n0 (node1.xxx.xxx)...
Executing tkill on n1 (node2.xxx.xxx)...
===================
Actually, the problem is on "node2.xxx.xxx" because it
can run lamboot on "node3.xxx.xxx" if I do not put
node2.xxx.xxx.
It is not the prblem with LAM in "node2.xxx.xxx"
because lamboot can be run on "node2.xxx.xxx" from
"node2.xxx.xxx".
Also, it should not be the problem with my account
because I can connect "node2.xxx.xxx" from
"node1.xxx.xxx".
Could you please help me to solve this problem.
Thanks.
Manoj
__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com
|