
LAM/MPI General User's Mailing List Archives


From: Swati Longia (swati_longia_at_[hidden])
Date: 2006-03-06 00:24:36


Hello all,

I have a demo Beowulf cluster consisting of just two machines, a master and
a slave. I have installed LAM 7.1.1 on it, exactly as specified in the
manual.

When I try to boot it with the following command, I get the error shown
below:
    lamboot -v lamhosts

LAM 7.1.1/MPI 2 C++/ROMIO - Indiana University

n-1<6141> ssi:boot:base:linear: booting n0 (slave)
ERROR: LAM/MPI unexpectedly received the following on stderr:
connect to address 10.1.22.165: Connection refused
connect to address 10.1.22.165: Connection refused
trying normal rsh (/usr/bin/rsh)

slave: Connection refused
-----------------------------------------------------------------------------
LAM failed to execute a process on the remote node "slave".
LAM was not trying to invoke any LAM-specific commands yet -- we were
simply trying to determine what shell was being used on the remote
host.

LAM tried to use the remote agent command "rsh"
to invoke "echo $SHELL" on the remote node.

*** PLEASE READ THIS ENTIRE MESSAGE, FOLLOW ITS SUGGESTIONS, AND
*** CONSULT THE "BOOTING LAM" SECTION OF THE LAM/MPI FAQ
*** (http://www.lam-mpi.org/faq/) BEFORE POSTING TO THE LAM/MPI USER'S
*** MAILING LIST.

This usually indicates an authentication problem with the remote
agent, some other configuration type of error in your .cshrc or
.profile file, or you were unable to executable a command on the
remote node for some other reason. The following is a list of items
that you should check on the remote node:

        - You have an account and can login to the remote machine
        - Incorrect permissions on your home directory (should
          probably be 0755)
        - Incorrect permissions on your $HOME/.rhosts file (if you are
          using rsh -- they should probably be 0644)
        - You have an entry in the remote $HOME/.rhosts file (if you
          are using rsh) for the machine and username that you are
          running from
        - Your .cshrc/.profile must not print anything out to the
          standard error
        - Your .cshrc/.profile should set a correct TERM type
        - Your .cshrc/.profile should set the SHELL environment
          variable to your default shell

Try invoking the following command at the unix command line:

        rsh slave -n 'echo $SHELL'

You will need to configure your local setup such that you will *not*
be prompted for a password to invoke this command on the remote node.
No output should be printed from the remote node before the output of
the command is displayed.

When you can get this command to execute successfully by hand, LAM
will probably be able to function properly.
-----------------------------------------------------------------------------
n-1<6141> ssi:boot:base:linear: Failed to boot n0 (slave)
n-1<6141> ssi:boot:base:linear: aborted!
lamboot did NOT complete successfully
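
The file-permission items in the checklist above could be checked on the
slave with a small script. This is only a minimal sketch: the check_mode
helper is a hypothetical name, it assumes GNU stat (stat -c '%a'), and the
expected modes are the ones the lamboot message itself suggests.

```shell
#!/bin/sh
# check_mode PATH EXPECTED
# Hypothetical helper: compares the octal permission bits of PATH with
# EXPECTED and prints a one-line verdict. Assumes GNU stat.
check_mode() {
    path=$1
    expected=$2
    if [ ! -e "$path" ]; then
        echo "MISSING $path"
        return 1
    fi
    actual=$(stat -c '%a' "$path")
    if [ "$actual" = "$expected" ]; then
        echo "OK $path ($actual)"
    else
        echo "WARN $path is $actual, expected $expected"
        return 1
    fi
}
```

On the slave it might be called as check_mode "$HOME" 755 and
check_mode "$HOME/.rhosts" 644. It only covers the permission items; the
.rhosts entry and the shell startup files still need to be checked by hand.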

I tried the following command:
        ssh slave -n 'echo $SHELL'
It gave the proper result without asking for a password, but when I try
the same with 'rsh', it hangs.
Can someone help me with this?

-- 
Thanks and regards,
Swati Longia
Oh, what a bitter thing it is to look into happiness through
another man's eyes. -  William Shakespeare