Hi,
 
I have set up a 5-node cluster (1 master, 4 slaves) using OSCAR 2.2.  After installation, I logged on to the head node, ran recon and lamboot, and tested a simple MPI program successfully.  After switching the machines off and restarting them, whenever I run recon or lamboot on the head node, I am asked for the password of the slave node.  Below are the messages:
 
lamboot -v lamhosts4
LAM 6.5.9/MPI 2 C++/ROM - Indiana University
Executing hboot on n0 (lilacnode1.lilac.com -1 CPU) ....
Password:
permission denied, please try again
(This message appears when I just press Enter. I would have thought that OSCAR would have set up passwordless login for the cluster, so I still don't know why it asks for a password.)
 
-----------
I also ran recon on another occasion:
recon -v lamhosts4
recon: --testing n0 (lilacnode1.lilac.com)
password:
When I gave the correct password, the following message appeared:
"Could not chdir to home directory /home/chiam; no such file or directory "
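
In case it helps narrow things down, here is a rough sketch of the checks I can run from the head node. It assumes that LAM reaches the nodes over ssh (the "Could not chdir" message looks like it comes from the ssh daemon) and that my home directory is supposed to be at the same path on every node. The function names list_hosts, check_node and check_cluster are my own and not part of LAM or OSCAR.

```shell
#!/bin/sh
# Sketch only: list_hosts, check_node and check_cluster are hypothetical
# helper names, not LAM or OSCAR commands.

# Print the host names from a LAM boot schema file such as lamhosts4:
# drop comments and blank lines, keep the first field of each line
# (this ignores trailing options such as cpu=2).
list_hosts() {
    sed -e 's/#.*//' "$1" | awk 'NF { print $1 }'
}

# Check a single node from the head node.
check_node() {
    host=$1

    # BatchMode=yes makes ssh fail instead of prompting for a password,
    # so a failure here means passwordless login is not working
    # (or the node is unreachable).
    if ssh -o BatchMode=yes -o ConnectTimeout=5 "$host" true 2>/dev/null; then
        echo "$host: passwordless login OK"
    else
        echo "$host: passwordless login FAILED"
    fi

    # $HOME is expanded here on the head node; this assumes the home
    # directory should exist at the same path on the slave node.
    if ssh -o BatchMode=yes -o ConnectTimeout=5 "$host" "test -d '$HOME'" 2>/dev/null; then
        echo "$host: home directory $HOME present"
    else
        echo "$host: home directory $HOME missing or node unreachable"
    fi
}

# Run both checks for every host listed in the boot schema file.
check_cluster() {
    for h in $(list_hosts "$1"); do
        check_node "$h"
    done
}
```

After loading these functions into the shell, running check_cluster lamhosts4 would print two lines per node. If the home directory check fails on the slave nodes, that would suggest /home is not being mounted there after the reboot, which might also explain why the ssh keys are not found.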
 
I would really appreciate any help in getting my LAM/MPI working.
 
Thanks and best regards.
 
 
Chiam Tow Jong
Experimental Mechanics Laboratory
Mechanical Engineering Department 
National University of Singapore