I just installed LAM/MPI from the latest source release.
Running recon succeeds, but lamboot fails:
lamboot -d stone1
LAM 6.5.6/MPI 2 C++/ROMIO - University of Notre Dame
lamboot: boot schema file: stone1
lamboot: opening hostfile stone1
lamboot: found the following hosts:
lamboot: n0 stone01
lamboot: resolved hosts:
lamboot: n0 stone01 --> 10.1.2.201
lamboot: found 1 host node(s)
lamboot: origin node is 0 (stone01)
lamboot: attempting to execute "hboot -t -c lam-conf.lam -d -I " -H 10.1.2.201 -P 33170 -n 0 -o 0 ""
hboot: process schema = "/home/david/nds/Billb/lam/etc/lam-conf.lam"
hboot: found /home/david/nds/Billb/lam/bin/lamd
hboot: performing tkill
hboot: tkill
hboot: booting...
hboot: fork /home/david/nds/Billb/lam/bin/lamd
[1] 25701 lamd -H 10.1.2.201 -P 33170 -n 0 -o 0 -d
hboot: attempting to execute
-----------------------------------------------------------------------------
lamboot encountered some error (see above) during the boot process,
and will now attempt to kill all nodes that it was previously able to
boot (if any).
Please wait for LAM to finish; if you interrupt this process, you may
have LAM daemons still running on remote nodes.
-----------------------------------------------------------------------------
wipe ...
LAM 6.5.6/MPI 2 C++/ROMIO - University of Notre Dame
Executing tkill on n0 (stone01)...
lamboot did NOT complete successfully
%
The file stone1 contains only the line
stone01
/tmp has all permissions enabled, rsh works, and I don't see 127.0.0.1
in the output above, so I'm stuck.
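
For anyone wanting to repeat the 127.0.0.1 check outside of lamboot, here is
a minimal sketch, assuming Python is available on the node. The helper names
(is_loopback, resolve_and_check) are made up for illustration and are not
part of LAM; the sketch only tests whether a hostname resolves to a loopback
address, and says nothing about why hboot itself stops.

```python
import ipaddress
import socket
from typing import Tuple


def is_loopback(address: str) -> bool:
    """Return True if the given IP address string is in a loopback range."""
    return ipaddress.ip_address(address).is_loopback


def resolve_and_check(hostname: str) -> Tuple[str, bool]:
    """Resolve a hostname (hypothetical helper) and flag a loopback result."""
    address = socket.gethostbyname(hostname)
    return address, is_loopback(address)


if __name__ == "__main__":
    # "stone01" is the host from the boot schema above; substitute as needed.
    addr, loop = resolve_and_check("stone01")
    print(addr, "loopback" if loop else "not loopback")
```

The address lamboot reported above, 10.1.2.201, would count as non-loopback
under this check.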
Bill
_______________________________________________
This list is archived at http://www.lam-mpi.org/MailArchives/lam/