On Fri, 16 Jul 2004, Gkikas Magiorkinis wrote:
> I have checked the security settings and it is at the "no firewall"
> setting. Is there any specific test to check the firewall?
Bogdan answered this.
> All the nodes are running the lam. When the tping hangs the only way to
> bring down the lam at the tpinging nodes is to use wipe. Lamhalt does
> not work for these specific nodes but it works for the rest of the nodes
> (i mean the nodes i did not choose to tping).
When tping hangs, can you check to see if the lamd is still running on all
the nodes? One of the reaons that tping (and lamhalt) may hang is if a
lamd fails/aborts.
If this is what is happening, it is quite possible that the RPM you
installed is not compatible with your system (there's a million reasons
this could be happening). It may be advistable to either build from
source or download the SRPM and rebuild it for your system (see the thread
that just wrapped up about your installed version of Libtool!
http://www.lam-mpi.org/MailArchives/lam/msg08359.php).
> One additional info is that i have installed MPICH also and it seems to
> work for some applications. The MPICH is installed in directory that is
> commonly shared by all the nodes.
Note that LAM can do this as well; if you uninstall the RPM and build LAM
from source in a directory that is accessible on all nodes, it can be an
easier software management solution in many cases. See the LAM FAQ for
more details here ("Typical setup of LAM").
Hope this helps.
--
{+} Jeff Squyres
{+} jsquyres_at_[hidden]
{+} http://www.lam-mpi.org/
|