LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Craig Lam (craig.mpi_at_[hidden])
Date: 2005-06-23 22:17:59


Hello,

I've set up a diskless cluster running Fedora Core 3 (modified to
allow the diskless cluster nodes to start up). When I run an MPI job,
it seems that stdout does not get directed from remote nodes correctly
although all local processes' output shows up correctly. Does anyone
know why this might be?

My system set up is an 8 node dual opteron cluster running in 32-bit
mode on Linux. Each node has dual infiniband over PCI express
(although I am only using one interface currently). My configuration
of MPI is done with "./configure --with-debug --prefix=/opt/lam-7.0.6
--exec-prefix=/opt/lam-7.0.6 --with-rsh=ssh". The problem exhibits
itself on both Lam-7.0.6 and Lam-7.1.1 (I have not tried other
version). My diskless clusters run NFS version 4, and each cluster
node binds /var/${HOSTNAME}/ to /var and /tmp/${HOSTNAME} to /tmp to
give each node an individual copy of these directories (would this
contribute to these problems?)

I must admit that I am a bit stumped.

Thanks for all your thoughts,
Craig Casey
craig.mpi_at_[hidden]