LAM/MPI General User's Mailing List Archives

From: BOYRIE Fabrice (boyrie_at_[hidden])
Date: 2006-02-14 16:47:19


  Hello
  
  We have two networks on our cluster. On the first one (100 Mbit/s) the
hostnames are node1, node2, ... and on the second one (1 Gbit/s) they are
gblan1, gblan2, ...
  We run Torque on the first network, so when we integrate LAM with it,
MPI messages are transferred over this network.

  The documentation suggests using lam-hostmap.txt.
LAM was configured with:
 ./configure --with-trillium \
     --prefix=/usr/local/lam-7.1.2b31 \
     --with-tm=/usr/local/torque2.0.0p7

So I added the following line to /usr/local/lam-7.1.2b31/etc/lam-hostmap.txt:
node1 mpi=gbnode1

(and node1.alineos.net mpi=gbnode1 to be sure)

But a test with NetPIPE shows that messages are still transferred over the
slow network.

  How can I debug this problem? strace does not show lam-hostmap.txt being
read at all.
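
  One check that can be made independently of LAM is whether each name in
the hostmap and its mpi= alias really resolve to different addresses. The
Python sketch below is only an illustration: it assumes the
"name mpi=altname" line format used above, and parse_hostmap, resolve and
report are hypothetical helper names, not part of LAM.

```python
import socket


def parse_hostmap(text):
    """Return {name: mpi_name} for lines shaped like 'node1 mpi=gbnode1'.

    Assumption: one host per line, with the alias given as an 'mpi=' token.
    Lines without such a token are ignored.
    """
    mapping = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 2:
            continue
        for field in fields[1:]:
            if field.startswith("mpi="):
                mapping[fields[0]] = field[len("mpi="):]
    return mapping


def resolve(name):
    """Return the IPv4 address for name, or None if it does not resolve."""
    try:
        return socket.gethostbyname(name)
    except OSError:
        return None


def report(text):
    """Print each hostmap entry next to the addresses both names resolve to."""
    for name, mpi_name in parse_hostmap(text).items():
        print(name, resolve(name), "->", mpi_name, resolve(mpi_name))


# The two entries from this message, parsed without touching the network.
sample = "node1 mpi=gbnode1\nnode1.alineos.net mpi=gbnode1\n"
print(parse_hostmap(sample))
```

  Calling report() on the contents of the real lam-hostmap.txt on a compute
node would then show whether the alias resolves at all, and whether its
address is on the fast network. It says nothing about whether LAM reads the
file.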

  Thanks
  
  Fabrice BOYRIE