Hello
We have two networks on our cluster. The first one (100 Mbit/s) gives
hostnames as node1, node2... and the second one (1 Gbit/s) as gblan1,
gblan2...
We use Torque on our cluster on the first network, so when we
integrate LAM with it, messages are transferred over this network.
The documentation suggests using lam-hostmap.txt.
lamd was configured with:
./configure --with-trillium \
--prefix=/usr/local/lam-7.1.2b31 \
--with-tm=/usr/local/torque2.0.0p7
So I've added the following to the file /usr/local/lam-7.1.2b31/etc/lam-hostmap.txt:
node1 mpi=gbnode1
(and node1.alineos.net mpi=gbnode1 to be sure)
But the test with NetPipe shows that messages are still transferred over the
slow network.
How can I debug this problem? strace doesn't show any read of
lam-hostmap.txt.
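
One check that does not depend on LAM at all is whether the names on the
right-hand side of mpi= resolve on each node. Below is a minimal sketch of
such a check, not anything shipped with LAM: hostmap_pairs and check_hostmap
are hypothetical helper names, the "host mpi=target" line format is taken
from the entries above, and getent is assumed to be available (it is on
glibc-based Linux systems).

```shell
# Hypothetical helper: print "host target" for every "host mpi=target"
# line of a hostmap file. Lines without an mpi= field are ignored.
hostmap_pairs() {
    while read -r host opt _; do
        case "$opt" in
            mpi=*) printf '%s %s\n' "$host" "${opt#mpi=}" ;;
        esac
    done < "$1"
}

# Hypothetical helper: report whether each mpi= target resolves on this
# node. Assumes getent exists (glibc systems).
check_hostmap() {
    hostmap_pairs "$1" | while read -r host target; do
        if getent hosts "$target" > /dev/null; then
            echo "$host -> $target: resolves"
        else
            echo "$host -> $target: does NOT resolve"
        fi
    done
}
```

It would be run on each node as
check_hostmap /usr/local/lam-7.1.2b31/etc/lam-hostmap.txt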
Thanks
Fabrice BOYRIE