Hi,
I am trying to get LAM/MPI 7.0.6 working on two nodes of a cluster using
RedHat Linux 9. I installed a rpm copy on one of the nodes and from source on
another node. Now when I do a lamboot -v hostfile (hostfile has the names of
the two machines) lam is booted on both the nodes, but when I run an mpi
program (eg.: mpirun -np 2 reduc), only one instance of the mpirun is
started. This one is on the node in which I did a source install. The other
node does not start the mpirun.
At the end of the mpirun I get the following error.
------------------------------------------------------------------------------------------------------------
It seems that [at least] one of the processes that was started with
mpirun did not invoke MPI_INIT before quitting (it is possible that
more than one process did not invoke MPI_INIT -- mpirun was only
notified of the first one, which was on node n0).
mpirun can *only* be used with MPI programs (i.e., programs that
invoke MPI_INIT and MPI_FINALIZE). You can use the "lamexec" program
to run non-MPI programs over the lambooted nodes.
-----------------------------------------------------------------------------------------------------------
Can someone help with this please.
Pravin
|