Hello everyone, i'm new to the mailing list - and to cluster configuration -
I've currently got 4 nodes - redhat 9.0 kernel 2.4.20 - lam 6.5.9 installed and
after issuing:
# lamboot -v
it shows all the nodes completed successfully as being added onto the topology.
Also when I issue:
# lamnodes
it shows a listing of all active nodes and the number of cpus available (i.e.
n0 hostA:1
n1 hostB:2
n2 hostC:3
n3 hostD:4
)
However, when I run mpirun like this:
# mpirun -np 4 /some/mpi/program/from/LSTC info
I get the following error:
--------------------------------------------------------------------------------------
It seems that [ at least ] one of processes that was started with mpirun did not
invoke MPI_INIT before quitting ( it is possible that more than one process did
not invoked MPI_INIT
-- mpirun was only notified of the first one, which was on node n0).
mpirun can *only* be used with MPI programs (i.e., programs that
invoke MPI_INIT and MPI_FINALIZE). You can use the "lamexec" program
to run non-MPI programs over the lambooted nodes.
---------------------------------------------------------------------------------------------
Does anyone have any idea why this occurs? I looked at the archives and saw
someone else mention this, but I didn't see any answers but having installed a
"miss-compiled" rpm binary.
Thanks,
-Louis
|