We are using a software package called LS-DYNA to run on our cluster. The software itself is written to use MPI. Here is the problem. Basically lam-mpi works fine. I have run the test suite and everything passes. When I go to run the lsdynampp software (which is written for lam-mpi) I get the following error message:
becker_at_edms-head1:~$ mpp970
-----------------------------------------------------------------------------
It seems that there is no lamd running on the host .
This indicates that the LAM/MPI runtime environment is not operating.
The LAM/MPI runtime environment is necessary for MPI programs to run
(the MPI program tired to invoke the "MPI_Init" function).
Please run the "lamboot" command the start the LAM/MPI runtime
environment. See the LAM/MPI documentation for how to invoke
"lamboot" across multiple machines.
-----------------------------------------------------------------------------
becker_at_edms-head1:/usr/local/bin$ mpirun C /home/becker/mpp970
-----------------------------------------------------------------------------
It seems that there is no lamd running on the host .
This indicates that the LAM/MPI runtime environment is not operating.
The LAM/MPI runtime environment is necessary for MPI programs to run
(the MPI program tired to invoke the "MPI_Init" function).
Please run the "lamboot" command the start the LAM/MPI runtime
environment. See the LAM/MPI documentation for how to invoke
"lamboot" across multiple machines.
-----------------------------------------------------------------------------
-----------------------------------------------------------------------------
It seems that [at least] one of the processes that was started with
mpirun did not invoke MPI_INIT before quitting (it is possible that
more than one process did not invoke MPI_INIT -- mpirun was only
notified of the first one, which was on node n0).
mpirun can *only* be used with MPI programs (i.e., programs that
invoke MPI_INIT and MPI_FINALIZE). You can use the "lamexec" program
to run non-MPI programs over the lambooted nodes.
-----------------------------------------------------------------------------
By doing a ps -aux I am able to see that lamd is currently running. I am also able to use the tping command and get a reply from both the remote and the local node. LSTC, the maker of LS-DYNA is telling me that the issue is with LAM/MPI and not their software. Since the test suite runs properly and everything else is working I tend to disagree with them, but I figured I would ask here to see if anyone else has had any problems.
Thank you for your time.
Rob Becker
Unix Administrator
Battelle.
11-1-029D
505 King Ave.
Columbus, OH 43201
614-424-6378
|