LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Andre Kempe (andre_at_[hidden])
Date: 2004-05-24 13:54:00


hi.

signal 9 usually is the kill-signal, which means someone/thing killed
your process. maybe someone on that machine has objections against your
process running on his machine?

for a list of signals have a look at /usr/include/signal.h or
/usr/include/bits/signum.h

good luck

andre

syoon_at_[hidden] wrote:

>Hello,
>
>My job has been mysteriously exited form MPI run
>with the following error messages returned.
>I'm wondering what the "singal 9" means, and why this
>happenened.
>
> Thanks, Phil
>
>
>=======================================================
>Starting on node004 at Sun May 23 21:32:34 CDT 2004
>Nodes for this job:
>node004g
>node003g
>
>LAM 7.0.3/MPI 2 C++/ROMIO - Indiana University
>----------------------------------------------------------------------------
>One of the processes started by mpirun has exited with a nonzero exit
>code. This typically indicates that the process finished in error.
>If your process did not finish in error, be sure to include a "return
>0" or "exit(0)" in your C code before exiting the application.
>
>PID 1554 failed on node n0 due to signal 9.
>-----------------------------------------------------------------------------
>
>LAM 7.0.3/MPI 2 C++/ROMIO - Indiana University
>
>Ended at Sun May 23 23:57:44 CDT 2004
>================================================================
>
>
>_______________________________________________
>This list is archived at http://www.lam-mpi.org/MailArchives/lam/
>
>
>