hi.
signal 9 usually is the kill-signal, which means someone/thing killed
your process. maybe someone on that machine has objections against your
process running on his machine?
for a list of signals have a look at /usr/include/signal.h or
/usr/include/bits/signum.h
good luck
andre
syoon_at_[hidden] wrote:
>Hello,
>
>My job has been mysteriously exited form MPI run
>with the following error messages returned.
>I'm wondering what the "singal 9" means, and why this
>happenened.
>
> Thanks, Phil
>
>
>=======================================================
>Starting on node004 at Sun May 23 21:32:34 CDT 2004
>Nodes for this job:
>node004g
>node003g
>
>LAM 7.0.3/MPI 2 C++/ROMIO - Indiana University
>----------------------------------------------------------------------------
>One of the processes started by mpirun has exited with a nonzero exit
>code. This typically indicates that the process finished in error.
>If your process did not finish in error, be sure to include a "return
>0" or "exit(0)" in your C code before exiting the application.
>
>PID 1554 failed on node n0 due to signal 9.
>-----------------------------------------------------------------------------
>
>LAM 7.0.3/MPI 2 C++/ROMIO - Indiana University
>
>Ended at Sun May 23 23:57:44 CDT 2004
>================================================================
>
>
>_______________________________________________
>This list is archived at http://www.lam-mpi.org/MailArchives/lam/
>
>
>
|