LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Bogdan Costescu (Bogdan.Costescu_at_[hidden])
Date: 2005-12-13 20:56:18


On Tue, 13 Dec 2005, Jeff Squyres wrote:

> > PID 5074 failed on node n0 (134.153.50.235) due to signal 15.
> I'm assuming that this is a linux system -- signal 15 is ENOTBLK.

Err, no. Signal 15 is SIGTERM as shown by /usr/include/bits/signum.h
... you mistakenly looked at errno.h as all signal names start with
SIG :-)

But this doesn't say much about the reason for terminating the
parallel job... Maybe the remote shell is not clean - does it write
something on stdout or stderr ?

-- 
Bogdan Costescu
IWR - Interdisziplinaeres Zentrum fuer Wissenschaftliches Rechnen
Universitaet Heidelberg, INF 368, D-69120 Heidelberg, GERMANY
Telephone: +49 6221 54 8869, Telefax: +49 6221 54 8868
E-mail: Bogdan.Costescu_at_[hidden]