LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Bogdan Costescu (Bogdan.Costescu_at_[hidden])
Date: 2006-10-26 10:46:44


On Thu, 26 Oct 2006, Lily Li wrote:

> we start having a higher rate of lamd hanging problem on the
> headnode. The lamd will not response to the command "lamnodes" after
> the LAM is booted and used for couple of days.

This is a bit vague description of the problem. Have you done anything
to diagnose why the lamd would not respond anymore ? For example, have
you tried attaching to the "hung" lamd with gdb or using 'strace -p'
to know what the process is actually doing ?

> do we need to recompile/link the LAM and the applications after we
> upgrade the linux kernel ?

No. Especially with kernels from an enterprise class Linux
distribution which should not change too much between updates.

-- 
Bogdan Costescu
IWR - Interdisziplinaeres Zentrum fuer Wissenschaftliches Rechnen
Universitaet Heidelberg, INF 368, D-69120 Heidelberg, GERMANY
Telephone: +49 6221 54 8869, Telefax: +49 6221 54 8868
E-mail: Bogdan.Costescu_at_[hidden]