LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Andrew Friedley (afriedle_at_[hidden])
Date: 2006-10-26 10:48:52


Lily Li wrote:
>
> Hello, everyone,
>
> It seems that after we upgrade the linux kernel from 2.6.9-34.Elsmp to
> 2.6.9-42.Elsmp for our Pentium III cluster, we start having a higher
> rate of lamd hanging problem on the headnode. The lamd will not response
> to the command "lamnodes" after the LAM is booted and used for couple of
> days.
>
> The question is : do we need to recompile/link the LAM and the
> applications after we upgrade the linux kernel ?

I'm not sure why lamd would hang, but I certainly wouldn't expect
changing the kernel to cause it.. especially when both are 2.6.9.

Exactly how often was lamd hanging before, and how often is it hanging now?

I'm wondering if something else on the system is causing it.. Does
changing back to the old kernel take you back to the old lamd behavior?
  Has your LAM/MPI workload changed? Anything else?

Andrew