> > at 10:58 ....."lamboot -v -s lamboot.mpi" command received
> > PBIND-------------------------------------
> > LAM failed to execute a LAM binary on the remote node beo132.
> > NJS_WORKDIR,NJS_STEPNAME xa=-40_migrate.apps" command received
> > YPBINDPROC_DOMAIN: Domain not bound
>
> Specifically, the "PBIND" and "YPBINDPROC_DOMAIN" errors were not
> printed by LAM. The "LAM failed to execute a LAM binary..." message
> doesn't look familiar, either.
I might be very wrong on this one (hadn't encountered these errors myself)
but YPBINDPROC_DOMAIN error may be connected with the yellow pages
(NIS/NIS+). If such a system is in use on your cluster I'd bet it's the
culprit here. (If NIS/NIS+ sounds unfamiliar - it's a system allowing
single password/login on a group of machines. It's nice when it works but
I had the dubious pleasure to see it die... Quite a mess.)
best regards
Konrad Karczewski
Czestochowa University of Technology
|