LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Jeff Squyres (jsquyres_at_[hidden])
Date: 2005-04-30 06:46:21


Unfortunately, this doesn't tell us what the error was. Can you post
the full output of lamboot -d? Sometimes the error message is not in
the same location as the rest of the text for that host (this is
because it comes out on stderr rather than stdout, and they get
buffered separately).

On Apr 28, 2005, at 1:44 PM, ew fgff wrote:

> Hi,
>
> I am trying to connect 4 computers to run LAM/MPI. All
> computers are running RedHat Enterprise. When I run
> "recon -v lam-bhost.def" it shows me all 4 cpu. But
> when I run "lamboot -v -d", it executes "hboot" in 3
> of them. It faced some problem with the other one and
> could not complete lamboot. As I know, all computers
> are identical.
> Could you please tell what could be the problem and
> how to fix this. The error massage is:
>
> ----------------------------------------------
> hboot: process schema = "/etc/lam/lam-conf.lam"
> hboot: found /usr/bin/lamd
> hboot: performing tkill
> hboot: tkill
> hboot: booting...
> hboot: fork /usr/bin/lamd
> [1] 19834 lamd -H 131.123.234.176 -P 38625 -n 2 -o 0
> -d
>
> lamboot encountered some error (see above) during the
> boot process,
> and will now attempt to kill all nodes that it was
> previously able to
> boot (if any).
> --------------------------------------------------
>
> The first paragraph of the massage is identical for
> all cpu.
>
> Thank you very much for your help.
> Manoj
>
> __________________________________________________
> Do You Yahoo!?
> Tired of spam? Yahoo! Mail has the best spam protection around
> http://mail.yahoo.com
> _______________________________________________
> This list is archived at http://www.lam-mpi.org/MailArchives/lam/
>

-- 
{+} Jeff Squyres
{+} jsquyres_at_[hidden]
{+} http://www.lam-mpi.org/