Konrad,
lamboot hostfile was initiated prior to running any command. I have tried lamhalt multiple times which succeeds in shutting down the lamd daemon. Like I said, I am at a total loss to why this is happening. LSTC says the message I am getting is from the MPI, yet I can't see why it would since it passes every other test I can throw at it.
-Rob Becker
-----Original Message-----
From: Konrad Karczewski [mailto:xeno_at_[hidden]]
Sent: Thursday, December 11, 2003 2:37 AM
To: General LAM/MPI mailing list
Subject: Re: LAM: It says lamd isn't running?
On Wed, 10 Dec 2003, Becker, Robert P wrote:
> We are using a software package called LS-DYNA to run on our cluster.
....
> By doing a ps -aux I am able to see that lamd is currently running. I
> am also able to use the tping command and get a reply from both the
> remote and the local node. LSTC, the maker of LS-DYNA is telling me
> that the issue is with LAM/MPI and not their software. Since the test
> suite runs properly and everything else is working I tend to disagree
> with them, but I figured I would ask here to see if anyone else has had
> any problems.
It's not sufficient to have any lam daemon running, you have to start
"your own" process - I suppose that you're not running lamboot before
starting your task. Try running lamhalt (it should fail in this case) and
then lamboot <lamhosts> as the user which is trying to run the
application. If this doesn't help - you found an interesting case ;)
best regards
Konrad Karczewski
_______________________________________________
This list is archived at http://www.lam-mpi.org/MailArchives/lam/
|