LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Jeff Squyres (jsquyres_at_[hidden])
Date: 2008-02-14 14:59:12


On Feb 14, 2008, at 1:37 PM, fahad saeed wrote:

> This is what I am trying to accomplish. I am trying to run a single
> binary on different nodes for different data sets.i.e. a single
> binary has to run on different data sets on different nodes....I
> understand that it can be done using ssh or rsh i guess....

Yes.

> but I want to do this using MPI library.....

For the code you showed, you're not using any MPI function calls in a
meaningful way (i.e., you could get the rank in different ways). It
seems like MPI is not the right tool for what you're trying to do.

> For example....
>
> node1 may run --------> ./binary -in file1 -out file1-output
> node2 may run --------> ./binary -in file2 -out file2-output

If you're running a different command (e.g., different argv) on every
node, and you're not using MPI for message passing, why not just use
rsh/ssh or lamexec?

If it comes down to selecting which in/out file to use, doing it with

> so on and so forth....
> where in my mpi program this line(./binary -in file1 -out file1-
> output) is in "1-exec" and so on.....
>
> I have tried to distribute the 'load' on each node using the
> rank.......but the get errors as I discussed...

Did you look at the man page for lamexec(1)?

An alternative would be to put MPI_Init / MPI_Finalize in the ./binary
program itself and have them figure out their argv based on their
MPI_COMM_WORLD rank. Then you would just do "mpirun C ./binary" and
they would figure out their in and out filenames themselves.

>
> Thanks
>
> Fahad
>
>
> > Date: Thu, 14 Feb 2008 11:56:09 +0100
> > From: Bogdan.Costescu_at_[hidden]
> > To: lam_at_[hidden]
> > Subject: Re: LAM: caused collective abort of all ranks
> >
> > On Wed, 13 Feb 2008, fahad saeed wrote:
> >
> > > mpiexec or lamexec, are used to execute non-MPI programs, but how
> > > would that help in this case.Would a scheduler be used along with
> > > mpiexec to do the task under question.
> >
> > You seem not to understand what message passing means and what it
> can
> > do, plus you haven't really explained what you want to achieve,
> you've
> > only shown us the errors that you get. So please try to write down a
> > description of what your goals are and maybe we can find together a
> > solution. Also remember that this is the LAM/MPI list, dedicated to
> > issues related to LAM/MPI and not to clustering in general.
> >
> > --
> > Bogdan Costescu
> >
> > IWR, University of Heidelberg, INF 368, D-69120 Heidelberg, Germany
> > Phone: +49 6221 54 8869/8240, Fax: +49 6221 54 8868/8850
> > E-mail: bogdan.costescu_at_[hidden]
> > _______________________________________________
> > This list is archived at http://www.lam-mpi.org/MailArchives/lam/
>
> Climb to the top of the charts! Play the word scramble challenge
> with star power. Play now!
> _______________________________________________
> This list is archived at http://www.lam-mpi.org/MailArchives/lam/

-- 
Jeff Squyres
Cisco Systems