LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Brian Barrett (brbarret_at_[hidden])
Date: 2005-01-26 02:39:04


Sorry about the slow reply - I've been traveling the last couple of
days. Your error is very strange, as we don't do any BProc specific
functionality after lamboot. Are you sure that you are linking against
LAM's libmpi and not an MPICH built for BProc? It looks like something
is trying to do BProc moves, which is unexpected for mpirun.
Otherwise, all I can recommend is trying to use gdb to find out what
application is calling the function that contains the bproc/move.c line
1886.

Brian

On Jan 18, 2005, at 4:15 AM, Alexander Lichius wrote:

> Hello all,
>
> I have just installed a small testbed for our cluster consisting of
> only
> 2 computers. I have installed clustermatic 5 on top of Fedora Core 3.
> Booting and bpsh'ing works fine, but I have some trouble getting MPI
> programs to work with LAM/MPI.
>
> I found some postings on the archives but no clues how to solve them.
>
> The following issue concerns LAM/MPI 7.1.1-2 and the latest SVN
> snapshot
> (7.2b1r10023). I compiled both from scratch using gcc/g77 and
> gcc/nagware. I can lamboot without any problems using the bproc ssi
> boot
> module and tping reports that it can find all computers (master and 1
> node).
>
> Then I try to start one of the examples contained in the LAM/MPI
> distro,
> e.g. the pi one. As soon as I start "mpirun n0-1
> PATH_TO_LAM/example/fpi", I get the following message on the nodes
> console:
> "bproc: WARNING: bproc/move.c: 1886: send_recv_process needs to be
> reworked to be consistent with the rest of the move code" And on the
> master the mpirun program reports:
>
> "----------------------------------------------------------------------
> -
> ------
> It seems that [at least] one of the processes that was started with
> mpirun did not invoke MPI_INIT before quitting (it is possible that
> more
> than one process did not invoke MPI_INIT -- mpirun was only notified of
> the first one, which was on node n0).
>
> mpirun can *only* be used with MPI programs (i.e., programs that invoke
> MPI_INIT and MPI_FINALIZE). You can use the "lamexec" program to run
> non-MPI programs over the lambooted nodes.
> -----------------------------------------------------------------------
> -
> -----"
>
> Does anybody experienced similar problems or has a tip how to verify
> that my setup is basically ok? Help would be very appreciated. Thanks
> in
> advance.
>
> Alex
>
> _______________________________________________
> This list is archived at http://www.lam-mpi.org/MailArchives/lam/
>

-- 
   Brian Barrett
   LAM/MPI developer and all around nice guy
   Have an LAM/MPI day: http://www.lam-mpi.org/