Hi
We have installed the LAM package that comes with Debian Sarge. The program
compiles even without -lmpi.
The systems already had mpich2 installed in them. How can we make sure that
we are using the mpicc from LAM implementation? Do we have to uninstall
mpich2?
Thanks
MEC
On 1/18/06, Jeff Squyres <jsquyres_at_[hidden]> wrote:
>
> One thing to double check -- ensure that you are using the mpicc and
> mpirun from the LAM installation, and not from some other MPI
> installation. For example, with LAM's mpicc, there is no need to add
> -lmpi.
>
> Can you confirm that you are using the right ones?
>
>
> On Jan 18, 2006, at 4:28 AM, Soumya,Teena, Ranjana,Liss,Navya wrote:
>
> > Hi
> > The lam test suite reported 12 of 12 Tests failed.
> > The following files have been attached.
> >
> > Output from the "laminfo" command
> > Output from "./configure" in the lamtests directory
> > The config.log file from the lamtests directory
> > Output from "make check"
> >
> > We are using networked file system.
> > A total of 4 nodes are there in the cluster. One is master and it has
> > 3 network interface cards. The rest have a single network interface
> > card. Does this cause any problem?
> >
> > We can compile successfully. Even lamboot produces no error. But
> > mpirun is producing error.
> > The following is what happens:
> > ______________________________________________________________
> > project_at_ltsp:~/lam$ lamboot
> >
> > LAM 7.1.1/MPI 2 C++/ROMIO - Indiana University
> >
> > project_at_ltsp:~/lam$ lamnodes
> > n0 master.mec.ac.in:1:origin,this_node
> > n1 node-1.mec.ac.in:1:
> > n2 node-2.mec.ac.in:1:
> > n3 node-3.mec.ac.in:1 :
> > project_at_ltsp:~/lam$ mpicc hello.c -lmpi
> > project_at_ltsp:~/lam$ mpirun.lam -np 4 a.out
> > ----------------------------------------------------------------------
> > -------
> > One of the processes started by mpirun has exited with a nonzero exit
> > code. This typically indicates that the process finished in error.
> > If your process did not finish in error, be sure to include a "return
> > 0" or "exit(0)" in your C code before exiting the application.
> >
> > PID 25390 failed on node n0 (192.168.3.200) due to signal 11.
> > _____________________________________________________________
> > On running the allgather_inter.o, the following happens
> > _____________________________________________________________
> > project_at_ltsp:~/lamtests-7.1.1/ccl/intercomm$ mpirun.lam -np 2
> > allgather_inter
> > [**ERROR**]: LAM/MPI MPI_COMM_WORLD rank 0, file allgather_inter.c:40:
> > This test requires an even number of processes to run. Aborting.
> > [unset]: aborting job:
> > application called MPI_Abort(MPI_COMM_WORLD, 1) - process 0
> > mpirun: cannot start allgather_inter on n1: No such file or directory
> > project_at_ltsp:~/lamtests-7.1.1/ccl/intercomm$
> > _____________________________________________________________
> > On running make check
> > _____________________________________________________________
> > project_at_ltsp:~/lamtests-7.1.1$ make check >>make_check_output
> > make[3]: *** [check-TESTS] Error 1
> > make[2]: *** [check-am] Error 2
> > make[1]: *** [check-recursive] Error 1
> > make: *** [check-recursive] Error 1
> > project_at_ltsp:~/lamtests-7.1.1$
> > _____________________________________________________________
> > Please help.
> >
> > Thanks
> > MEC
> > --
> > CE 2002-06
> > <config.log>
> > <make_check_output>
> > <laminfo.txt>
> > <ouput_of_configure.txt>
> > _______________________________________________
> > This list is archived at http://www.lam-mpi.org/MailArchives/lam/
>
>
> --
> {+} Jeff Squyres
> {+} The Open MPI Project
> {+} http://www.open-mpi.org/
>
>
> _______________________________________________
> This list is archived at http://www.lam-mpi.org/MailArchives/lam/
>
--
CE 2002-06
|