- Next message: Soumya,Teena, Ranjana,Liss,Navya: "Re: LAM: lam test suite failed"
- Previous message: Jeff Squyres: "Re: LAM: installing superLU-dist 2.0"
- In reply to: Soumya,Teena, Ranjana,Liss,Navya: "LAM: lam test suite failed"
- Next in thread: Soumya,Teena, Ranjana,Liss,Navya: "Re: LAM: lam test suite failed"
- Reply: Soumya,Teena, Ranjana,Liss,Navya: "Re: LAM: lam test suite failed"
One thing to double check -- ensure that you are using the mpicc and
mpirun from the LAM installation, and not from some other MPI
installation. For example, with LAM's mpicc, there is no need to add
-lmpi.
Can you confirm that you are using the right ones?
On Jan 18, 2006, at 4:28 AM, Soumya,Teena, Ranjana,Liss,Navya wrote:
> Hi
> The lam test suite reported 12 of 12 Tests failed.
> The following files have been attached.
>
> Output from the "laminfo" command
> Output from "./configure" in the lamtests directory
> The config.log file from the lamtests directory
> Output from "make check"
>
> We are using networked file system.
> A total of 4 nodes are there in the cluster. One is master and it has
> 3 network interface cards. The rest have a single network interface
> card. Does this cause any problem?
>
> We can compile successfully. Even lamboot produces no error. But
> mpirun is producing error.
> The following is what happens:
> ______________________________________________________________
> project_at_ltsp:~/lam$ lamboot
>
> LAM 7.1.1/MPI 2 C++/ROMIO - Indiana University
>
> project_at_ltsp:~/lam$ lamnodes
> n0 master.mec.ac.in:1:origin,this_node
> n1 node-1.mec.ac.in:1:
> n2 node-2.mec.ac.in:1:
> n3 node-3.mec.ac.in:1:
> project_at_ltsp:~/lam$ mpicc hello.c -lmpi
> project_at_ltsp:~/lam$ mpirun.lam -np 4 a.out
> ----------------------------------------------------------------------
> -------
> One of the processes started by mpirun has exited with a nonzero exit
> code. This typically indicates that the process finished in error.
> If your process did not finish in error, be sure to include a "return
> 0" or "exit(0)" in your C code before exiting the application.
>
> PID 25390 failed on node n0 (192.168.3.200) due to signal 11.
> _____________________________________________________________
> On running the allgather_inter.o, the following happens
> _____________________________________________________________
> project_at_ltsp:~/lamtests-7.1.1/ccl/intercomm$ mpirun.lam -np 2
> allgather_inter
> [**ERROR**]: LAM/MPI MPI_COMM_WORLD rank 0, file allgather_inter.c:40:
> This test requires an even number of processes to run. Aborting.
> [unset]: aborting job:
> application called MPI_Abort(MPI_COMM_WORLD, 1) - process 0
> mpirun: cannot start allgather_inter on n1: No such file or directory
> project_at_ltsp:~/lamtests-7.1.1/ccl/intercomm$
> _____________________________________________________________
> On running make check
> _____________________________________________________________
> project_at_ltsp:~/lamtests-7.1.1$ make check >>make_check_output
> make[3]: *** [check-TESTS] Error 1
> make[2]: *** [check-am] Error 2
> make[1]: *** [check-recursive] Error 1
> make: *** [check-recursive] Error 1
> project_at_ltsp:~/lamtests-7.1.1$
> _____________________________________________________________
> Please help.
>
> Thanks
> MEC
> --
> CE 2002-06
> <config.log>
> <make_check_output>
> <laminfo.txt>
> <ouput_of_configure.txt>
> _______________________________________________
> This list is archived at http://www.lam-mpi.org/MailArchives/lam/
--
{+} Jeff Squyres
{+} The Open MPI Project
{+} http://www.open-mpi.org/
- Next message: Soumya,Teena, Ranjana,Liss,Navya: "Re: LAM: lam test suite failed"
- Previous message: Jeff Squyres: "Re: LAM: installing superLU-dist 2.0"
- In reply to: Soumya,Teena, Ranjana,Liss,Navya: "LAM: lam test suite failed"
- Next in thread: Soumya,Teena, Ranjana,Liss,Navya: "Re: LAM: lam test suite failed"
- Reply: Soumya,Teena, Ranjana,Liss,Navya: "Re: LAM: lam test suite failed"
|