Hi
The LAM test suite reported that 12 of 12 tests failed.
The following files are attached:
Output from the "laminfo" command
Output from "./configure" in the lamtests directory
The config.log file from the lamtests directory
Output from "make check"
We are using a networked file system.
The cluster has 4 nodes in total. One is the master, which has
3 network interface cards. The others have a single network interface
card each. Could this cause any problems?
Compilation succeeds and lamboot reports no errors, but
mpirun fails.
The following is what happens:
______________________________________________________________
project_at_ltsp:~/lam$ lamboot
LAM 7.1.1/MPI 2 C++/ROMIO - Indiana University
project_at_ltsp:~/lam$ lamnodes
n0 master.mec.ac.in:1:origin,this_node
n1 node-1.mec.ac.in:1:
n2 node-2.mec.ac.in:1:
n3 node-3.mec.ac.in:1:
project_at_ltsp:~/lam$ mpicc hello.c -lmpi
project_at_ltsp:~/lam$ mpirun.lam -np 4 a.out
-----------------------------------------------------------------------------
One of the processes started by mpirun has exited with a nonzero exit
code. This typically indicates that the process finished in error.
If your process did not finish in error, be sure to include a "return
0" or "exit(0)" in your C code before exiting the application.
PID 25390 failed on node n0 (192.168.3.200) due to signal 11.
_____________________________________________________________
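In case it helps with reading the lamnodes output above, here is a small Python sketch of how we understand each line: node id, hostname, CPU count, then a comma-separated list of flags. The helper name parse_lamnodes and the LamNode record are our own names for illustration, not part of LAM, and we are assuming the number after the hostname is the CPU count.

```python
from typing import NamedTuple


class LamNode(NamedTuple):
    node_id: str   # e.g. "n0"
    host: str      # e.g. "master.mec.ac.in"
    cpus: int      # number after the first colon (assumed to be the CPU count)
    flags: tuple   # e.g. ("origin", "this_node"); empty if none are listed


def parse_lamnodes(text):
    """Parse lines shaped like 'n0 host:1:origin,this_node' (hypothetical helper)."""
    nodes = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Split "n0" from "host:cpus:flags", then split the three colon fields.
        node_id, rest = line.split(None, 1)
        host, cpus, flags = rest.split(":", 2)
        nodes.append(LamNode(node_id, host, int(cpus),
                             tuple(f for f in flags.split(",") if f)))
    return nodes


# The lamnodes output copied from the session above.
sample = """\
n0 master.mec.ac.in:1:origin,this_node
n1 node-1.mec.ac.in:1:
n2 node-2.mec.ac.in:1:
n3 node-3.mec.ac.in:1:
"""

nodes = parse_lamnodes(sample)
print(len(nodes), sum(n.cpus for n in nodes))
```

Read this way, lamboot has brought up all four nodes with one CPU each, and only n0 carries the origin and this_node flags.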
Running the allgather_inter test gives the following:
_____________________________________________________________
project_at_ltsp:~/lamtests-7.1.1/ccl/intercomm$ mpirun.lam -np 2 allgather_inter
[**ERROR**]: LAM/MPI MPI_COMM_WORLD rank 0, file allgather_inter.c:40:
This test requires an even number of processes to run. Aborting.
[unset]: aborting job:
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 0
mpirun: cannot start allgather_inter on n1: No such file or directory
project_at_ltsp:~/lamtests-7.1.1/ccl/intercomm$
_____________________________________________________________
Running make check gives the following:
_____________________________________________________________
project_at_ltsp:~/lamtests-7.1.1$ make check >>make_check_output
make[3]: *** [check-TESTS] Error 1
make[2]: *** [check-am] Error 2
make[1]: *** [check-recursive] Error 1
make: *** [check-recursive] Error 1
project_at_ltsp:~/lamtests-7.1.1$
_____________________________________________________________
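Note that ">>make_check_output" in the command above redirects only standard output, which is why the make error lines still appear on the terminal and are not in the attached file. If a single log with both streams is more useful, something like the following Python sketch could capture it. The helper name run_and_log is hypothetical, and the demo uses a stand-in command rather than make so that it runs anywhere.

```python
import os
import subprocess
import sys
import tempfile


def run_and_log(cmd, log_path):
    """Run cmd, appending both stdout and stderr to log_path.

    Hypothetical helper, roughly what `cmd >>log_path 2>&1` does in a shell.
    Returns the exit code of the command.
    """
    with open(log_path, "a") as log:
        result = subprocess.run(cmd, stdout=log, stderr=subprocess.STDOUT)
    return result.returncode


# Stand-in for "make check": writes to both streams and exits with a nonzero code.
demo_cmd = [sys.executable, "-c",
            "import sys; print('to stdout'); "
            "print('to stderr', file=sys.stderr); sys.exit(2)"]

log_path = os.path.join(tempfile.mkdtemp(), "make_check_output")
exit_code = run_and_log(demo_cmd, log_path)

with open(log_path) as log:
    captured = log.read()
print(exit_code, "to stdout" in captured, "to stderr" in captured)
```

For the real run, the command would be ["make", "check"], executed from the lamtests-7.1.1 directory.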
Please help.
Thanks
MEC
--
CE 2002-06