Hello,
I am a newbie.
I tried to run a small MPI test program. On our cluster this program works.
I started LAM with lamboot, using ssh:
lamboot -b -d -ssi boot rsh -ssi boot_rsh_agent "ssh -x" hostfilempi
This worked without any error message.
With lamnodes I could see that all nodes had started:
n0 ppc214.mipool.uni-jena.de:1:origin,this_node
n1 ppc203.mipool.uni-jena.de:1:
n2 ppc201.mipool.uni-jena.de:1:
n3 ppc231.mipool.uni-jena.de:1:
Then I tried to start my small test program:
#include <stdio.h>   /* printf */
#include <stdlib.h>  /* exit */
#include <mpi.h>

int main(int argc, char *argv[])
{
    int size, rank;
    double botschaft;  /* the message passed around the ring */
    MPI_Status status;

    MPI_Init(&argc, &argv);
    MPI_Comm_size(MPI_COMM_WORLD, &size);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    botschaft = 0.0;

    if (rank == 0)
    {
        /* Rank 0 starts the ring and waits for the last rank to answer. */
        MPI_Send(&botschaft, 1, MPI_DOUBLE, 1, 1,
                 MPI_COMM_WORLD);
        MPI_Recv(&botschaft, 1, MPI_DOUBLE, size-1, 1,
                 MPI_COMM_WORLD, &status);
        printf("result: %f\n", botschaft);
    }
    else
    {
        /* Every other rank adds its own rank and passes the sum on. */
        MPI_Recv(&botschaft, 1, MPI_DOUBLE, rank-1, 1,
                 MPI_COMM_WORLD, &status);
        botschaft += rank;
        MPI_Send(&botschaft, 1, MPI_DOUBLE, (rank+1)%size, 1,
                 MPI_COMM_WORLD);
    }

    MPI_Finalize();
    exit(0);
}
But I only get this error message:
jreichel@ppc214:~$ mpirun.lam C mpitest
-----------------------------------------------------------------------------
One of the processes started by mpirun has exited with a nonzero exit
code. This typically indicates that the process finished in error.
If your process did not finish in error, be sure to include a "return
0" or "exit(0)" in your C code before exiting the application.
PID 3660 failed on node n0 (141.35.13.114) due to signal 4.
-----------------------------------------------------------------------------
Does anybody know the reason for this failure?
Regards
Joerg