Hi,
I am developing a parallel program using MPI on a cluster of 16
dual-processor nodes. Everything seemed to be fine, but when I test my
program (I am now using a simple "Hello world!" program to isolate the
problem) with two processes per node (by adding "cpu=2" in the
configuration file), I get the following error:
magonzalez@baobab:~/StageVerano/HelloWorld$ mpirun -np 6 ./hello
-----------------------------------------------------------------------------
The selected RPI failed to initialize during MPI_INIT. This is a
fatal error; I must abort.
This occurred on host n2 (n1).
The PID of failed process was 12851 (MPI_COMM_WORLD rank: 2)
-----------------------------------------------------------------------------
-----------------------------------------------------------------------------
One of the processes started by mpirun has exited with a nonzero exit
code. This typically indicates that the process finished in error.
If your process did not finish in error, be sure to include a "return
0" or "exit(0)" in your C code before exiting the application.
PID 29102 failed on node n0 (10.0.0.1) with exit status 1.
-----------------------------------------------------------------------------
The problem remains even after running "lamclean", or after restarting
LAM with "lamhalt" followed by "lamboot".
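For reference, a LAM boot schema that declares two CPUs per node might
look like the sketch below. The hostnames are placeholders, not the real
node names, and only the first three of the 16 nodes are shown.

```
# Hypothetical boot schema passed to lamboot (hostnames are assumed)
node01.example.com cpu=2
node02.example.com cpu=2
node03.example.com cpu=2
```

With such a schema, "mpirun -np 6 ./hello" would be expected to place
two processes on each of the first three nodes.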
Thanks in advance for any help :)
Miguel Ángel González Gisbert
FACULTAD DE INFORMÁTICA - UPM