Hello everybody
i'm working in a 64 cpus cluster, when i try to run the command "mpirun -np 64 -machinefile /root/rack05 mpihello noswap" , i get the next error:
Timeout in waiting for processes to exit, 4 left. This may be due to a defective
rsh program (Some versions of Kerberos rsh have been observed to have this
problem).
This is not a problem with P4 or MPICH but a problem with the operating
environment. For many applications, this problem will only slow down
process termination.
Then i get
p25_15642: p4_error: Timeout in establishing connection to remote process: 0
The nodes have RH 7.1 kernel 2.4.18 and their rsh RPM are
rsh-0.17-2.5
rsh-server-0.17-2.5
Why can i do?
thanks
_______________________________________________
This list is archived at http://www.lam-mpi.org/MailArchives/lam/
|