I am running lam-7.0 on a Opteron cluster running SuSe 8.1.
I noticed that mpirun sometimes hangs when running multiple MPI jobs.
These jobs run on 64 slave nodes, and keep the system resources fairly
busy. The first job is doing a fair amount of disk i/o when the second
job starts. The second job sometimes hangs. This happens before even
getting to MPI_init. Has anyone seen this kind of problem before. Is
there any option in mpirun that can help with this problem.
Thanks in advance,
Riju
|