Hello,
My MPI parallel code has a problem running on our cluster, which uses a
copper Gigabit switch. My jobs finish with an error ("One of the processes
started by mpirun has exited with a nonzero exit code"). However, the same
code runs without problems on the Beowulf cluster using Myrinet cards.
- What could be the cause of the problem?
- Is it the coding (parallel algorithms)?
- Does it have anything to do with the rates of sending and receiving data
across processors? (Do the CPU speeds have to be equal?)
Thank you and regards,
Watit
Center for Combustion and Environmental Research
University of Colorado at Boulder