I have LAM 6.5.9 installed on a 32 node ethernet cluster. I am an MPI
code that is basically pure intensive communication. The sizes of the
communications range from 1 B to 256BYTES per node. In the mpi
application, I use isend and irecv, and i am sure i have enough
buffer space. So, after running the application for many iterations, the
program hangs on large sizes and sometimes even for meduim sizes. The
feeling that i am getting is that somehow, after the network is saturated,
lam does not deliver packets and hangs. Anyone has a clue? is there away
around this? in the application, every X amounts of runs, i call lamclean
to free some resources. That did not help!
*********************************************************************
Ahmad Faraj Office: 170 James Lov Building
Ph.D Candidate Phone: (850)644-1533
Department of Computer Science Fax: (850)644-0058
Florida State University Email: faraj_at_[hidden]
Tallahassee, FL 32306 URL: http://www.cs.fsu.edu/~faraj
**********************************************************************
|