Does anybody know of an MPI implementation capable of making real
"progress" on nonblocking communication in the background (i.e.,
overlapping computation with data transfer)?
I tried calling MPI_Test inside the compute loop to help this progress
along in both LAM/MPI and MPICH, but I got no good results with big
(> 25 Kbytes) buffers. I was told that MPI/Pro and ChaMPIon/Pro handle
this overlap better. Has anybody tried them?
Thanks.