On Sep 30, 2010, at 9:15 PM, David Shochat wrote:
> Thanks for the quick reply. Would there be any API changes going to
> 7.1.4? Or going to Open MPI (assuming we're only using things that
> were available in 7.0)?
There should not be. Both Open MPI and LAM/MPI implement the standard MPI API, so any MPI functions that you're using in LAM/MPI 7.0 should also be present and unaltered (in terms of C function signature) in LAM/MPI 7.1.x and Open MPI.
> Meanwhile, we have learned (by using truss on the sending process)
> that the failure is on the sending side (we can see a TCP failure
> followed by an unsuccessful retry) even though MPI_Bsend() is not
> returning an error status.
Weird.
Does dmesg return anything useful?
I only half-care about the answer to that question, though: if you can upgrade to a newer LAM/MPI or to Open MPI, it's only worth pursuing if the same problem still occurs after the upgrade.
--
Jeff Squyres
jsquyres_at_[hidden]
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/