LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Jeff Squyres (jsquyres_at_[hidden])
Date: 2010-10-01 10:44:35


On Sep 30, 2010, at 9:15 PM, David Shochat wrote:

> Thanks for the quick reply. Would there be any API changes going to
> 7.1.4? Or going to Open MPI (assuming we're only using things that
> were available in 7.0)?

There should not be. Both Open MPI and LAM/MPI implement the standard API, so any MPI functions that you're using in LAM/MPI 7.0 should also be present / unaltered (in terms of C function signature) in LAM/MPI 7.1.x and Open MPI.

> Meanwhile, we have learned (by using truss on the sending process)
> that the failure is on the sending side (we can see a TCP failure
> followed by an unsuccessful retry) even though MPI_Bsend() is not
> returning an error status.

Weird.

Does dmesg return anything useful?

I only half-care about the solution to that question -- if you can upgrade to newer LAM or Open MPI, it's only worthwhile to pursue that question if the same problem occurs.

-- 
Jeff Squyres
jsquyres_at_[hidden]
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/