LAM/MPI General User's Mailing List Archives

From: Robin Humble (rjh_at_[hidden])
Date: 2004-12-14 18:04:27


A Xeon cluster with InfiniBand and Topspin's hack of MPICH gets
6.3 microseconds latency, whereas LAM 7.1.1 is at about 15 microseconds.
These were measured with NetPIPE 3.6.2's NPmpi program.
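
For context, these are ping-pong latencies: as I understand it, NetPIPE bounces a small message between two endpoints and reports half the average round-trip time. A minimal Python sketch of that methodology follows. It is not NetPIPE's code, and it uses a local socket pair rather than MPI or IB, so the absolute numbers it prints are not comparable to the figures above; the function names and defaults are made up for illustration.

```python
import socket
import threading
import time


def recv_exact(sock, size):
    # Read exactly `size` bytes; a stream socket may return short reads.
    buf = bytearray()
    while len(buf) < size:
        chunk = sock.recv(size - len(buf))
        if not chunk:
            raise ConnectionError("peer closed the connection")
        buf.extend(chunk)
    return bytes(buf)


def echo(sock, n_msgs, size):
    # "Pong" side: send every received message straight back.
    for _ in range(n_msgs):
        sock.sendall(recv_exact(sock, size))


def pingpong_latency(n_msgs=1000, size=1):
    """Return the estimated one-way latency in seconds (half the mean round trip)."""
    a, b = socket.socketpair()
    peer = threading.Thread(target=echo, args=(b, n_msgs, size))
    peer.start()
    payload = b"x" * size
    start = time.perf_counter()
    for _ in range(n_msgs):
        a.sendall(payload)      # ping
        recv_exact(a, size)     # wait for the pong
    elapsed = time.perf_counter() - start
    peer.join()
    a.close()
    b.close()
    # Each iteration is one full round trip, i.e. two one-way trips.
    return elapsed / (2 * n_msgs)


if __name__ == "__main__":
    print(f"one-way latency: {pingpong_latency() * 1e6:.1f} microseconds")
```

The point of the sketch is only that the reported number is per one-way trip, so a difference of 6.3 vs 15 microseconds is a difference in how quickly each library turns a small message around.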

Is this a known issue? It seems like it might be, judging by section 3.2.2
of the LAM User's Guide...

Curiously enough, NetPIPE's native IB program (NPib) also gets about 15
microseconds. So the Topspin MPI must be using IB differently from both
LAM and NPib.

OTOH peak bandwidth was the same with the various MPIs, and LAM handles
async messages better, so LAM is actually what we are using for real
world runs...

Another (probably naive) IB question is how best to use dual-ported
IB cards, e.g. can LAM force traffic to some nodes out one port on the
HCA and traffic to other nodes out the other port?

cheers,
robin