Hey Everyone,
I'm running LAM/MPI on a 16 node cluster over TCP on two interconnects,
Gigabit Ethernet and Infiniband. My application appears to be sending
messages that are never recieved on the destination side. Is this possible?
has anyone seen similar results? This sounds crazy, I know, but I've set up
some pretty fool-proof tests to count the number of sent and recieved
messages, and the numbers contradict. Does anyone have any ideas for why
this might be?
Thanks,
Craig
|