Dear all,
On Wed, 26 Jan 2005 12:43:55 -0800 (PST)
feldy_at_[hidden] wrote:
> I'm trying to get Pallas AlltoAll to work on a cluster of SMPs using
> LAM-7.1.1 (same behavior with 7.0.6). This is using standard tcp/ip
> via intel e1000 on-board NICs. Using dual-processor 3GHz Xeon HT
> disabled. Linux FC3
> 2.6.9-1.667smp #1 SMP Tue Nov 2 14:59:52 EST 2004 i686 i686 i386
> GNU/Linux
[...]
> the PMB-MPI1 test hangs after completing the 4Meg message size.
> It nearly always hangs, maybe one out of 10 times it will get past
> this and fail on a larger test. I presume it is stuck in the barrier
> between tests.
after a hardware upgrade of our cluster we had also a lot of trouble
with Intel e1000 on-board NICs. After a kernel downgrade from kernel
2.4.28 to 2.4.26 all problems disappeared.
May be, the root of your problems is the kernel, too.
Heiko
--
-- Schlagersänger sind junge Männer, die bei Stromausfall keine Sänger
-- mehr sind. (Danny Kaye, am. Filmschauspieler, geb. 1913)
-- Supercomputing in Magdeburg @ http://tina.nat.uni-magdeburg.de
-- Heiko Bauke @ http://www.uni-magdeburg.de/bauke
|