Hello all,
I am attempting to run a parallel program on a 5-machine cluster and am
experiencing incredibly slow execution (roughly 10,000 times slower than
on a single machine). The basic program structure consists of 4
slaves performing a computation and sending their results to a master
which then does some analysis and writes these results to a file. The
results are passed as a struct which can accommodate 168 kB but is usually
filled with 55 kB. Is this too large? As I add machines to the cluster,
the execution time increases dramatically, and CPU load averages drop to
virtually nothing.
Thanks,
Brian Powell