> I assumed that your app was having each process compute a portion of the
> calculation and then send the final results to the master for final
> combination (e.g., a typical manager/worker kind of setup). Are you
> saying that the exact same computation is performed in each process?
You're exactly right: each process computes a portion of the computation, and
parts of the computations are exchanged between "neighbour" nodes.
I already have a clue: the problem seems to be in the procedure
that computes the matrix-vector product. In fact, this is the most heavily
executed piece of code in the app. I'm going back to it ...
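
One way to isolate a routine like that is to compare a partitioned
matrix-vector product against a plain serial one on a small input. The sketch
below is only an illustration, not the app's code: it assumes a dense matrix
split into contiguous row blocks, it simulates the "processes" with a loop
rather than real message passing, and all the names in it (matvec, row_blocks,
partitioned_matvec) are made up for the example.

```python
def matvec(rows, x):
    """Serial reference: y = A x, with A given as a list of rows."""
    return [sum(a * b for a, b in zip(row, x)) for row in rows]


def row_blocks(n_rows, n_parts):
    """Split range(n_rows) into n_parts contiguous (start, stop) blocks."""
    base, extra = divmod(n_rows, n_parts)
    blocks, start = [], 0
    for p in range(n_parts):
        # The first `extra` blocks get one additional row each.
        stop = start + base + (1 if p < extra else 0)
        blocks.append((start, stop))
        start = stop
    return blocks


def partitioned_matvec(rows, x, n_parts):
    """Each simulated process computes its own row block; blocks are concatenated."""
    y = []
    for start, stop in row_blocks(len(rows), n_parts):
        y.extend(matvec(rows[start:stop], x))
    return y


# Small integer test case (assumed data, chosen so results are exact).
A = [[2, 0, 1], [1, 3, 0], [0, 1, 4], [5, 0, 0]]
x = [1, 2, 3]
reference = matvec(A, x)
for parts in (1, 2, 3, 4):
    assert partitioned_matvec(A, x, parts) == reference
```

If the partitioned result differs from the reference for some number of
parts, the block boundaries or the exchanged values are the first things to
check.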