On Jun 21, 2005, at 5:19 PM, Ross Heikes wrote:
> We are running a model on APPLE x Serve using Lam 7.1.1 . WE are
> having hanging problems and using totalview we found out that the
> module -sched_yield has the error. Just want to know weather this
> module is defined in LAM source code or is ti APPLE defined module
Hi Ross -
I'm not exactly sure what you mean. LAM probably calls the system
function sched_yield in various places (especially if you are using the
shared memory transports). But that shouldn't be causing any problems
- that code is fairly well tested. It's possible if the shared memory
RPI is polling a lot (lots of pending communication), that you will
frequently see sched_yield() in the call stack, as it's where a process
is going to be blocking waiting for something to happen. Could you go
into a bit more detail about what you are seeing?
By the way, you should either upgrade to LAM 7.1.2b22 or make sure you
are using the SYSV or TCP rpis (mpirun -ssi rpi sysv) instead of the
default USYSV rpi. There is a bug in the USYSV rpi on the Apple G5
multi-cpu machines that can cause data corruption in certain
situations.
Hope this helps,
Brian
--
Brian Barrett
LAM/MPI developer and all around nice guy
Have a LAM/MPI day: http://www.lam-mpi.org/
|