LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Mukuntan (mukunth_at_[hidden])
Date: 2007-12-03 20:15:18


Hi,

I am implementing a checkpoint replication and process migration scheme in
LAM using BLCR. I went through previous threads discussing migration and
found a lot of useful information. I would like to thank the LAM developers
and other members for their useful inputs.

I have an issue in my implementation. When I attempt to start a process on
another node, it does not attach to the lam daemon there. The exact errror
it throws is 'no LAM daemon found'. This call to kenter occurs before the
process receives any GPS information from mpirun, and before any of the gps
updates for migration can be done. Why is it unable to attach to the lam
daemon of the node it is migrated to?

Thanks,
Mukunth

-- 
Mukuntan V Viswanathan
Visit My webpage at
www.buffalo.edu/~mvv2