I run my program using
mpirun -ssi rpi crtcp -ssi cr blcr -np 2 ./hello
then I use
lamcheckpoint -ssi cr blcr -pid mpirun_pid
and get following error message:
-----------------------------------------------------------------------
Encountered a failure in the SSI types while continuing from
checkpoint. Aborting in despair :-(
-----------------------------------------------------------------------
rpwait failed: Success
Checkpoint failed: no process checkpointed.
No checkpoint file is created, process is terminated on the node,
where lamcheckpoint was invoked and both process and two examples of
cr_checkpoint are in process list of second node.
LAM-MPI version is 7.1.4
BLCR version is 0.7.3
BLCR is configured with --enable-static and --enable-all-static
OS - CentiOS 5
--
With best regards
Gleb "Crazy Sage" Igumnov
|