Hello,
I installed LAM-7.0 checkpointing module with following configuration.
./configure --without-fc --with-rpi=crtcp
But when I excuted examples/trivial, I got the following message:
hjkim_at_can33 trivial]$ mpirun -np 2 -ssi rpi crtcp -ssi cr blcr trivialc
------------------------------------------------------------------------
-----
The "blcr" module requested in the CR kind was not found.
This typically means that you misspelled the desired module name, or
used
the wrong name entirely.
------------------------------------------------------------------------
-----
------------------------------------------------------------------------
-----
The crmpi SSI subsystem failed to open during MPI_INIT. This is a
fatal error; I must abort.
This occurred on host can33.kaist.ac.kr (n0).
The PID of failed process was 15898 (MPI_COMM_WORLD rank: 0)
------------------------------------------------------------------------
-----
------------------------------------------------------------------------
-----
It seems that [at least] one of the processes that was started with
mpirun did not invoke MPI_INIT before quitting (it is possible that
more than one process did not invoke MPI_INIT -- mpirun was only
notified of the first one, which was on node n0).
mpirun can *only* be used with MPI programs (i.e., programs that
invoke MPI_INIT and MPI_FINALIZE). You can use the "lamexec" program
to run non-MPI programs over the lambooted nodes.
------------------------------------------------------------------------
-----
How can I use blcr module? Please help. Thank you.
Hojoong Kim
|