Hello,
I've successfully installed blcr-0.2.3 and lam-7.1.1, but I can't
checkpoint my MPI applications.
See the output of 'laminfo' attached. It tells me blcr configuration
is ok.
Another file attached is the application I try to checkpoint. I
compile this accplication with
$ mpicc hello.c -L/usr/local/blcr-0.2.3/lib -lcr -o hello
I run the application with
$ mpirun C ./hello
After I do a
$ cr_checkpoint <pid_of_mpirun>
I get only one context file, and I can't restart from it:
$ cr_restart context.<pid_of_mpirun>
mpirun (rpwait): Bad file descriptor
Can somebody tell me what I've probably done wrong?
Thanks, regards,
Zoltan
LAM/MPI: 7.1.1
Prefix: /usr/local/lam-7.1.1
Architecture: i686-pc-linux-gnu
Configured by: zfarkas
Configured on: Fri Oct 8 00:00:24 CEST 2004
Configure host: debian
Memory manager: ptmalloc2
C bindings: yes
C++ bindings: yes
Fortran bindings: yes
C compiler: gcc
C++ compiler: g++
Fortran compiler: g77
Fortran symbols: double_underscore
C profiling: yes
C++ profiling: yes
Fortran profiling: yes
C++ exceptions: no
Thread support: yes
ROMIO support: yes
IMPI support: no
Debug support: no
Purify clean: yes
SSI boot: globus (API v1.1, Module v0.6)
SSI boot: rsh (API v1.1, Module v1.1)
SSI boot: slurm (API v1.1, Module v1.0)
SSI coll: lam_basic (API v1.1, Module v7.1)
SSI coll: shmem (API v1.1, Module v1.0)
SSI coll: smp (API v1.1, Module v1.2)
SSI rpi: crtcp (API v1.1, Module v1.1)
SSI rpi: lamd (API v1.0, Module v7.1)
SSI rpi: sysv (API v1.0, Module v7.1)
SSI rpi: tcp (API v1.0, Module v7.1)
SSI rpi: usysv (API v1.0, Module v7.1)
SSI cr: blcr (API v1.0, Module v1.1)
SSI cr: self (API v1.0, Module v1.0)
|