Dear lam team,
I am trying on a IBM SP4 at ECMWF to spawn an executable from one
node to another one.
It works with lam 6.5.9 but not with 7.0 . Furthermore I test it on a
cluster of PC where it works with lam 7.0
the strange points is lamboot -v boot_schema works but it just deadlock
at the spawn time during the mpirun -c 1 .driver command.
I am quite confident to the program since it also works on FUJITSU and
NEC SX5 with native implementations.
The idea is to make the driver spawn a block executable where it is
indicated in block.where. and if this file is not there, then the
spawned block stay on the same node that the driver (what we called
intranode and is working).
Hope you should give me some clues
There is 2 athers little points lamhalt start but hang never finishing.
35:> lamhalt
LAM 7.0/MPI 2 C++/ROMIO - Indiana University
The last little point is that mpiexec is not installed
The ecmwf support try tracking the matter and they told me there is
something wrong with their perl
version :
tools/mpiexec/mpiexec -testI get
Can't locate File/Temp.pm in @INC (@INC contains: /usr/opt/perl5/lib/5.6.0/aix
/usr/opt/perl5/lib/5.6.0
/usr/opt/perl5/lib/site_perl/5.6.0/aix /usr/opt/perl5/lib/site_perl/5.6.0
/usr/opt/perl5/lib/site_perl .) at
tools/mpiexec/mpiexec line 21.
BEGIN failed--compilation aborted at tools/mpiexec/mpiexec line 21.
So some module is not found which is needed.
Return to main problem.
193:> make clean
/bin/rm -f *.o
/bin/rm -f driver block
20:> make
/hpca/tmp/us/usu/lam-7.0/LP64_underscore/bin/hcc -DSMALL -q64 -g
-c driver.c
/hpca/tmp/us/usu/lam-7.0/LP64_underscore/bin/hcc -g -o driver
-q64 driver.o -L/hpca/tmp/us/usu/lam-7.0/LP64_underscore/lib
/hpca/tmp/us/usu/lam-7.0/LP64_underscore/bin/hcc -DSMALL -q64 -g
-c block.c
/hpca/tmp/us/usu/lam-7.0/LP64_underscore/bin/hcc -g -o block
-q64 block.o -L/hpca/tmp/us/usu/lam-7.0/LP64_underscore/lib
Target "default" is up to date.
21:> lamboot -v boot_schema
LAM 7.0/MPI 2 C++/ROMIO - Indiana University
n0<180936> ssi:boot:base:linear: booting n0 (hpca2501)
n0<180936> ssi:boot:base:linear: booting n1 (hpca2502)
n0<180936> ssi:boot:base:linear: finished
22:> mpirun -v -c 1 ./driver
250944 ./driver running on n0 (o)
Driver : My rank in MPI_COMM_WORLD is : 0 over 1 proc.s
23> laminfo
LAM/MPI: 7.0
Prefix: /hpca/tmp/us/usu/lam-7.0/LP64_underscore
Architecture: powerpc-ibm-aix5.1.0.0
Configured by: usu
Configured on: Thu Jul 24 09:28:33 GMT 2003
Configure host: hpca2501
C bindings: yes
C++ bindings: yes
Fortran bindings: yes
C profiling: yes
C++ profiling: yes
Fortran profiling: yes
ROMIO support: yes
IMPI support: no
Debug support: no
Purify clean: no
SSI boot: globus (Module v0.5)
SSI boot: rsh (Module v1.0)
SSI coll: lam_basic (Module v7.0)
SSI coll: smp (Module v1.0)
SSI rpi: crtcp (Module v1.0)
SSI rpi: lamd (Module v7.0)
SSI rpi: sysv (Module v7.0)
SSI rpi: tcp (Module v7.0)
SSI rpi: usysv (Module v7.0)
|