Hello,
I need help figuring out the problem I have with LAM 7.1.1. We just got a 16
node dual-G5 cluster with mellanox infiniband (build_id
thca-darwin-1.05-build-001).
Even when I try to run examples/cpi program I get the following error:
xserve100:~/lampi/bin eu$ mpirun C -ssi rpi ib /nfs/home/eu/cpi
VIP(1): ERROR: VIPKL_create_mr failed !
-----------------------------------------------------------------------------
It seems registration of memory for Infiniband operation failed. We
can not register any more memory. Aborting ..
-----------------------------------------------------------------------------
rank 24 REGISTER FAILED!
Process 5 of 32 on xserve103
Thanks,
Eugene
------------------------------------------------------------------------------------
xserve100:~/lampi/bin eu$ ./laminfo
LAM/MPI: 7.1.1
Prefix: /nfs/home/eu/lampi
Architecture: powerpc-apple-darwin7.3.1
Configured by: eu
Configured on: Wed Nov 24 20:07:50 PST 2004
Configure host: xserve100
Memory manager: darwin7malloc
C bindings: yes
C++ bindings: yes
Fortran bindings: no
C compiler: gcc
C++ compiler: g++
Fortran compiler: false
Fortran symbols: none
C profiling: yes
C++ profiling: yes
Fortran profiling: no
C++ exceptions: no
Thread support: yes
ROMIO support: yes
IMPI support: no
Debug support: no
Purify clean: no
SSI boot: globus (API v1.1, Module v0.6)
SSI boot: rsh (API v1.1, Module v1.1)
SSI boot: slurm (API v1.1, Module v1.0)
SSI coll: lam_basic (API v1.1, Module v7.1)
SSI coll: shmem (API v1.1, Module v1.0)
SSI coll: smp (API v1.1, Module v1.2)
SSI rpi: crtcp (API v1.1, Module v1.1)
SSI rpi: ib (API v1.1, Module v1.0)
SSI rpi: lamd (API v1.0, Module v7.1)
SSI rpi: sysv (API v1.0, Module v7.1)
SSI rpi: tcp (API v1.0, Module v7.1)
SSI rpi: usysv (API v1.0, Module v7.1)
SSI cr: self (API v1.0, Module v1.0)
|