LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Brian Barrett (brbarret_at_[hidden])
Date: 2008-07-05 13:20:31


On Jul 5, 2008, at 11:03 AM, Endee wrote:

> Dear respected members,
> Hi,
> I am newb to LAM and mpi and am facing following problem. I request
> you to guide me.
> I have been using LAM without any trouble for my mpi code until
> recently. My script uses PBS and is as follows:
> #!/bin/sh
> #PBS -l nodes=2:ppn=2
> #PBS -l walltime=100:00:00
> lamboot -v
> cd /data/rundir
> mpirun -ssi rpi sysv -np 4 program_exe <input > output
>
> Without actually changing anything, now I am getting a message with
> this script :
>
> n0<17669> ssi:boot:base:linear_windowed: booting n0 (node18)
> n0<17669> ssi:boot:base:linear_windowed: booting n1 (node17)
> -----------------------------------------------------------------------------
> The selected RPI failed to initialize during MPI_INIT. This is a
> fatal error; I must abort.
> This occurred on host node18 (n0).
> The PID of failed process was 17675 (MPI_COMM_WORLD rank: 0)
> -----------------------------------------------------------------------------

It seems like you covered most of the initial issues. If you add the
argument

   -ssi rpi_verbose 1

to the mpirun command line, you'll get a bunch of debugging
information from the rpi while it initializes. This might help lead
us to the issue.

Brian

-- 
   Brian Barrett
   LAM/MPI Developer
   Make today a LAM/MPI day!