Sims, James S. Dr. wrote:
> Thanks Mac. I think this helps. I am running the 64 bit version,
> but here is a detailed comparison of what works and what doesn't.
> If I do a qsub -I -l nodes=1:ppn=2
> lamboot
> mpirun -np 2 MPI_li_64
> in the torque/pbs environment, the code dies with
> PID 10261 failed on node n0 (10.2.1.54) due to signal 11.
>
> If on the other hand, I don't use torque but run the same
> example,
> mpirun -np 2 MPI_li_64, the job runs. So I think it is
> something about the PBS environment that is causing the
> problem.
You would normally set PATH and LD_LIBRARY_PATH in your PBS script,
so that the job picks up the MPI you need. Lately I've been in the
situation where each phase of my PBS job requires a different MPI, so
it seems normal to wipe and set a new path before each mpirun.
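
Something along these lines is what I mean. Treat it as a rough sketch, not a tested recipe: the install prefixes (/opt/lam-64, /opt/other-mpi), the BASE_PATH value, the use_mpi helper and the phase2 binary are all made up for illustration, so substitute whatever your cluster actually uses. The mpirun lines are left commented out so the path handling can be checked on its own first.

```shell
#!/bin/sh
#PBS -l nodes=1:ppn=2

# Hypothetical install prefixes; replace with the real locations.
LAM_PREFIX=/opt/lam-64
OTHER_MPI_PREFIX=/opt/other-mpi

# Minimal system path that every phase starts from (an assumption).
BASE_PATH=/usr/local/bin:/usr/bin:/bin

# Wipe PATH and LD_LIBRARY_PATH and point both at a single MPI install.
use_mpi() {
    PATH="$1/bin:$BASE_PATH"
    LD_LIBRARY_PATH="$1/lib"
    export PATH LD_LIBRARY_PATH
}

# Phase 1: the code built against the first MPI.
use_mpi "$LAM_PREFIX"
# lamboot
# mpirun -np 2 MPI_li_64

# Phase 2: a different MPI (phase2 is a hypothetical binary).
use_mpi "$OTHER_MPI_PREFIX"
# mpirun -np 2 ./phase2
```

Running "which mpirun" and "echo $LD_LIBRARY_PATH" inside the job right before the mpirun that dies should show whether the PBS environment is picking up a different MPI than your login shell does.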