Hi everyone,
I'm having some trouble running my program.
So far I've been testing my program running lamboot without a boot
schema file so that it runs on one node.
Today I've come to test the same program except I now supply a boot
schema with two machines listed. I assumed this wouldn't cause a problem.
However when I recompile my code and start 'lamboot mynodes' my program
gives the following error upon execution...
apps/testcode2: error while loading shared libraries: libstdc++.so.3:
cannot open shared object file: No such file or directory
-----------------------------------------------------------------------------
It seems that [at least] one of the processes that was started with
mpirun did not invoke MPI_INIT before quitting (it is possible that
more than one process did not invoke MPI_INIT -- mpirun was only
notified of the first one, which was on node n0).
mpirun can *only* be used with MPI programs (i.e., programs that
invoke MPI_INIT and MPI_FINALIZE). You can use the "lamexec" program
to run non-MPI programs over the lambooted nodes.
-----------------------------------------------------------------------------
This is strange as I've managed to compile and install the lam 7.0 code
and this requires libstdc++.so.3.
Anyhow, ls -al /usr/local/lib/libstdc++.so.3 gives...
lrwxrwxrwx 1 root root 18 Mar 5 17:31
/usr/local/lib/libstdc++.so.3 -> libstdc++.so.3.0.2
So I checked echo $LD_LIBRARY_PATH and this gives me...
/usr/local/lib:/usr/local/RTI-1.3NGv6/Linux-rh7.2-i386-gcc-3.0.2-opt-mt/lib:
Anyone know why it can't find the library??
And why does it only complain when I try to use a remote machine?
ps. the /usr/local directory of the root node is mounted on the other
machine so they both see the same /usr/local/lib.
The same applies to /home and hence $LD_LIBRARY_PATH is the same on both
machines.
I apologise if I'm missing something obvious, I just can't understand
what the problem is? :|
Thanks
--
Mike
|