Hi,
This is my first post to the community, and I am also new
to LAM/MPI, so apologies if this is a stupid question or if I
am missing something.
Our aim is to set up a LAM/MPI PC cluster on Xeon
processors. The code is already developed in C. For now we are
just porting the application to the PC cluster; the
MPI library will be used later.
There are 4 nodes, with Linux enterprise edition
installed.
LAM 7.1.1 is installed, and the sample programs run
fine without problems.
The C code does job scheduling for some calculations.
I start LAM with lamboot -v lamhosts.
The compiler is mpicc.
I am using execve() to run my mpirun command from within
the C code. One more thing: I have not used any MPI
functions other than MPI_Init() and MPI_Finalize() (do I
really need them?). Moreover, this code is like a daemon
process: it never exits unless there is an error. I start
this code on all 4 nodes, and it communicates with the
server.
The command is: mpirun -np 4 -v /.../mdrun_mpi -s...output files.
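
For reference, the launch step described above could be sketched roughly as follows. This is only a minimal illustration, not the actual project code: the helper name run_and_wait is hypothetical, and it assumes the launcher forks before calling execve() so that it can collect the child's exit status with waitpid().

```c
#define _POSIX_C_SOURCE 200112L
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

extern char **environ;

/* Hypothetical helper (not part of LAM/MPI): run argv[0] with execve()
 * in a child process and return its exit status.
 * Returns 127 if execve() itself failed in the child, and -1 if fork()
 * or waitpid() failed or the child was killed by a signal. */
int run_and_wait(char *const argv[])
{
    assert(argv != NULL && argv[0] != NULL);

    pid_t pid = fork();
    if (pid < 0) {
        perror("fork");
        return -1;
    }
    if (pid == 0) {
        /* Child: execve() only returns if it could not start the program. */
        execve(argv[0], argv, environ);
        perror("execve");
        _exit(127);
    }

    /* Parent: wait for the child and decode how it ended. */
    int status = 0;
    if (waitpid(pid, &status, 0) < 0) {
        perror("waitpid");
        return -1;
    }
    if (WIFEXITED(status))
        return WEXITSTATUS(status);
    return -1;
}
```

Note that execve() does not search PATH, so argv[0] has to be the full path of the program to start. Checking the returned value in the launcher shows whether the started command itself reported a failure.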
My project works fine on 1 node, but as soon as
I increase the number of nodes, this error message
appears:
-----------------------------------------------------------------------------
One of the processes started by mpirun has exited with a nonzero exit
code. This typically indicates that the process finished in error.
If your process did not finish in error, be sure to include a "return
0" or "exit(0)" in your C code before exiting the application.
PID 14322 failed on node n1 (192.168.1.5) with exit status 1.
_________________________________________
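
The message above names a PID and a node but not the reason for the failure. One possible way to narrow it down, sketched here under assumptions, is to have every error path in the application print a line tagged with the host name and PID before exiting, so that the line can be matched against the PID and node that mpirun reports. The helper names format_failure and die are hypothetical and are not part of LAM/MPI or of the application.

```c
#define _POSIX_C_SOURCE 200112L
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

/* Hypothetical helper: write "host[pid]: reason" into buf.
 * Returns the value of snprintf(), i.e. the length the full line needs. */
int format_failure(char *buf, size_t len, const char *reason)
{
    assert(buf != NULL && reason != NULL);

    char host[256];
    if (gethostname(host, sizeof host) != 0)
        strcpy(host, "unknown-host");
    host[sizeof host - 1] = '\0';  /* gethostname() may not terminate on truncation */

    return snprintf(buf, len, "%s[%ld]: %s", host, (long)getpid(), reason);
}

/* Hypothetical helper: report the reason on stderr, then exit with a
 * nonzero status (EXIT_FAILURE). */
void die(const char *reason)
{
    char line[512];
    format_failure(line, sizeof line, reason);
    fprintf(stderr, "%s\n", line);
    exit(EXIT_FAILURE);
}
```

Calling die("could not open input file") instead of a bare exit(1) would then leave a line on stderr showing which host and process gave up, and why.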
The output of laminfo etc. is all as described in the documentation.
Please help me out.
Regards
Anuj