We have an application which uses lam-mpi.
Say our application has one parent executable. And 2 child executables.
Parent executable will pass the information to child, and get back the
result to parent through mpi world.
We have one machine on which 1 parent, 32 child processes are running. In
this case, there is no issue for long time (up to 8hrs.)
As we brought high configuration machines(2 nos), we changed the above
configuration little bit. Parent process will be running on machine 1, and
80 child processes will be launched on machine 1, and 80 child processes
will be launched on machine2.(Earlier both parent, child are running on
single machine).
With the later scenario (160 child processes) our application is getting
crashed after 1hr. (earlier there is no issue up to 8hrs)
Why this is happening?
Our latest server configuration is:
H/w(in each machine)
16GB RAM., 1000GB HardDisk,
Intel 64-bit processors (16 processors totally)
O/s : Redhat Linux Enterprise Server 5.
Oracle 10gRelease2
Lam: lam-7.0.6
Kindly reply soon.
Thanks& Regards
SrkRaju
______________________________________________________________________________
>
> DISCLAIMER
>
> The information contained in this e-mail message and/or attachments to it
> may
> contain confidential or privileged information. If you are not the
> intended
> recipient, any dissemination, use, review, distribution, printing or
> copying
> of the information contained in this e-mail message and/or attachments to
> it
> are strictly prohibited. If you have received this communication in error,
> please notify us by reply e-mail or directly to netsupport_at_[hidden] or
> telephone and immediately and permanently delete the message and any
> attachments. Thank you.
>
>
> ______________________________________________________________________________
>
> This email has been scrubbed for your protection by SecureMX.
> For more information visit http://securemx.in
> ______________________________________________________________________________
>
>
______________________________________________________________________________
DISCLAIMER
The information contained in this e-mail message and/or attachments to it may
contain confidential or privileged information. If you are not the intended
recipient, any dissemination, use, review, distribution, printing or copying
of the information contained in this e-mail message and/or attachments to it
are strictly prohibited. If you have received this communication in error,
please notify us by reply e-mail or directly to netsupport_at_[hidden] or
telephone and immediately and permanently delete the message and any
attachments. Thank you.
______________________________________________________________________________
This email has been scrubbed for your protection by SecureMX.
For more information visit http://securemx.in
______________________________________________________________________________
|