We are running lam-6.5.9 and our application is getting this error
message (seems to also happen on lam-7.x):
MPI_Comm_dup: internal MPI error: out of descriptors (rank 0, comm 4087)
>From looking at the source code I can see there is a limit of about 4096
or so communicators. The thing is, we have checked carefully and we
only have about 20 or so communicators at any given time -- they
regularly get created and freed.
So my question is this: is it possible that, even though we call
MPI_COMM_FREE, the communicator is not freed? I suspect an unwaited for
ISEND or IRECV somewhere, that is causing a communicator to be kept
internally after we free it. We are checking on this now, but I wonder
if there is something else that might be going on?
--
Brian Wainscott | "Our Constitution was made only for a moral and
brian_at_[hidden] | religious people. It is wholly inadequate to the
| government of any other." John Adams
|