LAM/MPI logo

LAM/MPI General User's Mailing List Archives

  |   Home   |   Download   |   Documentation   |   FAQ   |   all just in this list

From: Brian Wainscott (brian_at_[hidden])
Date: 2006-02-01 20:13:20


We are running lam-6.5.9 and our application is getting this error
message (seems to also happen on lam-7.x):

MPI_Comm_dup: internal MPI error: out of descriptors (rank 0, comm 4087)

>From looking at the source code I can see there is a limit of about 4096
or so communicators. The thing is, we have checked carefully and we
only have about 20 or so communicators at any given time -- they
regularly get created and freed.

So my question is this: is it possible that, even though we call
MPI_COMM_FREE, the communicator is not freed? I suspect an unwaited for
ISEND or IRECV somewhere, that is causing a communicator to be kept
internally after we free it. We are checking on this now, but I wonder
if there is something else that might be going on?

-- 
Brian Wainscott |  "Our Constitution was made only for a moral and
brian_at_[hidden]  |   religious people. It is wholly inadequate to the
                |   government of any other." – John Adams