This is a known issue - I'll test to see if it has been fixed for the upcoming 1.3.4. We know the problem does not exist in our devel trunk, but I don't know if the fix propagated to the 1.3 branch.

On Aug 26, 2009, at 3:40 PM, Tim Miller wrote:

Hello Everyone,

I have a problem that I can't seem to figure out from searching the mailing list archive. I have a code that repeatedly spawns (via MPI_COMM_SPAWN) a group of 8 processes and then waits for them to finish. The problem is that OpenMPI (I've tried 1.3.1 and 1.3.3) opens a pipe each time MPI_COMM_SPAWN is called, but this pipe never gets closed (even if I call MPI_COMM_FREE on both the parent and child). Therefore, eventually my master process hits the limit of open file descriptors and cannot spawn any more. I cannot figure out why this is happening since I am only spawning 8 jobs at a time.

I should note that I don't exchange any data over the child intercommunicators -- they exist solely so I can call MPI_BARRIER on them (the spawned processes are programed to call MPI_BARRIER directly before they call MPI_FINALIZE, thus indicating that they are done doing their work).

I'd appreciate any suggestions as to what I might be doing wrong with this that is causing OpenMPI to hold the pipes open.

Thanks,
Tim M.
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users

Reply via email to