Re: [OMPI devel] memory_malloc_hooks.c and dlclose()

2006-06-01 Thread Neil Ludban
> Date: Tue, 30 May 2006 18:34:41 -0400 (EDT) > From: "Brian W. Barrett" > Subject: Re: [OMPI devel] memory_malloc_hooks.c and dlclose() > To: Open MPI Developers > Message-ID: > Content-Type: text/plain; charset="us-ascii" > > On Mon, 22 May 2006, Neil Ludban wrote: > > > I'm getting a core d

[OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-01 Thread Pak Lui
Hi, I have a question on signals. Normally when I do a SIGTERM (control-C) on mpirun, the signal seems to get handled in a way that it broadcasts to the orted and processes on the execution hosts. However, when I send a SIGSTOP to mpirun, mpirun seems to have stopped, but the processes of the

Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-01 Thread Jeff Squyres (jsquyres)
The main reason that it doesn't work is because we didn't do any thing to make it work. :-) Specifically, mpirun is not intercepting SIGSTOP and passing it on to the remote nodes. There is nothing in the design or architecture that would prevent this, but we just don't do it [yet]. > -Orig

Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-01 Thread Ralph Castain
Actually, there were some implementation issues that might prevent this from working and were the reason we didn't implement it right away. We don't actually transmit the SIGTERM - we capture it in mpirun and then propagate our own "die" command to the remote processes and daemons. Fortunately,