Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-04 Thread Josh Hursey
etc. Not saying it cannot be done - just indicating that there were reasons why it wasn't initially done other than "we just didn't get around to it". :-) (If I had to guess, I think the user is asking because some other MPI implementations implement this kind of beh

Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-02 Thread Ralph Castain
"our implementation prohibits this."  :-) My fault - should have been clearer.   From: devel-boun...@open-mpi.org [mailto:devel-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Friday, June 02, 2006 9:12 AM To: Open MPI Developers Subject:

Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-02 Thread Jeff Squyres (jsquyres)
t; -Original Message- > From: devel-boun...@open-mpi.org > [mailto:devel-boun...@open-mpi.org] On Behalf Of Pak Lui > Sent: Friday, June 02, 2006 11:37 AM > To: r...@lanl.gov; Open MPI Developers > Subject: Re: [OMPI devel] SIGSTOP and SIGCONT on orted > > I agree that

Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-02 Thread Jeff Squyres (jsquyres)
riday, June 02, 2006 9:12 AM To: Open MPI Developers Subject: Re: [OMPI devel] SIGSTOP and SIGCONT on orted Jeff Squyres (jsquyres) wrote: Just curious -- what's difficult about this? SIGTSTP and SIGCONT can be caught; is the

Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-02 Thread Pak Lui
tting the stop to freeze the application processes. Thanks! -------- *From:* devel-boun...@open-mpi.org [mailto:devel-boun...@open-mpi.org] *On Behalf Of *Ralph Castain *Sent:* Thursday, June 01, 2006 10:50 PM *To:

Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-02 Thread Ralph Castain
  (If I had to guess, I think the user is asking because some other MPI implementations implement this kind of behavior)   Thanks! From: devel-boun...@open-mpi.org [mailto:devel-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Thursday, June 01, 2006 10:50 PM To:

Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-02 Thread Jeff Squyres (jsquyres)
MPI implementations implement this kind of behavior) Thanks! From: devel-boun...@open-mpi.org [mailto:devel-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Thursday, June 01, 2006 10:50 PM To: Open MPI Developers Subject: R

Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-01 Thread Ralph Castain
Actually, there were some implementation issues that might prevent this from working and were the reason we didn't implement it right away. We don't actually transmit the SIGTERM - we capture it in mpirun and then propagate our own "die" command to the remote processes and daemons. Fortunately,

Re: [OMPI devel] SIGSTOP and SIGCONT on orted

2006-06-01 Thread Jeff Squyres (jsquyres)
The main reason that it doesn't work is because we didn't do any thing to make it work. :-) Specifically, mpirun is not intercepting SIGSTOP and passing it on to the remote nodes. There is nothing in the design or architecture that would prevent this, but we just don't do it [yet]. > -Orig