Re: ProcessReaper: single thread reaper

roger riggs Fri, 11 Apr 2014 09:51:44 -0700

Hi Peter,

We do know the PIDs of the processes that we care about but are unwilling
to pay the cost of waiting for them individually.

For the escapees, Process could resort to an individual thread invoking
waitpid(n).


Thanks, Roger


On 4/11/2014 10:52 AM, Peter Levart wrote:

On 04/09/2014 07:02 PM, Martin Buchholz wrote:
On Tue, Apr 8, 2014 at 11:08 PM, Peter Levart <peter.lev...@gmail.com> wrote:
    Hi Martin,

    As you might have seen in my later reply to Roger, there's still
    hope on that front: setpgid() + waitpid(-pgid, ...) might be the
    answer. I'm exploring in that direction. Shells are doing it, so
    why can't JDK?

    It's a little trickier for Process API, since I imagine that
    shells form a group of processes from a pipeline which is known
    in advance while Process API will have to add processes to the
    live group dynamically. So some races will have to be resolved,
    but I think it's doable.
This is a clever idea, and it's arguably better to design
subprocesses so they live in separate process groups (emacs does
that), but:
Every time you create a process group, you change the effect of a
user signal like Ctrl-C, since it's sent to only one group.
Maybe propagate signals to the subprocess group? It's starting to
get complicated...

Hi Martin,
Yes, shells send Ctrl-C (SIGINT) and other terminal-initiated signals
to a (foreground) process group. A process group is formed from a
pipeline of interconnected processes. Each pipeline is considered to
be a separate "job", hence shells call this feature "job control".
Child processes by default inherit the process group from their
parent, so children born with Process API (and their children) inherit
the process group from the JVM process. Considering the intentions of
shell job control, is propagating SIGTERM/SIGINT/SIGTSTP/SIGCONT
signals to children spawned by Process API desirable? If so, then yes,
the JVM would have to handle those signals and propagate them to the
current process group that contains all children spawned by Process
API and their descendants. That problem would certainly have to be
addressed. But let's first see what I found out about
sigaction(SIGCHLD, ...), setpgid(pid, pgid), waitpid(-pgid, ...), etc...

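The propagation step raised above can be sketched in C. This is only an illustration under stated assumptions, not anything from the JDK: the helper names spawn_in_new_group() and forward_to_group() are hypothetical, and the sketch relies on the POSIX rule that kill() with a negative pid signals every process in the group whose ID is the absolute value.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <signal.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: fork a child that leads its own process group and
 * then sleeps until a signal arrives. Returns the child's pid, which is
 * also its process group ID, or -1 if fork() fails. */
static pid_t spawn_in_new_group(void)
{
    pid_t pid = fork();
    if (pid == 0) {
        setpgid(0, 0);      /* child: become leader of a new group */
        for (;;)
            pause();        /* block until a signal is delivered */
    }
    if (pid > 0)
        setpgid(pid, pid);  /* parent makes the same call, so the group
                               exists no matter which side runs first */
    return pid;
}

/* Hypothetical helper: forward sig to every process in group pgid.
 * Returns 0 on success, -1 with errno set on failure. */
static int forward_to_group(pid_t pgid, int sig)
{
    assert(pgid > 1);       /* guard against kill(0, ...) or kill(-1, ...) */
    return kill(-pgid, sig);
}
```

A parent that has caught SIGINT or SIGTERM itself could call forward_to_group() once per child group from ordinary code; whether that is the desired policy is exactly the open question in the paragraph above.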
waitpid(-pgid, ...) alone does not seem to be enough for our task,
mainly because a process can reassign its group and join some other
group. I don't know if this situation occurs in the real world, but
imagine that we have one live child process in a process group pgid1
and no unwaited exited children. If we issue:
    waitpid(-pgid1, &status, 0);
Then this call blocks, because at the time it was issued there were >0
child processes in the pgid1 group and none of them had exited yet.
Now if this one child process changes its process group with:
    setpgid(0, pgid2);
Then the waitpid call in the parent does not return (maybe this is a
bug in Linux?) although there are no live child processes left in
the pgid1 group. Even when this child exits, the call to waitpid
does not return, since this child is not in the group we are waiting
for when it exits. If all our children "escape" the group in this way,
the thread doing the waiting will never unblock. To solve this, we can
employ signal handlers. In a signal handler for the SIGCHLD signal we
can invoke:
    waitpid(-pgid1, &status, WNOHANG); // non-blocking call
...in a loop until it returns either 0, which means that there are no
more unwaited exited children in the group at the moment, or -1 with
errno == ECHILD, which means that there are no children left in the
queried group at all - the group does not exist any more. Since the
signal handler is invoked with SIGCHLD masked and there is one
bit of pending signal state in the kernel, no child exit can be
"skipped" this way. Unless the child "escapes" by changing its group.
I don't know of a plausible reason for a program to change its
process group. If a program executing as a JVM child wants to become a
background daemon, it usually behaves as follows:
- fork()s a grand-child and then exit()s (so we get notified via
  signal and waitpid(-pgid, ...) successfully returns its exit
  status)
- the grand-child then changes its session and group (becomes session
  and group leader), closes file descriptors, etc. The responsibility
  for waiting on the grand-child daemon is transferred to the init
  process (pid=1) since the grand-child becomes an orphan (has no parent).

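The two daemonizing steps listed above can be sketched in C as follows. This is a hedged illustration, not code from any real daemon: spawn_daemon_and_reap_child() and its daemon_lifetime_secs parameter are hypothetical, sleep() stands in for the daemon's actual work, and only the three standard descriptors are closed.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: run the double-fork sequence and return the exit
 * status of the intermediate child as seen by the caller, or -1 on error. */
static int spawn_daemon_and_reap_child(unsigned daemon_lifetime_secs)
{
    pid_t child = fork();
    if (child < 0)
        return -1;

    if (child == 0) {
        pid_t grandchild = fork();
        if (grandchild < 0)
            _exit(1);
        if (grandchild > 0)
            _exit(0);                /* intermediate child exits at once */

        /* Grandchild: start a new session, which also makes it the leader
         * of a new process group, then drop the inherited descriptors. */
        if (setsid() < 0)
            _exit(1);
        close(STDIN_FILENO);
        close(STDOUT_FILENO);
        close(STDERR_FILENO);
        sleep(daemon_lifetime_secs); /* stand-in for the daemon's work */
        _exit(0);
    }

    /* Caller: only the intermediate child is ours to wait for. */
    int status = 0;
    if (waitpid(child, &status, 0) != child)
        return -1;
    assert(WIFEXITED(status) || WIFSIGNALED(status));
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

From the caller's point of view the whole exchange looks like one short-lived child that exits with status 0; the grand-child never appears in any wait call made by the original parent.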
Ignoring this still unsolved problem of a possibly ill-behaved child
program that changes its process group, I started constructing a
proof-of-concept prototype. What I will do in the prototype is start
throwing IllegalStateException from the methods of the Process API
that pertain to such children. I think this is reasonable.

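For reference, the non-blocking reaping loop described earlier could look roughly like this in C. It is a sketch under assumptions and not the prototype itself: reap_group() and its group_gone out-parameter are hypothetical names, and a real reaper would record each pid and status instead of just counting.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <errno.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: collect every already-exited child in process group
 * pgid without blocking. Returns the number of children reaped and sets
 * *group_gone to 1 when waitpid() fails with ECHILD, meaning the caller has
 * no children left in that group. */
static int reap_group(pid_t pgid, int *group_gone)
{
    int reaped = 0;
    int saved_errno = errno;    /* a signal handler must not clobber errno */

    assert(pgid > 1);           /* debug-only precondition; build with
                                   NDEBUG when calling from a handler */
    *group_gone = 0;
    for (;;) {
        int status;
        pid_t pid = waitpid(-pgid, &status, WNOHANG);
        if (pid > 0) {
            reaped++;           /* a real reaper would store pid and status */
            continue;
        }
        if (pid == 0)
            break;              /* children remain, none has exited yet */
        if (errno == EINTR)
            continue;           /* interrupted, simply retry */
        if (errno == ECHILD)
            *group_gone = 1;    /* no children left in this group */
        break;
    }
    errno = saved_errno;
    return reaped;
}
```

Note that a child which has moved to another group is invisible to this loop: once every remaining child has escaped, the call reports ECHILD even though those children are still running, which is the unsolved case mentioned above.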
Stay tuned,

Peter
