Steve Loughran created HADOOP-9086:
--------------------------------------
Summary: Enforce process singleton rules through an exclusive
write lock on a file, not a pid file +kill -0,
Key: HADOOP-9086
URL: https://issues.apache.org/jira/browse/HADOOP-9086
Project: Hadoop Common
Issue Type: Improvement
Components: util
Affects Versions: 1.1.1, 2.0.3-alpha
Environment: Unix/Linux.
Reporter: Steve Loughran
The {{hadoop-daemon.sh}} script (and other liveness monitors) probe for the
existence of a daemon service by running {{kill -0}} against a process id read
from a pid file.
This is flawed:
# Pid file locations may change between installations.
# Linux and Unix recycle pids, leading to false positives: the scripts think
the daemon is running when a different process now holds its pid.
# It doesn't work on Windows.
Having the processes acquire an exclusive write lock on a known file would
delegate lock management, and implicitly liveness detection, to the OS itself.
When the process dies, the lock is released (on Unixes).
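
A minimal sketch of the idea in Java, using the standard {{java.nio}} file-locking API. The {{SingletonLock}} class and its {{tryAcquire}} method are hypothetical names made up for illustration; they are not existing Hadoop code, and the lock file path is left to the caller.

```java
import java.io.IOException;
import java.nio.channels.FileChannel;
import java.nio.channels.FileLock;
import java.nio.channels.OverlappingFileLockException;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

/** Hypothetical helper (not part of Hadoop): holds an exclusive lock on a file. */
final class SingletonLock implements AutoCloseable {
    private final FileChannel channel;
    private final FileLock lock;

    private SingletonLock(FileChannel channel, FileLock lock) {
        this.channel = channel;
        this.lock = lock;
    }

    /**
     * Tries to take an exclusive lock on the whole file without blocking.
     * Returns the held lock, or null if someone else already holds it.
     */
    static SingletonLock tryAcquire(Path lockFile) throws IOException {
        // An exclusive lock requires a channel that is open for writing.
        FileChannel channel = FileChannel.open(lockFile,
                StandardOpenOption.CREATE, StandardOpenOption.WRITE);
        FileLock lock;
        try {
            // tryLock() returns null when another process holds the lock.
            lock = channel.tryLock();
        } catch (OverlappingFileLockException e) {
            // The lock is already held elsewhere inside this same JVM.
            lock = null;
        } catch (IOException e) {
            channel.close();
            throw e;
        }
        if (lock == null) {
            channel.close();
            return null;
        }
        return new SingletonLock(channel, lock);
    }

    /** Releases the lock explicitly; the OS also drops it if the process dies. */
    @Override
    public void close() throws IOException {
        try {
            lock.release();
        } finally {
            channel.close();
        }
    }
}
```

A daemon would call {{tryAcquire}} once at startup, exit if it returns null, and keep the returned object for its whole lifetime. Java file locks are held on behalf of the entire JVM, and the Java documentation recommends using a single channel per file for locking within one program, so this helper is not meant to be called repeatedly on the same file inside one process.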
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira