If you're using Chukwa 0.4.0, I recommend the following patches relating to fault tolerance: We've been running these for almost a year now.
https://issues.apache.org/jira/browse/CHUKWA-533 https://issues.apache.org/jira/browse/CHUKWA-534 On Thu, Jun 16, 2011 at 2:40 AM, Felix.徐 <[email protected]> wrote: > Hi,all > chukwa collector often shut down automatically when error occurs,e.g > hdfs safe mode on, does anybody know how to deal with this situation? > something like watch dog, how to use it? Thanks very much.
