I see what you mean by alerts on live instances. In fact there is an "onLiveInstanceChange" under GenericHelixController ( http://helix.incubator.apache.org/apidocs/reference/org/apache/helix/controller/GenericHelixController.html )
The question is can i register for an alert to myself? If the agent that is being alerted is the one that loses connection to zk, does the alert trigger? More importantly, it seems that setting an alert for onLiveInstanceChange happens when the zookeeper session expires(in which case master controller already remaps), and not immediately when a zk connection falters (but ephemeral node on LIVEINSTANCES is still there). I was hoping to get an alert not when the ephemeral node expires but immediately right when a zk connection falters. Thank you Lance On Tue, Jul 23, 2013 at 6:00 PM, Shi Lu <[email protected]> wrote: > Hi Lance: > > The helix controller exposes jmx beans that reflects the number of > liveInstances under the jmx domain ClusterStatus:cluster=<clusterName>, in > which it will report > number of down instances, disabled instancesand disabled partitions. You > can set alerts on those jmx beans. > > > > > On Tue, Jul 23, 2013 at 2:32 PM, Lance Co Ting Keh <[email protected]> wrote: > >> Hi guys, >> >> I was trying to look for how I can most cleanly get alerted when a helix >> participant temporary and permanently loses its session with Zookeeper. >> What is the best way to do this? >> >> >> Sincerely, >> Lance >> > >
