Re: Zookeeper leader election takes a long time.

2016-10-14 Thread Ryan Zhang
From that log, it looks like Node 3 and 2 formed a quorum (Node 1 can’t talk to Node2 for some reason). Even if Node 3 didn’t hear from Node 2 after 12:46, it is still a special case as we had a kinda network partition for about 10 mins. I wonder if you can repro this in an environment that No

Re: Zookeeper leader election takes a long time.

2016-10-14 Thread Anand Parthasarathy
Hi Michael and Ryan, Thanks for looking into the logs. Actually, in this case, node 2 was brought down ungracefully and was down permanently. In this customer deployment, our product runs as a docker container in a bare metal environment. It is possible that the docker proxy that provides externa

Re: outstandingChanges queue grows without bound

2016-10-14 Thread Mike Solomon
Thanks for the comments - I'll incorporate them in a future fix. There is actually a flaw in this code as it's currently implemented - it does not match the original behavior and I need to think more carefully. Arshad, I think ZOOKEEPER-2570 is a somewhat different issue. The root cause in both c