Re: Running 3.4 branch ZooKeeper on Linux with iptables

2016-08-31 Thread Guy Laden
. On Wed, Aug 31, 2016 at 9:27 PM, Guy Laden wrote: > I may be misunderstanding something but to the best of my knowledge the > situation is that if you are running ZooKeeper on Linux+Iptables then > > - If you run 3.5 or later then be sure to enable the TCP keepalive flags > >

Re: Running 3.4 branch ZooKeeper on Linux with iptables

2016-08-31 Thread Guy Laden
:22 AM, Patrick Hunt wrote: > I've not seen this but I remember Kishore mentioning they had run with > iptable based testing at some point, Kishore any insight? > > Patrick > > On Thu, Aug 25, 2016 at 8:10 AM, Guy Laden wrote: > > > Is anybody running 3.4 branch

Re: Working around Leader election Listner thread death

2016-08-31 Thread Guy Laden
ance that a bad > server is elected more than once along the process, which induces multiple > leader election rounds. > >>>> > >>>> Perhaps this is overkill, but I was wondering if it makes sense to > filter election traffic to and from bad servers using, for ex

Re: Working around Leader election Listner thread death

2016-08-29 Thread Guy Laden
orum support. Once it loses quorum support, you > have a quorum of healthy servers and they will elect a new, healthy leader. > At the point, you can bounce all your unhealthy servers. > > You may also want to move to a later 3.4 release. > > -Flavio > > > On 24 Aug 2016,

Running 3.4 branch ZooKeeper on Linux with iptables

2016-08-25 Thread Guy Laden
Is anybody running 3.4 branch ZooKeeper on Linux with iptables? We are running 3.4.6 and have run into conntrack silently expiring the leader election connections after they are idle for 5 days. (/proc/sys/net/netfilter/nf_conntrack_tcp_timeout_established) We then see leader election on some mach

Working around Leader election Listner thread death

2016-08-24 Thread Guy Laden
Hi all, It looks like due to a security scan sending "bad" traffic to the leader election port, we have clusters in which the leader election Listener thread is dead (unchecked exception was thrown and thread died - seen in the log). (This seems to be fixed by fixed in https://issues.apache.org/ji

Re: Simulate expired connection for testing

2016-06-23 Thread Guy Laden
Hi Irfan, Does the ZooKeeper cluster that you want to run your clients against enable you to connect the servers via JMX? If yes then you could retrieve the list of clients connected to the server (via JMX) and then invoke terminateConnection or terminateSession on the connection of interest to you

Re: garbage collector choice and tuning

2016-05-16 Thread Guy Laden
it is such an > important issue that I suggest we create a page for it, but feel free to > suggest a different path, I've suggested it because I get this question > outside the list a lot and I've seen a lot of insight in the mail thread. > > -Flavio > > > > On

Re: garbage collector choice and tuning

2016-05-16 Thread Guy Laden
Hi Flavio, Not sure if that was addressed to me? I noticed the GC section in *https://cwiki.apache.org/confluence/display/ZOOKEEPER/Troubleshooting * (I assume this is more recent than the pages at wiki.apache.org) Perhaps addin

Re: garbage collector choice and tuning

2016-05-15 Thread Guy Laden
Maugli, Thanks so much for taking the time to write at length, and for the great pointers. Thanks all for sharing.

garbage collector choice and tuning

2016-05-13 Thread Guy Laden
Hi, We are considering CMS vs G1 for ZooKeeper running under Oracle JDK8. The expected heap size is 4-6GB. How workload-specific is this choice in your opinion and it what ways? E.g. if many short sessions prefer G1, etc... Has anybody had experience they're willing to share regarding this? We'd al

dynamic enabling debug log messages

2015-05-26 Thread Guy Laden
Is there a way to support dynamic enabling of DEBUG/TRACE level methods for a running cluster? I ask in particular for 3.4.6. I see this is supported in log4j 2 (via JMX) however 3.4.6 comes with log4j 1.2. Has anybody run 3.4.6 with log4j 2? Thanks, Guy

Zookeeper on VM's in public cloud

2015-04-29 Thread Guy Laden
Hi All, I wanted to get some feedback about running ZooKeeper on VM's within public clouds. If you have experience with this could you share please? What issues have you run into? Were you able to overcome the issues and how? At the end of the day, were you able to get this to work reliably? So

Re: Recommended JDK

2015-03-31 Thread Guy Laden
tml < http://www.oracle.com/technetwork/java/javase/eol-135779.html> -Flavio > On 30 Mar 2015, at 13:57, Guy Laden wrote: > > Jon, Ibrahim, > Thanks for the feedback. > > Do people have any production experience with ZK clients running on > different JDK / hardware archi

RE: Recommended JDK

2015-03-30 Thread Guy Laden
Jon, Ibrahim, Thanks for the feedback. Do people have any production experience with ZK clients running on different JDK / hardware architecture (32/64bit) from the ZK server? I looked into the regression tests some: The list of branches/platforms tested seems to be here: https://builds.apache.o

Recommended JDK

2015-03-26 Thread Guy Laden
Hi All, Are there recommended JDK / JDK-versions for running 3.4.6 in production? Alternatively, what are the regression tests run against? What do people have good experience with? Are there any that have proven problematic? Thanks, Guy