Re: Please help! ZooKeeper 3.7.2 fsync-ing latency issue

2024-04-18 Thread Patrick Hunt
On Thu, Apr 18, 2024 at 9:15 AM Patrick Hunt wrote: > My experience with slow fsyncs is that it's almost always due to > contention for disk IO. I see that you tuned the snap* sizes down, which is > reasonable. You might check what ZK activity is happening during this > period?

Re: Please help! ZooKeeper 3.7.2 fsync-ing latency issue

2024-04-18 Thread Patrick Hunt
My experience with slow fsyncs is that it's almost always due to contention for disk IO. I see that you tuned the snap* sizes down, which is reasonable. You might check what ZK activity is happening during this period? Perhaps some client is hammering the cluster, have you ruled that out? I

Re: Convert Standalone zookeeper to A 3 node Quorum

2023-06-19 Thread Patrick Hunt
file as a part of initial base ensemble config as by default this > parameter value is true? So that Making 2 zk's run in replicated mode > from starting (starting one at a time) > > Regards, > Gaurav > > > On Mon, 19 Jun, 2023, 22:03 Patrick Hunt, wrote: > > > On Mo

Re: Convert Standalone zookeeper to A 3 node Quorum

2023-06-19 Thread Patrick Hunt
ar ensemble config, not a standalone. Patrick > Regards, > Gaurav > > On Mon, 19 Jun, 2023, 21:06 Patrick Hunt, wrote: > > > Two ways to do it come to mind, which I've used in the past: > > > > 1) most straightforward is to "clone" the repos for the t

Re: Convert Standalone zookeeper to A 3 node Quorum

2023-06-19 Thread Patrick Hunt
Two ways to do it come to mind, which I've used in the past: 1) most straightforward is to "clone" the repos for the two new members of the ensemble in their respective configs/datadirs. Just make sure to update the configs appropriately. Including the "myid" for each server. Then restart the

Re: Impact of Log4J security vulnerability CVE-2021-44228 on zookeeper

2021-12-18 Thread Patrick Hunt
tions are still > taking action to attempt to mitigate existing ZK installations regardless. > > Has anyone made an attempt to see how much effort would be involved in the > upgrade? Would you all be open to a pull request? > > Thanks for all of your hard work on ZK! > > ~Brent

Re: Impact of Log4J security vulnerability CVE-2021-44228 on zookeeper

2021-12-13 Thread Patrick Hunt
This issue is being tracked on ZOOKEEPER-4423. ZK 3.4 does not use log4j 2.x - all versions of zk currently use log4j 1.x. Regards, Patrick On Mon, Dec 13, 2021 at 4:02 AM Prasanna kumar < prasannakumarram...@gmail.com> wrote: > Could anyone confirm the same on 3.4 versions? > > On Sun, Dec

Re: write performance issue in 3.6.2

2021-02-21 Thread Patrick Hunt
On Sun, Feb 21, 2021 at 3:28 PM Li Wang wrote: > Hi Enrico, Sushant, > > I re-run the perf test with the data consistency check feature disabled > (i.e. -Dzookeeper.digest.enabled=false), the write performance issue of 3.6 > is still there. > > With everything exactly the same, the throughput of

Re: ZooKeeper Operator

2021-01-18 Thread Patrick Hunt
wrote: > It sounds like a good idea to document it and add relevant pointers, Pat. > > -Flavio > > > On 18 Jan 2021, at 19:00, Patrick Hunt wrote: > > > > FYI: The awesome operator list has a few including Pravega: > > https://github.com/operator-framework

Re: ZooKeeper Operator

2021-01-18 Thread Patrick Hunt
FYI: The awesome operator list has a few including Pravega: https://github.com/operator-framework/awesome-operators I've seen a few more while investigating kubebuilder, operator-sdk (rh) and the like: https://github.com/Ghostbaby/zookeeper-operator Perhaps the first thing we might consider is

Re: Correct way to check if zookeeper is running

2020-10-29 Thread Patrick Hunt
I believe it depends which version of netcat you're using, see the following for background: https://issues.apache.org/jira/browse/ZOOKEEPER-737 Regards, Patrick On Thu, Oct 29, 2020 at 6:18 AM Debraj Manna wrote: > Anyone any thoughts on this one? Same has been asked in stackoverflow also >

Re: Zookeeper access logs?

2020-06-23 Thread Patrick Hunt
There is audit capability in newer versions: https://zookeeper.apache.org/doc/r3.6.1/zookeeperAuditLogs.html Earlier versions include support for dumping the transaction log (which goes back a long way):

Re: zookeeper protocol requirements

2020-01-03 Thread Patrick Hunt
fwiw this one comes to mind: http://www.camilletalk.com/whilefalse/2012/12/building-global-highly-available.html Patrick On Fri, Jan 3, 2020 at 7:18 AM MATHIEU Christophe < christophe-f.math...@thalesgroup.com> wrote: > Hi Zookeeper users, > > I need to interconnect some servers of a zookeeper

ZK makes apache 2019 "top 5" projects

2019-12-11 Thread Patrick Hunt
This is really awesome, check it out: https://twitter.com/phunt/status/1204966326118141952 Kudos ZooKeeper community on all the hard work and efforts! Patrick

Re: Does ZK 3.4.14 support Netty 4.1.42.Final?

2019-11-25 Thread Patrick Hunt
on https://issues.apache.org/jira/browse/ZOOKEEPER-3568? > > We are currently running on 3.4.9 server and 3.4.6 client. If moving to > 3.5.6, should we upgrade the server or client first? > > Thanks, > Daniel > > -----Original Message- > From: Patrick Hunt >

Re: Does ZK 3.4.14 support Netty 4.1.42.Final?

2019-11-25 Thread Patrick Hunt
This was discussed relatively recently: https://lists.apache.org/thread.html/680038b345da49a3d5cb452de5d54d62f14d1df0747690980c218c1a@%3Cdev.zookeeper.apache.org%3E Gist is that while the identified issue didn't affect us directly folks should move to 3.5 (or don't use netty in 3.4) given 3.4 is

Re: Any interest in a gRPC version of ZooKeeper

2019-11-18 Thread Patrick Hunt
There are quite a few benefits to using grpc imo. It's come up a few times where I've been part of the discussion - ala we make it b/w compat it would be a good move imo. Then the question becomes what else do we fix at the same time? e.g. make version fields 64 bit rather than 32? etc... there

Re: String inconsistency issue when running ZK with OpenJDK 10 on SKL machines

2019-10-28 Thread Patrick Hunt
On Mon, Oct 28, 2019 at 12:06 AM Enrico Olivelli wrote: > Fangmin, > > Il lun 28 ott 2019, 02:23 Fangmin Lv ha scritto: > > > Hey everyone, > > > > (Forgot to add subject in the previous email, resent with clear subject.) > > > > I'd like to share some weird inconsistency bugs we saw recently

Re: Removing Netty support from branch-3.4

2019-10-07 Thread Patrick Hunt
On Fri, Oct 4, 2019 at 9:14 AM Enrico Olivelli wrote: > The release branch 3.4 is frozen and we should cut new releases only for > important security reasons or other important issues for users that cannot > upgrade to 3.5. > > Given that 3.5 is now the suggested version and the upgrade path is

Re: How to scale ZooKeeper to support 10K concurrent connections?

2019-09-27 Thread Patrick Hunt
Whether or not you can use local sessions is a critical aspect: https://issues.apache.org/jira/browse/ZOOKEEPER-1147 Patrick On Fri, Sep 27, 2019 at 2:27 PM Michael Han wrote: > >> can launch tens of thousands of calls > > Is it possible for you to quantify this in a form of (read and write) >

Re: An Apache Zookeeper Security Vulnerability

2019-08-09 Thread Patrick Hunt
On Fri, Aug 9, 2019 at 9:34 AM Enrico Olivelli wrote: > Those points do not seem a security issue > > Agree. First off the data is not sensitive. Also it's debug level and logged on the server. See https://issues.apache.org/jira/browse/ZOOKEEPER-3488 - similar situation although in this case

Re: Issues with using ZooKeeper 3.5.5 together with Solr 8.2.0

2019-08-05 Thread Patrick Hunt
on? Also agree that folks should move off 4lw to the new (better) options, esp as we plan to deprecate 4lw at some point. Patrick On Sun, Aug 4, 2019 at 12:15 PM Enrico Olivelli wrote: > Il sab 3 ago 2019, 21:41 Shawn Heisey ha scritto: > > > On 8/2/2019 10:33 AM, Patrick H

Re: Ephemeral nodes not removed

2019-08-02 Thread Patrick Hunt
The jira you ref'd is the only one that comes to mind. In terms of troubleshooting - try connecting a client to each of the servers in tern and see if it's a situation where they have a different view of the world wrt those znodes. You might also have the client create separate znodes on each

Re: Ephemeral znodes not getting removed

2019-08-02 Thread Patrick Hunt
The jira you ref'd is the only one that comes to mind. In terms of troubleshooting - try connecting a client to each of the servers in tern and see if it's a situation where they have a different view of the world wrt those znodes. You might also have the client create separate znodes on each

Re: Issues with using ZooKeeper 3.5.5 together with Solr 8.2.0

2019-08-02 Thread Patrick Hunt
On Fri, Aug 2, 2019 at 7:12 AM Jan Høydahl wrote: > So it turned out that Solr failed parsing the 'conf' response from > Zookeeper since there was one line that did not conform to the key=value > syntax: > > ... > quorumPort=2888 > peerType=0 > membership: >

Re: Zookeeper latency calculation

2019-07-21 Thread Patrick Hunt
High max latency is typically due to gc, slow disk, or issues with vm oversubscription. Turn on gc logging and verify. I've used strace in the past to troubleshoot disk issues (fsync - although now a days it's in the log). Patrick On Thu, Jul 18, 2019 at 8:40 AM rammohan ganapavarapu <

Re: ACL implimentation

2019-06-12 Thread Patrick Hunt
ont have to go through > that pain if i want to enable ACL ? > > Ram > > On Mon, Jun 10, 2019 at 2:00 PM Andor Molnár wrote: > > > Agreed. I had to dig a bunch of Hortonworks / Stackoverflow docs to > > learn how ACLs work. > > > > > > Andor > >

Re: ACL implimentation

2019-06-09 Thread Patrick Hunt
I had to deal with some ACL issues myself recently and noticed the lack of docs we have, both generally and best practices. I spent a bunch of time when testing the recent ACL changes from Andor just re-learning the shell commands and config necessary to exercise the patches. This would be a great

Re: majority of non-failing machines VS quorum

2019-05-29 Thread Patrick Hunt
2n+1 = ensemble size required to survive n failed zkservers (servers not in the quorum) iow: 3 nodes means 1 zkserver can fail and the service is still up. 5 and you can survive 2 failures. Patrick On Wed, May 29, 2019 at 4:43 PM Joel Mestres wrote: > Hello which is the relation between the

Re: Why does ZooKeeper follower shutdown itself when it can not read from leader

2019-05-22 Thread Patrick Hunt
That was/is the original intent. ZK was built to "fail fast" when it didn't know how to handle a particular case, or that case might be error prone to handle. The expectation is that the parent will restart the ZK server process when it fails. Patrick On Wed, May 22, 2019 at 6:27 PM Qian Zhang

Re: Deprecated CSVInputArchive and XMLInputArchive

2019-05-11 Thread Patrick Hunt
Seems reasonable to drop them unless someone speaks up. Patrick On Thu, May 9, 2019 at 10:11 AM Andor Molnar wrote: > Hi Zili, > > I'm surely not the best person to talk about ZooKeeper history, but as far > as I know these 2 input archives are not actively maintained and I've never > seen

Re: Zookeepr quorum and electing a leader.

2019-04-27 Thread Patrick Hunt
I've never seen this. Are there jira/logs/... that you can point to that would help identify the version and potentially reproduce? Patrick On Mon, Apr 22, 2019 at 10:57 AM Susheel Kumar wrote: > Something similar we have seen during network/VM glitches and making Solr > nodes go down/in a

Re: Limit on zookeeper ephemeral nodes

2019-04-17 Thread Patrick Hunt
On Mon, Apr 8, 2019 at 4:45 PM alwin james wrote: > Hi Team, > > I would like to understand, as part of maintaining a healthy zookeeper > cluster, does the number of ephemeral nodes important? If so, is there any > soft/hard limit on the number? > No limits outside the heap. > Or is it more

Re: openjdk 11 compatibility

2019-04-09 Thread Patrick Hunt
We've been verifying 3.4 against jdk11 - openjdk11 is not currently an option available in the apache jenkins instance. However it should be similar. https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper_branch34_java11/ but you'll need to go to 3.4.14 according to this

Re: Unexpected delay between pings sent from the client to server

2019-04-07 Thread Patrick Hunt
Try turning on verbose GC logging and look if it's GC. In my experience that's often the cause. "client application suddenly loses its Zookeeper session" - might be client GC, or it could be server GC. See if either of those shed light. Patrick On Sat, Apr 6, 2019 at 1:10 PM Norbert Kalmar

Apache ZooKeeper meetup in Palo Alto March 14th

2019-03-01 Thread Patrick Hunt
Hi folks. Andor is visiting the USA and he asked if I could facilitate a meetup the night of March 14th - 530-8pm. This would be in Palo Alto at Cloudera's HQ - 395 Page Mill Rd, Palo Alto. I will reserve one of our meeting rooms and order pizza and beer. I plan to keep it low key, a bit of an

Re: Update from 3.4.9 to 3.4.13

2018-12-14 Thread Patrick Hunt
Review the release notes for each release, they capture the changes. https://zookeeper.apache.org/releases.html Given you are upgrading across fix versions we try to only apply fixes (no features) however sometimes we end up breaking the rules - e.g. if a security issue comes up. Regards,

Re: [Help Wanted] Will zookeeper merge change events?

2018-10-24 Thread Patrick Hunt
contention on the server, for example slow fsyncs due to disk IO contention can be a real killer for latency. (that one is easy to check for in the server logs). Patrick On Wed, Oct 24, 2018 at 7:40 AM Patrick Hunt wrote: > Nothing says that you have to handle client side notifications one-by-on

Re: [Help Wanted] Will zookeeper merge change events?

2018-10-24 Thread Patrick Hunt
Nothing says that you have to handle client side notifications one-by-one, blocking the ZK client notification thread. You can do the aggregation yourself if you like, and this can be done very quickly. You can have separate threads to process the results (separate from the ZK client notification

Re: Healthcheck

2018-09-21 Thread Patrick Hunt
I created this a few years ago, afaik it still works: https://github.com/phunt/zk-smoketest Regards, Patrick On Wed, Sep 19, 2018 at 2:05 AM Jens Rantil wrote: > Hello, > > We are in need for a shell command that we can execute on a specific node > to make sure that the node has come up and

Re: Zookeeper3.4.12 c client compilation issues on HPUX environment

2018-09-12 Thread Patrick Hunt
The libtool package typically (e.g. centos and most linux distros based on similar) has a macro directory under /usr/share/aclocal. Check if macros came with your libtool package. If not you'll need to hardcode use of libtool rather than relying on autotools finding/configuring it for you. It the

Re: Java 11 OpenJDK/Oracle Java Release Cadence Questions

2018-09-11 Thread Patrick Hunt
Hi Jeremiah. It's failing consistently on Jenkins, unlikely to officially support until someone addresses those: https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk-java11/ We've been testing with openjdk for quite some time, those are supported. The docs are ambiguous in that

Re: Verify zookeeper cluster consistency

2018-08-01 Thread Patrick Hunt
iirc Ben (or was it someone else from Facebook) mentioned at a meetup about doing some work in this area. I believe they were hashing the tree but I may be mis-remembering Perhaps one of those folks can shed light. Patrick On Fri, Jul 27, 2018 at 2:04 AM Andor Molnar wrote: > Hi Klearchos,

Re: Zookeeper memory usage

2018-07-25 Thread Patrick Hunt
JVM memory usage is limited by your java heap configuration. GC will kick in to keep it within the bound but the JVM will try to use whatever you configure as the max. Patrick On Tue, Jul 24, 2018 at 4:27 AM Alex Dzyoba wrote: > I forgot to mention that I'm using Zookeeper version 3.5.3-beta >

Re: Trying to figure out what these errors mean

2018-07-25 Thread Patrick Hunt
On Tue, Jul 24, 2018 at 10:17 AM Martin Cigorraga < mailing.lists.forwar...@gmail.com> wrote: > Hi all, > > I'm not a DS guys but an Ops guy instead fairly new to Zookeeper and > Kafka; already spent a ton of time trying to understand what's going > on here but so far I just scratched the surface

Re: zookeeper c client build error

2018-06-27 Thread Patrick Hunt
We used to have access to solaris on Apache Jenkins, however that was removed some time ago. As a result we are no longer able to compile/test against that OS: https://builds.apache.org/computer/ You might check some of the open jiras to get more insight, esp those related to libtool:

Re: Enabling TTL at ZkServer (3.5.4-beta)

2018-06-20 Thread Patrick Hunt
Are the release notes useful? They talk about this change: http://zookeeper.apache.org/doc/r3.5.4-beta/releasenotes.html Notice that documentation has also been updated to reflect what need sto be done. Patrick On Tue, Jun 12, 2018 at 9:00 AM harish lohar wrote: > Getting below exception even

Re: starting multiple zookeeper nodes from one command

2018-06-01 Thread Patrick Hunt
I created this a while ago, I still use it frequently, YMMV: https://github.com/phunt/zkconf Patrick On Fri, Jun 1, 2018 at 9:25 AM Prasanth Mathialagan < prasanthmathiala...@gmail.com> wrote: > I don't quite understand what you meant by services. Are you referring to > running as system

[ANNOUNCE] Apache ZooKeeper 3.5.4-beta

2018-05-18 Thread Patrick Hunt
The Apache ZooKeeper team is proud to announce Apache ZooKeeper version 3.5.4-beta ZooKeeper is a high-performance coordination service for distributed applications. It exposes common services - such as naming, configuration management, synchronization, and group services - in a simple interface

Re: Apache ZooKeeper meetup May 9th in Palo Alto?

2018-05-11 Thread Patrick Hunt
ail.com> wrote: > > > > > Gosh!! I missed it!! May be next time :) > > > > > > On Thu, May 10, 2018 at 8:24 AM, Andor Molnar <an...@cloudera.com> > > wrote: > > > > > > > Thanks everybody for joining. > > > > It was great to see you f

Re: removing ZK installation

2018-05-08 Thread Patrick Hunt
http://bigtop.apache.org/index.html Patrick On Tue, May 8, 2018 at 12:55 PM, harish lohar wrote: > Could someone please let me know where to get RPM for Centos for Zookeeper. > > Thanks > Harish > > On Tue, May 8, 2018 at 1:57 PM, Washko, Daniel >

Re: Confluence wiki contribution.. how?

2018-05-06 Thread Patrick Hunt
Unfortunately a while back we had to turn off edit perms for general users due to spam. Send me your Confluence user ID privately and I'll update the perms so that you can edit. Regards, Patrick On Fri, May 4, 2018 at 12:55 AM, Andor Molnar wrote: > Hi Alberto, > > Your

Re: ZooKeeper 3.4.11 bug: dataDir and dataLogDir swapped

2018-02-05 Thread Patrick Hunt
This is a good point Andor. I've updated the release page on the website to reflect the regression addressed in ZOOKEEPER-2960 and upcoming fix. Thanks! Patrick On Fri, Feb 2, 2018 at 1:07 AM, Andor Molnar wrote: > Hi all, > > Please be aware that 3.4.11 has a quite

Re: Different disk writes per node

2018-01-24 Thread Patrick Hunt
Hi Eric. I saw your original post but didn't have anything of value to add. I've never seen this myself. That said "never seen" not in the sense that it's impossible, just in the sense that I've never looked for such an issue and have never noticed such a thing. It does seem unusual however.

Re: Any technical reasons that the 3.5.x series is still in beta?

2018-01-23 Thread Patrick Hunt
On Mon, Jan 22, 2018 at 3:47 PM, Jeff Widman wrote: > Is there any technical reasons the 3.5.x series is still in beta? > > Looking through existing issues, I don't see anything critical that makes > 3.5.x less stable than the 3.4.x series... > > If the answer is "no tech

Re: Why are ephemeral nodes written to disk?

2018-01-17 Thread Patrick Hunt
Jan 17, 2018 3:28 PM, "Patrick Hunt" <ph...@apache.org> wrote: > > > On Tue, Jan 9, 2018 at 12:38 PM, Jeff Widman <j...@jeffwidman.com> > wrote: > > > > > Ephemeral nodes only exist for the life of the client session. > > > > > &g

Re: Why are ephemeral nodes written to disk?

2018-01-17 Thread Patrick Hunt
On Tue, Jan 9, 2018 at 12:38 PM, Jeff Widman wrote: > Ephemeral nodes only exist for the life of the client session. > > As far as I understand, by definition, a client session ends when the > entire zookeeper ensemble goes down. > > So I would expect that ephemeral nodes

Re: Zookeeper session expiration

2017-12-07 Thread Patrick Hunt
considered it fishy. > > > > Of course, this is why we use ntp with adjtime to avoid clocks going > backwards or making large jumps forward. > > > > -Original Message- > > From: Patrick Hunt [mailto:ph...@apache.org] > > Sent: Wednesday, December 06, 2017 5

Re: Zookeeper session expiration

2017-12-06 Thread Patrick Hunt
What Jordan said + time use is only in the relative sense, not the absolute. Session tracking (expiration) is relative to the start of leadership. Patrick On Mon, Dec 4, 2017 at 12:21 PM, Jordan Zimmerman < jor...@jordanzimmerman.com> wrote: > ZooKeeper, indeed, does not use wall clock time. It

Re: Hi Zookeeper expert, we encountered a slow attack problem, described as follows, please help to analyze and confirm, thank you very much.

2017-11-27 Thread Patrick Hunt
a particular IP address (2181 being the default ZK client port, YMMV) Regards, Patrick On Mon, Nov 27, 2017 at 12:10 PM, Patrick Hunt <ph...@apache.org> wrote: > Hi zhanggang, thank you for the report. Unfortunately the attachment did > not come through (probably stripped by t

Re: Hi Zookeeper expert, we encountered a slow attack problem, described as follows, please help to analyze and confirm, thank you very much.

2017-11-27 Thread Patrick Hunt
Hi zhanggang, thank you for the report. Unfortunately the attachment did not come through (probably stripped by the ML daemon). Iiuc this does sound like a problem that would be good to address. We have code in place to limit the number of concurrent sessions that a particular IP can hold open at

[ANNOUNCE] Apache ZooKeeper 3.4.11

2017-11-16 Thread Patrick Hunt
The Apache ZooKeeper team is proud to announce Apache ZooKeeper version 3.4.11 ZooKeeper is a high-performance coordination service for distributed applications. It exposes common services - such as naming, configuration management, synchronization, and group services - in a simple interface so

Re: Zookeeper quorum goes down for no apparent reason in 3.4.5

2017-10-09 Thread Patrick Hunt
umPeer[myid=3]/127.0.0.1: > 5002 > :FileTxnSnapLog@240] - Snapshotting: 0x60086 to > /run/zookeeper/version-2/snapshot.60086 > 2017-09-21 03:00:03,773 [myid:3] - INFO [SyncThread:3:FileTxnLog@199] - > Creating new log file: log.60087 > 2017-09-21 03:00:03,776 [myid:3

Re: Node can be created and listed, but not updated or deleted

2017-10-09 Thread Patrick Hunt
t; What else could it be? Corrupted data? > > On Wed, Sep 27, 2017 at 8:29 PM, Patrick Hunt <ph...@apache.org> wrote: > > > I spun up a 3.4.10 zk server (standalone) and tried to reproduce the > issue > > - the delete works fine for me. Perhaps the length is a red herri

[SECURITY] CVE-2017-5637: DOS attack on wchp/wchc four letter words (4lw)

2017-10-09 Thread Patrick Hunt
CVE-2017-5637: DOS attack on wchp/wchc four letter words (4lw) Severity: moderate Vendor: The Apache Software Foundation Versions Affected: ZooKeeper 3.4.0 to 3.4.9 ZooKeeper 3.5.0 to 3.5.2 The unsupported ZooKeeper 1.x through 3.3.x versions may be also affected Note: The 3.5 branch is still

Re: Zookeeper quorum goes down for no apparent reason in 3.4.5

2017-10-05 Thread Patrick Hunt
Unfortunately I don't see any attached logs, which makes it difficult to provide you with insight. "Not sufficient followers synced" indicates that you're losing followers, likely they are falling behind - what is your metric tracking telling your wrt load on the compute and load on the

Re: When zookeeper 3.5 version can go to final stable ?

2017-10-04 Thread Patrick Hunt
Hi, last update on this was a few days ago, please see the archives. https://lists.apache.org/thread.html/2d006f5914d5b06777efe378422820 8f889e66f903b561251758e08c@%3Cdev.zookeeper.apache.org%3E Regards, Patrick On Mon, Oct 2, 2017 at 5:03 AM, L. J.W wrote: > When

Re: Node can be created and listed, but not updated or deleted

2017-09-27 Thread Patrick Hunt
I spun up a 3.4.10 zk server (standalone) and tried to reproduce the issue - the delete works fine for me. Perhaps the length is a red herring (or I had some other diff I didn't notice) and it's something else? zk: localhost:2181(CONNECTED) 12] ls

Re: Running Zookeeper as a windows service

2017-09-27 Thread Patrick Hunt
I haven't had any experience with this myself however googling for "zookeeper windows service" resulted in a couple of blog posts and a github repo above the fold. Perhaps one of those will shed light. Best of luck, patrick On Fri, Sep 15, 2017 at 4:41 AM, Giannis Doukas <

Re: Upgrade of Zookeeper and Kafka

2017-08-16 Thread Patrick Hunt
eply. I'm still quite new to ZooKeeper, could you explain > a bit more about why Kafka is involved? > > Thank you! > Michael > > > On Aug 16, 2017, at 13:15, Patrick Hunt <ph...@apache.org> wrote: > > > > I'm not aware of any issues wrt ZK, but it'

Re: Upgrade of Zookeeper and Kafka

2017-08-16 Thread Patrick Hunt
I'm not aware of any issues wrt ZK, but it's probably a better question for the Kafka folks (as they are consuming the service). Patrick On Mon, Aug 14, 2017 at 2:57 PM, Carmen Molatch wrote: > Can I upgrade to Zookeeper 3.4.10 and Kafka 11? Are there any known >

Re: New PMC Member: Michael Han

2017-06-27 Thread Patrick Hunt
Kudos Michael, well deserved! Patrick On Tue, Jun 27, 2017 at 9:49 AM, Jordan Zimmerman < jor...@jordanzimmerman.com> wrote: > Congrats! > > > On Jun 27, 2017, at 11:48 AM, Flavio Junqueira wrote: > > > > I'm very happy to announce that the Apache ZooKeeper PMC has voted to >

Re: Recovering from zxid rollover

2017-05-26 Thread Patrick Hunt
On Wed, May 24, 2017 at 8:08 AM, Mike Heffner <m...@librato.com> wrote: > On Tue, May 23, 2017 at 10:21 PM, Patrick Hunt <ph...@apache.org> wrote: > > > On Tue, May 23, 2017 at 3:47 PM, Mike Heffner <m...@librato.com> wrote: > > > > > Hi, &g

Re: Recovering from zxid rollover

2017-05-23 Thread Patrick Hunt
On Tue, May 23, 2017 at 3:47 PM, Mike Heffner wrote: > Hi, > > I'm curious what the best practices are for handling zxid rollover in a ZK > ensemble. We have a few five-node ZK ensembles (some 3.4.8 and some 3.3.6) > and they periodically rollover their zxid. We see the

Re: [ANNOUNCE] Apache ZooKeeper 3.5.3-beta

2017-04-20 Thread Patrick Hunt
n order to maintain backward compat > > with a > > > change that we made to 3.5. > > > Not sure if it's ZOOKEEPER-1633. Basically rolling upgrade would not > work > > > from 3.4.x to 3.5.y if x < 6. > > > > > > On Wed, Apr 19, 2017 at 5:

Re: [ANNOUNCE] Apache ZooKeeper 3.5.3-beta

2017-04-19 Thread Patrick Hunt
I remember we had to make a change to 3.4.x (x>0) in order to maintain backward compat with a change that we made to 3.5. I searched but I can't remember the specific jira or the specific release, it was some time ago. The issue would be that if you try and do a rolling upgrade from 3.4.x-1 to

Re: Ideal values for tickTime and syncLimit

2017-03-27 Thread Patrick Hunt
On Thu, Mar 23, 2017 at 11:14 PM, Jai Bheemsen Rao Dhanwada < jaibheem...@gmail.com> wrote: > Hello, > > We have a mult dc zk cluster, and all of a sudden we started seeing > timeouts with observers. > while we are investigating the possible network issue, we would like to > understand accepted

Re: How to modify Client Connection timer

2017-03-27 Thread Patrick Hunt
le thundering herd problems that result from this very basic logic. Some form of exponential backoff would probably even be better. (configurable?) That said I've never seen the size of the ensemble be that much of an issue given the other issue. You might also check the latest trunk, can't remember if

Re: How to modify Client Connection timer

2017-03-22 Thread Patrick Hunt
You should be able to control that by increasing the session timeout. I'm not familiar with the client code you are using however. Additionally the ZK client should retry it's connection (as long as you don't close the ZK object) continually until it is able to reconnect. Patrick On Mon, Mar 20,

Re: Multiple credentials associated with same principal?

2017-02-15 Thread Patrick Hunt
On Tue, Feb 14, 2017 at 7:41 AM, Stevo Slavić wrote: > Is this natively supported by ZooKeeper or does this require some > customization? > > Hi Stevo, I've never heard of anyone taking this approach, and I don't believe it is possible today. Regards, Patrick > On Thu, Feb

ZooKeeper DOS exploit published

2017-02-13 Thread Patrick Hunt
Hi folks. The following exploit was recently published on the web and has come to our attention, it details a ZooKeeper DOS attack against certain four letter words (4lw), possible when the client port is exposed to untrusted actors:

Re: Multiple credentials associated with same principal?

2017-02-02 Thread Patrick Hunt
Hi Stevo, you might be talking about one of the following variants? (see the jiras linked to from this jira) https://issues.apache.org/jira/browse/ZOOKEEPER-1634 Patrick On Thu, Feb 2, 2017 at 4:38 AM, Stevo Slavić wrote: > Alternatively, is it possible to set ACL that would

Re: Is SSL supported in 3.4.9?

2016-12-07 Thread Patrick Hunt
Hi Dan. While Netty supports SSL I don't believe anyone worked through the issues of what needed to change in ZK itself to get it to work until 3.5/ZOOKEEPER-2125. iow Netty was added in 3.4, however Netty+SSL was not available until 3.5. Patrick On Wed, Dec 7, 2016 at 6:44 AM, Dan Langille

Re: Configurable SSL Context

2016-12-07 Thread Patrick Hunt
Hi George, I don't see any open JIRAs that mention this, while not an expert myself it sounds like something that might be useful for others. Please open a JIRA and submit a patch. Regards, Patrick On Mon, Dec 5, 2016 at 2:23 PM, George Goddard wrote: > Hello, > > Are

Re: Upgrading Zookeeper

2016-09-12 Thread Patrick Hunt
? Is backwards/forward compatability documented somewhere? > > Thanks, > Jens > > On Monday, September 12, 2016, Patrick Hunt <ph...@apache.org> wrote: > >> Hi Allen. I don't believe there is anything in the docs however there >> is something in the FAQ: >> &

Re: Upgrading Zookeeper

2016-09-12 Thread Patrick Hunt
Hi Allen. I don't believe there is anything in the docs however there is something in the FAQ: What are the options-process for upgrading ZooKeeper? https://cwiki.apache.org/confluence/display/ZOOKEEPER/FAQ Patrick On Thu, Sep 8, 2016 at 2:58 PM, allen chan wrote:

Re: log4j configuration for zookeeper with kafka process

2016-09-06 Thread Patrick Hunt
Hi Mazhar, if you're interested in ZK client logs when running Kafka that's a question better answered by the Kafka community, there are many ways to set things up, it's likely controlled by the Kafka configuration. Regards, Patrick On Tue, Sep 6, 2016 at 2:40 AM, Mazhar Shaikh

Re: Issue with NettyServerCnxn.java

2016-08-31 Thread Patrick Hunt
Hi Yuliya - my read is that sendResponse in NIOServerCnxn is logging, then dropping, any Exceptions encountered during sendResponse. In other words it's doing best effort response. Not sure if that is "correct", but that's what it's currently doing in NIO. Surprisingly it's also hiding any

Re: Exception using zkCli

2016-08-26 Thread Patrick Hunt
Take a look at the log4j logs for the client. Notice the operation is "getChildren", and you mentioned having 34k znodes. A common issues is "jute.maxbuffer" being exceeded (try googling it) during an operation. ZK has a limit to the size of message it will send client->server or server->client.

Re: ZK 3.4.6 creating malformed TGT

2016-08-24 Thread Patrick Hunt
urn null; > > Logs didn't indicate that my code had gotten to this point but it did get > me thinking that this is a problem with my jaas or my krb5.conf since the > KerberTicket objects are being retrieved from a Java API. The man page on > krb5.conf gave me the next clues. Playing ar

Re: ZK 3.4.6 creating malformed TGT

2016-08-17 Thread Patrick Hunt
I'm not an expert but I believe this indicates an environmental issue, see "service ticket not found in the subject" here: https://steveloughran.gitbooks.io/kerberos_and_hadoop/content/sections/errors.html Patrick On Tue, Aug 16, 2016 at 12:11 PM, Irfan Hamid wrote: >

Re: How to investigate these error codes

2016-08-16 Thread Patrick Hunt
What version of the c client are you using - multi-threaded or single threaded? If multi-threaded then the library (incl pthreads) will take care of handling the periodic heartbeats for you. If single threaded then you might be starving the event processing - which includes the heartbeat loop. See

Re: [ANNOUNCE] Chris Nauroth joins the Apache ZooKeeper PMC

2016-08-07 Thread Patrick Hunt
Kudos Chris! Patrick On Sun, Aug 7, 2016 at 11:05 AM, Flavio Junqueira wrote: > In recognition of all his contributions to the project, the Apache > ZooKeeper PMC has invited Chris Nauroth to join the PMC and he has > accepted. I'd like to take the opportunity to thank Chris

Re: Chocolatey packages for ZooKeeper, Kafka?

2016-07-29 Thread Patrick Hunt
Hi Andrew, if you want to publish somelike like that for ZK (on github say) we'd be happy to link to it on our wiki "useful tools" page. Regards, Patrick On Thu, Jul 28, 2016 at 6:58 PM, Andrew Pennebaker < andrew.penneba...@gmail.com> wrote: > Could we please publish Chocolatey packages for

Re: zookeeper SASL issue

2016-07-22 Thread Patrick Hunt
I recommend you turn on as much debugging as is available, add debug=true to your Server jaas config, and add "-Dsun.security.krb5.debug=true" when starting any jvm. That will ensure you get as much detail as the security logging has available. Patrick On Thu, Jul 21, 2016 at 1:12 PM, Michael

Re: [ANNOUNCE] Apache ZooKeeper 3.5.2-alpha

2016-07-22 Thread Patrick Hunt
Kudos all, thanks Chris! Patrick On Thu, Jul 21, 2016 at 7:46 PM, Flavio Junqueira wrote: > Thanks everyone for the push to get 3.5.2-alpha out, and specially to Chris > for being the release manager. > > -Flavio > >> On 21 Jul 2016, at 13:08, Chris Nauroth

Re: zk_outstanding_requests acceptable threshold limit

2016-07-13 Thread Patrick Hunt
It depends on your load. If it's sustained > 0 for any length of time that indicates that the server is not able to process requests quickly enough. Usually that means disk IO problems (that's where I typically see it). Patrick On Tue, Jul 12, 2016 at 12:28 PM, rammohan ganapavarapu <

Re: etcd3

2016-06-30 Thread Patrick Hunt
The "ZooKeeper Team" is you -- all of you! :-) Our project is community based and as such everyone can develop, test, improve, market, evangelize, etc... the project. If you're using ZK please feel free to let folks know. If you have ideas or just want to "scratch an itch" please participate. At

Re: read under transaction

2016-06-28 Thread Patrick Hunt
Multi is more of an atomic operation than a "transaction" in the typical sense. See https://issues.apache.org/jira/browse/ZOOKEEPER-965 for some background. I don't believe the original use case involved reading multiple znodes, rather updating multiple. Patrick On Mon, Jun 20, 2016 at 2:33 PM,

Re: Gentoo emerge package for ZooKeeper?

2016-06-28 Thread Patrick Hunt
I would think that Bigtop would add gentoo support if there's sufficient demand. Perhaps ask on their user list? Patrick On Tue, Jun 28, 2016 at 2:13 AM, Flavio Junqueira wrote: > Hi Andrew, > > For packaging, we have been delegating to Bigtop, see this jira for some > context

  1   2   3   4   5   >