Re: How to find current size of a znode

2023-07-14 Thread Steph van Schalkwyk
plaining when trying to list sub nodes. > > /Ryan > > On 7/14/23, 3:01 PM, "Steph van Schalkwyk" <mailto:svanschalk...@gmail.com>> wrote: > > > To your last point - ZK was designed to distribute small packets, hence the > 1M buffer. > I've h

Re: How to find current size of a znode

2023-07-14 Thread Steph van Schalkwyk
To your last point - ZK was designed to distribute small packets, hence the 1M buffer. I've had a client who had a Solr connector that kept on creating new fields from different sources, and the Solr schema quickly grew to 4M. That's about the biggest I've seen ZK operate reliably. On Fri, Jul 14,

Re: Zookeeper server doesn't list all the participants

2023-07-06 Thread Steph van Schalkwyk
Took a quick glance at your issue. I see a 169.xxx IP address in your diags, which means that NIC cannot get an address reservation. Check your ZK configs and don't use 0.0.0.0 for an IP as it binds to all your NICS, use a pingable IP address as in 10.0.xxx. Make sure your server,0, server.1 se

Re: 3-node Zookeeper ensemble unable to recover if leader fails

2023-07-03 Thread Steph van Schalkwyk
1. Are the ZKs using the same NIC as Kafka? 2. Have you been watching GC and CPU usage when these fails occur? 3. Are both Kafka and ZK on the failed node going down at the same time, of only ZK? I'd check on the NIC, and see why it isn't recovering. Steph On Mon, Jul 3, 2023 at 3:29 AM João Silv

Re: Solr Restarting frequently.

2023-01-18 Thread Steph van Schalkwyk
What's your field count? On Wed, Jan 18, 2023 at 10:20 AM Rohit Walecha wrote: > [image: Screenshot from 2023-01-18 19-06-33.png] > > Restart pattern is above.. > > On Wed, Jan 18, 2023 at 2:46 PM Rohit Walecha wrote: > >> Hi, >> >> We have a 3 node *solr(8.8.0)* cluster deployed on multiple en

Re: Zookeeper leader election for client read and write requests

2022-09-17 Thread Steph van Schalkwyk
Just google leader election site:zookeeper.apache.org On Fri, Sep 16, 2022 at 7:39 PM Kaushal Shriyan wrote: > Hi, > > I am running Zookeeper version: 3.7.0 ( 3 nodes -> 1 Leader and 2 > Followers) on CentOS Linux release 7.9.2009 (Core). In an ensemble of 3 > nodes with 1 leader and 2 follower

Re: Zookeeper on Kubernetes and Presistent Volumes

2020-09-14 Thread Steph van Schalkwyk
van schalkwyk+1.314.452.2896 (Tel/SMS)* On Mon, Sep 14, 2020 at 10:21 AM Enrico Olivelli wrote: > Remi, > sorry for the late reply. > > Your cluster would be able to work if and only if at least X/2 + 1 servers > are up and running and properly connected to the other peers. >

Re: WARN message

2018-10-03 Thread Steph van Schalkwyk
In my case it was -Djute.maxbugger setting that solved it. There was a larger file that needed to be propagated and the default (1MB) size was too small. Set the size to 4MB but try to keep it small, as I believe it is stuck on the heap (comments?). Make sure when you add it is it being picked up b

Re: Configuring jute.maxbuffer on ZooKeeper Nodes

2018-09-18 Thread Steph van Schalkwyk
Be careful not to set it too high as ZK seems to ignore it then. I have a SOLR installation where I set it to several GB and it still had the issue. Set it to something reasonable, like 4MB or so. +1.314.452.2896 (Tel/SMS) On Tue, Sep 18, 2018 at 12:30 PM, Kamoliddinov, Elbek < elbek.kamoliddi..

Re: How to use these system properties

2018-09-11 Thread Steph van Schalkwyk
65536 > syncEnabled=true > readonlymode.enabled=false > tcpKeepAlive=false > ipReachableTimeout=0 > > > On Tue, Sep 11, 2018 at 6:56 AM Steph van Schalkwyk < > svanschalk...@gmail.com> > wrote: > > > Kindly post the solution for others to not have to re-inve

Re: How to use these system properties

2018-09-11 Thread Steph van Schalkwyk
Kindly post the solution for others to not have to re-invent the wheel. +1.314.452.2896 (Tel/SMS) On Mon, Sep 10, 2018 at 11:18 PM, rammohan ganapavarapu < rammohanga...@gmail.com> wrote: > NVM, i think i got it. > > Thanks, > Ram > > On Mon, Sep 10, 2018 at 3:35 PM rammohan ganapavarapu < > ra

Re: Please help: Zookeeper not coming up after power down

2018-08-16 Thread Steph van Schalkwyk
1. Turn off your firewall or do #sudo iptables --list-rules 2. netstat -nlutp to see if ZK is listening on that port Issue is there is something blocking access to the port. +1.314.452.2896 (Tel/SMS) On Thu, Aug 16, 2018 at 8:15 PM, Raghav wrote: > 1. Ids/Nodes are correct and their is no prob

Re: Follower never recovers and keeps saying ZooKeeperServer not running

2018-07-04 Thread Steph van Schalkwyk
ill one of the devs answer. my issue was on solr restart. i incrwased znode siE on all and restarted On Wed, Jul 4, 2018, 10:37 AM Benjamin Jaton wrote: > Yes - can increasing the ZK node size cause crashes like this? > > On Wed, Jul 4, 2018 at 8:11 AM Steph van Schalkwyk <

Re: Follower never recovers and keeps saying ZooKeeperServer not running

2018-07-04 Thread Steph van Schalkwyk
think default is one m On Wed, Jul 4, 2018, 10:04 AM Benjamin Jaton wrote: > Ah interestingly we did increase the znode size to ~ 40m. Think this can be > linked? > > On Tue, Jul 3, 2018 at 10:55 PM Steph van Schalkwyk < > svanschalk...@gmail.com> > wrote: > &

Re: Follower never recovers and keeps saying ZooKeeperServer not running

2018-07-03 Thread Steph van Schalkwyk
I've seen this bur can't remember what caused it. What's your zNode size? steph On Tue, Jul 3, 2018, 9:17 PM Benjamin Jaton wrote: > Hello, > > I'm wondering what can cause a ZK follower to check out like this: > > 2018-07-03T13:43:28,814 [myid:] - ERROR [LearnerHandler-/10.0.0.248:40282 > :Lear

Re: removing ZK installation

2018-05-08 Thread Steph van Schalkwyk
Find where it is installed - typically /opt/zookeeper. Also do a which zookeeper to see if it is linked to /usr/bin or some such place. Make sure zookeeper is stopped. Far as I recall, Centos has Upstart, so sudo stop zookeeper and sudo disable zookeeper. Or sudo systemctl stop zookeeper and sudo s

Re: Configuring SolrCloud with Redundancy on Two Physical Frames

2018-05-01 Thread Steph van Schalkwyk
Unless one could instantly spin up another ZK instance on the non-failed frame... +1.314.452.2896 (Tel/SMS) On Tue, May 1, 2018 at 12:00 PM, Steph van Schalkwyk < svanschalk...@gmail.com> wrote: > Adam, > More information here: > https://stackoverflow.com/questions/24694296/usi

Re: Configuring SolrCloud with Redundancy on Two Physical Frames

2018-05-01 Thread Steph van Schalkwyk
y 1, 2018 at 11:53 AM, Adam Blank wrote: > Thanks for your replies, Steph. Adding back the rest of the mailing list. > If anyone can shed some light on my predicament that would be much > appreciated. > > Adam > > -- Forwarded message -- > From: Steph van Sc

Re: Configuring SolrCloud with Redundancy on Two Physical Frames

2018-05-01 Thread Steph van Schalkwyk
Adam, is it possible to virtualize in any way? As for single physical instances, I have been running three instances of ZK on one VM quite comfortably. This is only for dev/testing, though. Regards Steph +1.314.452.2896 (Tel/SMS) On Tue, May 1, 2018 at 9:55 AM, Adam Blank wrote: > Hello, > > I

Re: Ensemble fails when one node looses connectivity

2018-03-02 Thread Steph van Schalkwyk
If this is a t2.micro on AWS, then it has 1GB of RAM. On Fri, Mar 2, 2018 at 9:47 AM, Shawn Heisey wrote: > On 3/2/2018 6:54 AM, Jim Keeney wrote: > >> Thanks for jumping in on the ZK side as well. >> >> I will take a hard look at my config files but I checked and I do not have >> any one fil

Re: Ensemble fails when one node looses connectivity

2018-03-01 Thread Steph van Schalkwyk
gs to be larger than that of the server. Is that true? > > Thanks in advance. > > Jim K. > > On Thu, Mar 1, 2018 at 9:13 PM, Jim Keeney wrote: > > > Thanks, Yes, I have about 2MB stored in the configurations folders. I > will > > increase the jute.maxbuf

Re: Ensemble fails when one node looses connectivity

2018-03-01 Thread Steph van Schalkwyk
Does the log say anything about timing out on init? Your initLimit is already pretty big, but then we don't know anything about your setup. Are you storing more than 1MB in a znode? Then increase jute.maxbuffer (in java.env as a -Djute.maxbuffer=xx). I've recently run into that with Fusion 3.1.