Hi Prabcs,

Much of that information can be found in the documentation and on the wiki. The rest can be found in the code. Improvements to the documentation are not only welcome but encouraged. Below are a few links to get you started:

Documentation (see "Zookeeper Directories"):
http://kafka.apache.org/documentation.html#distributionimpl

Wiki:
https://cwiki.apache.org/confluence/display/KAFKA/Kafka+data+structures+in+Zookeeper

I also recommend that contribution-oriented emails be sent to the Kafka development email list: d...@kafka.apache.org

Thank you,
Grant

On Tue, Aug 18, 2015 at 8:03 AM, Prabhjot Bharaj <prabhbha...@gmail.com> wrote:

> Hello Folks,
>
> I wish to contribute to Kafka internals. And, one of the things which can
> help me do that is understanding how kafka uses zookeeper. I have some of
> these basic doubts:-
>
> 1. Is zookeeper primarily used for locking ? If yes, in what cases and what
> kind of nodes does it use - sequential/ephemeral?
>
> 2. Does kafka use zookeeper watches for any of functions ?
>
> 3. What kind of state is stored in Zookeeper ? (I believe it has to be the
> leader information per partition, but is there anything apart from it?)
> What is the scale of data that is stored in Zookeeper ?
>
> Looking forward for your help.
>
> Thanks,
> prabcs
>

--
Grant Henke
Software Engineer | Cloudera
gr...@cloudera.com | twitter.com/gchenke | linkedin.com/in/granthenke