Hi Sonali,

This was something that I had some questions about originally as well. In terms 
of required components then yes, for any size of Samza deployment you will  
need all those pieces. 

In terms of actual deployment, from what I understand from the LinkedIn guys 
they do run Samza on a dedicated YARN grid that also has a Kafka broker 
collocated on each node. These decisions though appear to be more down to 
convenience than a hard requirement.

In my own setup I have existing ZooKeeper and Kafka clusters that I'm pointing 
Samza at but do need to run a dedicated YARN grid because my Hadoop cluster has 
a pre-2.2 version of YARN running on it.

So if you have existing components you can reuse them, if not then repurposing 
the Hello Samza package is a good starting point to get all the things you want 
on the required hosts. Only caveat would be to not drop a ZK node on each host, 
the ZK quorum should follow the usual advice of an odd number of servers and 
likely no more than 3, 5 or 7 depending on your deployment size.

Garry

-----Original Message-----
From: [email protected] 
[mailto:[email protected]] 
Sent: 30 January 2014 23:38
To: [email protected]
Subject: Cluster Installation

Hi All,

I'm new to working with Samza and have been trying to figure out the best 
cluster configuration. I understand that Samza comes with yarn,kafka and 
zookeeper out of the box. Is that the model just for a standalone/local 
configuration. What if I want a bigger cluster? Do I have to install yarn, 
kafka and zookeeper separately? Any suggestions would be great!

Thanks,
Sonali

Sonali Parthasarathy
R&D Developer, Data Insights
Accenture Technology Labs
703-341-7432


________________________________

This message is for the designated recipient only and may contain privileged, 
proprietary, or otherwise confidential information. If you have received it in 
error, please notify the sender immediately and delete the original. Any other 
use of the e-mail by you is prohibited. Where allowed by local law, electronic 
communications with Accenture and its affiliates, including e-mail and instant 
messaging (including content), may be scanned by our systems for the purposes 
of information security and assessment of internal compliance with Accenture 
policy. .
______________________________________________________________________________________

www.accenture.com

-----
No virus found in this message.
Checked by AVG - www.avg.com
Version: 2014.0.4259 / Virus Database: 3684/7046 - Release Date: 01/30/14

Reply via email to