I have only run HDFS on the same nodes as Spark, and that worked out well in terms of both performance and robustness. However, I did not run Hadoop itself to do any computations/jobs on those nodes. My expectation is that if you actually ran both at the same time with your configuration, performance would be pretty bad. It is mostly about memory contention, and then CPU and so on.
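
For what it's worth, here is a rough sketch of a conf/spark-env.sh for each worker, assuming you want to leave roughly half of each 4-core/8GB VM for the Hadoop daemons and the OS (the exact split is my assumption, not a tested recommendation):

    # conf/spark-env.sh (Spark standalone mode, per worker node)
    # Cap what the Spark worker offers so Hadoop daemons and the OS keep the rest.
    export SPARK_WORKER_CORES=2      # assumed split: 2 of the 4 virtual cores
    export SPARK_WORKER_MEMORY=4g    # assumed split: 4GB of the 8GB per node
    export SPARK_DAEMON_MEMORY=512m  # memory for the master/worker daemon JVMs themselves

Note that these only cap what the standalone worker advertises; each application's executor memory is still set separately (spark.executor.memory / --executor-memory) and has to fit within SPARK_WORKER_MEMORY.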

OD

On 6/20/14, 2:41 PM, Sameer Tilak wrote:
Dear Spark users,

I have a small 4-node Hadoop cluster. Each node is a VM with 4 virtual cores, 8GB of memory, and 500GB of disk. I am currently running Hadoop on it and would like to run Spark (in standalone mode) alongside Hadoop on the same nodes. Given the configuration of my nodes, will that work? Does anyone have experience with the stability and performance of running Spark and Hadoop on somewhat resource-constrained nodes? Looking at the Spark documentation, there is a way to configure cores and memory for the worker nodes and memory for the Spark daemons: SPARK_WORKER_CORES, SPARK_WORKER_MEMORY, SPARK_DAEMON_MEMORY. Any recommendations on how to share resources between Hadoop and Spark?



