RE: Understanding spark concepts cluster, master, slave, job, stage, worker, executor, task

2016-07-21 Thread Joaquin Alzola
e. I suppose google translate suck on the translation. From: Taotao.Li [mailto:charles.up...@gmail.com] Sent: 21 July 2016 04:04 To: Jean Georges Perrin <j...@jgp.net> Cc: Sachin Mittal <sjmit...@gmail.com>; user <user@spark.apache.org> Subject: Re: Understanding spark concep

Re: Understanding spark concepts cluster, master, slave, job, stage, worker, executor, task

2016-07-21 Thread Taotao.Li
Hi, Sachin, there is no planning on translate these into english currently, sorry for that, but you can check databrick's blog, there are lots of high-quality and easy-understanding posts. or you can check the list in this post of mine, choose the English version: -

Re: Understanding spark concepts cluster, master, slave, job, stage, worker, executor, task

2016-07-20 Thread Sachin Mittal
Hi, Thanks for the links, is there any english translation for the same? Sachin On Thu, Jul 21, 2016 at 8:34 AM, Taotao.Li wrote: > Hi, Sachin, here are two posts about the basic concepts about spark: > > >- spark-questions-concepts >

Re: Understanding spark concepts cluster, master, slave, job, stage, worker, executor, task

2016-07-20 Thread Taotao.Li
Hi, Sachin, here are two posts about the basic concepts about spark: - spark-questions-concepts - deep-into-spark-exection-model And, I fully recommend

Re: Understanding spark concepts cluster, master, slave, job, stage, worker, executor, task

2016-07-20 Thread Jean Georges Perrin
Hey, I love when questions are numbered, it's easier :) 1) Yes (but I am not an expert) 2) You don't control... One of my process is going to 8k tasks, so... 3) Yes, if you have HT, it double. My servers have 12 cores, but HT, so it makes 24. 4) From my understanding: Slave is the logical

Understanding spark concepts cluster, master, slave, job, stage, worker, executor, task

2016-07-20 Thread Sachin Mittal
Hi, I was able to build and run my spark application via spark submit. I have understood some of the concepts by going through the resources at https://spark.apache.org but few doubts still remain. I have few specific questions and would be glad if someone could share some light on it. So I