No of cores per executor.

2019-12-08 Thread Amit Sharma
I have set 5 cores per executor. Is there any formula to determine best combination of executor and cores and memory per core for better performance. Also when I am running local spark instance in my web jar getting better speed than running in cluster. Thanks Amit

Re: Request more yarn vcores than executors

2019-12-08 Thread Chris Teoh
If that is the case, perhaps set vcore to CPU core ratio as 1:1 and just do --executor-cores 1 and that would at least try to get you more threads per executor. Note that vcore is a logical construct and isn't directly related to CPU cores, just the time slice allowed over the entire set of CPUs

Re: Request more yarn vcores than executors

2019-12-08 Thread jelmer
> > you can take on more simultaneous tasks per executor That is exactly what I want to avoid. that nature of the task makes it difficult to parallelise over many partitions. Ideally i'd have 1 executor per task with 10+ cores assigned to each executor On Sun, 8 Dec 2019 at 10:23, Chris Teoh

Re: Request more yarn vcores than executors

2019-12-08 Thread Chris Teoh
I thought --executor-cores is the same the other argument. If anything, just set --executor-cores to something greater than 1 and don't set the other one you mentioned. You'll then get greater number of cores per executor so you can take on more simultaneous tasks per executor. On Sun, 8 Dec

Request more yarn vcores than executors

2019-12-08 Thread jelmer
I have a job, running on yarn, that uses multithreading inside of a mapPartitions transformation Ideally I would like to have a small number of partitions but have a high number of yarn vcores allocated to the task (that i can take advantage of because of multi threading) Is this possible? I