Repartition and Worker Instances

Deep Pradhan Mon, 23 Feb 2015 07:33:12 -0800

Hi,
If I repartition my data by a factor equal to the number of worker
instances, will the performance be better or worse?
As far as I understand, the performance should be better, but in my case it
gets worse.
I have a single-node standalone cluster; is that the reason?
Would I be guaranteed better performance if I did the same thing on a
multi-node cluster?


Thank You
