1) Spark only needs to shuffle when data must be repartitioned across the workers in an all-to-all fashion. 2) Multi-stage jobs that would normally require several MapReduce jobs, forcing intermediate data to be dumped to disk between jobs, can instead keep that intermediate data cached in memory.
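The second point can be sketched with Spark's RDD API in Scala. This is a minimal illustration, not code from the thread; the input path and field layout are hypothetical:

```scala
import org.apache.spark.{SparkConf, SparkContext}

object CacheDemo {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("cache-demo"))

    // Hypothetical input path.
    val lines = sc.textFile("hdfs:///data/events.txt")

    // First stage: parse and filter. cache() keeps the result in memory,
    // so later jobs reuse it instead of recomputing it or, as a chain of
    // MapReduce jobs would, re-reading it from HDFS.
    val parsed = lines.map(_.split(",")).filter(_.nonEmpty).cache()

    // Two separate actions over the same intermediate data; with
    // MapReduce, each would be its own job with a disk round-trip between.
    val total = parsed.count()
    val distinctKeys = parsed.map(_(0)).distinct().count()

    println(s"$total records, $distinctKeys distinct keys")
    sc.stop()
  }
}
```

Note that only the `distinct()` here triggers a shuffle (point 1); the `map` and `filter` stay within each partition.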
- Newbie question: what makes Spark run faster than MapReduce — Muler
- Re: Newbie question: what makes Spark run faster than MapReduce — Hien Luu
- Re: Newbie question: what makes Spark run faster than MapReduce — Corey Nolet