Re: Shuffle Write v/s Shuffle Read

2015-10-02 Thread Zoltán Zvara
> computations need the data on another node (e.g. group by, sort, etc.)
>
> Shuffle read and shuffle write should be mirrored on the left/right side
> of a shuffle between 2 stages.
>
> -adrian
>
> From: Kartik Mathur
> Date: Thursday, October 1, 2015 at 10:36 PM
> To:
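
The mirroring described in the quoted reply can be illustrated with a toy model. This is a sketch in plain Python, not Spark's implementation or API: the names `shuffle_write`, `shuffle_read` and `partition_for` are hypothetical, the input data is made up, and it counts records rather than bytes. It only shows that whatever the map side writes for a shuffle is what the reduce side of the next stage reads.

```python
import zlib

def partition_for(key, num_reducers):
    # Deterministic stand-in for a hash partitioner (assumption, not Spark's).
    return zlib.crc32(key.encode("utf-8")) % num_reducers

def shuffle_write(map_partition, num_reducers):
    """Map side: split one map task's records into one bucket per reducer."""
    buckets = {}
    for key, value in map_partition:
        buckets.setdefault(partition_for(key, num_reducers), []).append((key, value))
    return buckets

def shuffle_read(all_map_outputs, reducer_id):
    """Reduce side: fetch this reducer's bucket from every map task's output."""
    fetched = []
    for buckets in all_map_outputs:
        fetched.extend(buckets.get(reducer_id, []))
    return fetched

NUM_REDUCERS = 2

# Stage 1: two map tasks, each holding part of the data.
map_inputs = [
    [("a", 1), ("b", 2), ("a", 3)],
    [("b", 4), ("c", 5)],
]
map_outputs = [shuffle_write(p, NUM_REDUCERS) for p in map_inputs]
records_written = sum(len(b) for out in map_outputs for b in out.values())

# Stage 2: each reduce task pulls its bucket from all map outputs.
reduce_inputs = [shuffle_read(map_outputs, r) for r in range(NUM_REDUCERS)]
records_read = sum(len(part) for part in reduce_inputs)

# A group-by style aggregation now sees all values for a key in one place.
totals = {}
for part in reduce_inputs:
    for key, value in part:
        totals[key] = totals.get(key, 0) + value

print(records_written, records_read, totals)
```

In this model the write count of the first stage and the read count of the second stage are equal by construction, which is the "mirrored" behaviour one would expect to see on either side of a shuffle boundary.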

Re: Shuffle Write v/s Shuffle Read

2015-10-02 Thread Adrian Tanase
-adrian

From: Kartik Mathur
Date: Thursday, October 1, 2015 at 10:36 PM
To: user
Subject: Shuffle Write v/s Shuffle Read

Hi,

I am trying to better understand shuffle in Spark. Based on my understanding thus far,

Shuffle Write: writes stage output for intermediate stage on local disk

Shuffle Write v/s Shuffle Read

2015-10-01 Thread Kartik Mathur
Hi, I am trying to better understand shuffle in Spark. Based on my understanding thus far, *Shuffle Write*: writes the output of an intermediate stage to local disk if memory is not sufficient. For example, if each worker has 200 MB of memory for intermediate results and the results are 300 MB, then