take only brings n elements to the driver, which is probably still a win if n is small. I'm not sure what you mean by only taking a count argument -- what else would be an arg to take?
On Wed, Aug 5, 2015 at 4:49 PM, Sandeep Giri <sand...@knowbigdata.com> wrote: > Yes, but in the take() approach we will be bringing the data to the driver > and is no longer distributed. > > Also, the take() takes only count as argument which means that every time > we would transferring the redundant elements. > >