Re: pyspark mappartions ()

2016-05-14 Thread Sujit Pal
I built this recently using the accepted answer on this SO page:
http://stackoverflow.com/questions/26741714/how-does-the-pyspark-mappartitions-function-work/26745371

-sujit

On Sat, May 14, 2016 at 7:00 AM, Mathieu Longtin wrote:
> From memory:
> def

Re: pyspark mappartions ()

2016-05-14 Thread Mathieu Longtin
From memory:

    def processor(iterator):
        for item in iterator:
            newitem = do_whatever(item)
            yield newitem

    newdata = data.mapPartitions(processor)

Basically, your function takes an iterator over one partition's items as its argument, and must either be a generator (as above) or return an iterable.

On Sat, May 14, 2016 at 12:39 AM Abi
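For anyone who wants to try the pattern without a cluster, here is a minimal sketch that runs in plain Python. It assumes a made-up do_whatever (the post leaves it undefined) and uses ordinary lists as stand-ins for partitions. The processor_batch variant is also an assumption, added only to show the "return an iterable" form next to the generator form.

```python
def do_whatever(item):
    # Hypothetical per-item work; the original post does not define this.
    return item * 2


def processor(iterator):
    # Generator form: called once per partition with an iterator over
    # that partition's items, and yields the transformed items lazily.
    for item in iterator:
        yield do_whatever(item)


def processor_batch(iterator):
    # Alternative form: consume the partition and return an iterable.
    # This holds the whole partition in memory, unlike the generator.
    return [do_whatever(item) for item in iterator]


# Plain lists standing in for two partitions of an RDD (an assumption
# made so the sketch runs without Spark installed).
partitions = [[1, 2, 3], [4, 5]]

result = [out for part in partitions for out in processor(iter(part))]
batch_result = [out for part in partitions for out in processor_batch(iter(part))]

print(result)
print(batch_result)

# With a real RDD, assuming an existing SparkContext named sc:
#     newdata = sc.parallelize([1, 2, 3, 4, 5], 2).mapPartitions(processor)
#     newdata.collect()
```

The generator form is usually the safer default, since it never materializes a whole partition at once. The batch form can be handy when the per-partition work needs all items together, for example a bulk write.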

Re: pyspark mappartions ()

2016-05-13 Thread Abi
On Tue, May 10, 2016 at 2:20 PM, Abi wrote:
> Is there any example of this? I want to see how you write the
> iterable example

Re: pyspark mappartions ()

2016-05-10 Thread Abi
On May 10, 2016 2:20:25 PM EDT, Abi wrote:
> Is there any example of this? I want to see how you write the
> iterable example

pyspark mappartions ()

2016-05-10 Thread Abi
Is there any example of this? I want to see how you write the iterable example