Re: foreachPartition in Java

2013-11-18 Thread Yadid Ayzenberg

Great, I will use mapPartitions instead.
Thanks for the advice,

Yadid


On 11/17/13 8:13 PM, Aaron Davidson wrote:
Also, in general, you can workaround shortcomings in the Java API by 
converting to a Scala RDD (using JavaRDD's rdd() method). The API 
tends to be much clunkier since you have to jump through some hoops to 
talk to a Scala API in Java, though. In this case, JavaRDD's 
mapPartition() method will likely be the cleanest solution as Patrick 
said.



On Sun, Nov 17, 2013 at 5:03 PM, Patrick Wendell > wrote:


Can you just call mapPartitions and ignore the result?

- Patrick

On Sun, Nov 17, 2013 at 4:45 PM, Yadid Ayzenberg
mailto:ya...@media.mit.edu>> wrote:
> Hi,
>
> According to the API, foreachPartition() is not yet implemented
in Java.
> Are there any workarounds to get the same functionality ?
> I have a non serializable DB connection and instantiating it is
pretty
> expensive, so I prefer to do it on a per partition basis.
>
> thanks,
> Yadid
>






Re: foreachPartition in Java

2013-11-17 Thread Aaron Davidson
Also, in general, you can workaround shortcomings in the Java API by
converting to a Scala RDD (using JavaRDD's rdd() method). The API tends to
be much clunkier since you have to jump through some hoops to talk to a
Scala API in Java, though. In this case, JavaRDD's mapPartition() method
will likely be the cleanest solution as Patrick said.


On Sun, Nov 17, 2013 at 5:03 PM, Patrick Wendell  wrote:

> Can you just call mapPartitions and ignore the result?
>
> - Patrick
>
> On Sun, Nov 17, 2013 at 4:45 PM, Yadid Ayzenberg 
> wrote:
> > Hi,
> >
> > According to the API, foreachPartition() is not yet implemented in Java.
> > Are there any workarounds to get the same functionality ?
> > I have a non serializable DB connection and instantiating it is pretty
> > expensive, so I prefer to do it on a per partition basis.
> >
> > thanks,
> > Yadid
> >
>


Re: foreachPartition in Java

2013-11-17 Thread Patrick Wendell
Can you just call mapPartitions and ignore the result?

- Patrick

On Sun, Nov 17, 2013 at 4:45 PM, Yadid Ayzenberg  wrote:
> Hi,
>
> According to the API, foreachPartition() is not yet implemented in Java.
> Are there any workarounds to get the same functionality ?
> I have a non serializable DB connection and instantiating it is pretty
> expensive, so I prefer to do it on a per partition basis.
>
> thanks,
> Yadid
>


foreachPartition in Java

2013-11-17 Thread Yadid Ayzenberg

Hi,

According to the API, foreachPartition() is not yet implemented in Java.
Are there any workarounds to get the same functionality ?
I have a non serializable DB connection and instantiating it is pretty 
expensive, so I prefer to do it on a per partition basis.


thanks,
Yadid