You can't quite do cleanup in mapPartitions in that way. Here is a bit more
explanation (farther down):
http://blog.cloudera.com/blog/2014/09/how-to-translate-from-mapreduce-to-apache-spark/
On Dec 28, 2014 8:18 AM, "Akhil Das" <ak...@sigmoidanalytics.com> wrote:

> Something like?
>
> val a = myRDD.mapPartitions(p => {
>
>
>
>             //Do the init
>
>             //Perform some operations
>
>             //Shut it down?
>
>          })
>
>
>
> Thanks
> Best Regards
>
> On Sun, Dec 28, 2014 at 1:53 AM, Kevin Burton <bur...@spinn3r.com> wrote:
>
>> I have a job where I want to map over all data in a cassandra database.
>>
>> I’m then selectively sending things to my own external system (ActiveMQ)
>> if the item matches criteria.
>>
>> The problem is that I need to do some init and shutdown.  Basically on
>> init I need to create ActiveMQ connections and on shutdown I need to close
>> them or daemon threads will be left running.
>>
>> What’s the best way to accomplish this. I could find it after I
>> RTFMd…(but perhaps I missed  it)
>>
>> --
>>
>> Founder/CEO Spinn3r.com
>> Location: *San Francisco, CA*
>> blog: http://burtonator.wordpress.com
>> … or check out my Google+ profile
>> <https://plus.google.com/102718274791889610666/posts>
>> <http://spinn3r.com>
>>
>>
>

Reply via email to