Yes, though it's a little more complex than that:

http://mail-archives.apache.org/mod_mbox/spark-user/201407.mbox/%3CCAPH-c_O9kQO6yJ4khXUVdO=+D4vj=JfG2tP9eqn5RPko=dr...@mail.gmail.com%3E
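
For what it's worth, here is a minimal Python sketch of the per-partition pattern discussed below. It is not taken from the linked thread, and FakeConnection, process_partition and broken_partition are made-up names for illustration. The one complication I am sure of is that iterators are lazy: if you close the connection before the returned iterator has been consumed, the rows are processed against a closed connection. Writing the function as a generator with try/finally avoids that.

```python
class FakeConnection:
    """Hypothetical stand-in for a real external connection (assumption)."""

    def __init__(self):
        self.is_open = True
        self.written = []

    def write(self, row):
        if not self.is_open:
            raise RuntimeError("connection already closed")
        self.written.append(row)
        return row

    def close(self):
        self.is_open = False


def broken_partition(rows, connect=FakeConnection):
    """Looks right, but map() is lazy, so close() runs before any write."""
    conn = connect()
    out = map(conn.write, rows)  # nothing has been written yet
    conn.close()
    return out  # consuming this raises RuntimeError


def process_partition(rows, connect=FakeConnection):
    """One connection per partition, closed only after the last row."""
    conn = connect()  # opened when the first row is requested
    try:
        for row in rows:
            yield conn.write(row)
    finally:
        conn.close()  # runs once the partition's iterator is exhausted


if __name__ == "__main__":
    # Plain iterator standing in for one partition's rows.
    print(list(process_partition(iter([1, 2, 3]))))
```

With PySpark, a function of this shape could be passed as rdd.mapPartitions(process_partition); when no output is needed, rdd.foreachPartition with an ordinary (non-generator) function that loops and then closes is the simpler choice.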

On Fri, Oct 3, 2014 at 9:58 AM, Mayur Rustagi <mayur.rust...@gmail.com> wrote:
> The current approach is to use mapPartitions: initialize the connection at
> the beginning, iterate through the data, and then close the connection.
>
>
> Mayur Rustagi
> Ph: +1 (760) 203 3257
> http://www.sigmoidanalytics.com
> @mayur_rustagi
>
>
> On Fri, Oct 3, 2014 at 10:16 AM, Stephen Boesch <java...@gmail.com> wrote:
>>
>>
>> Suppose some connection or other external resource must be accessed or
>> mutated by each of the rows processed within a single worker thread. That
>> connection should be opened only once, before the first row is accessed,
>> and closed only after the last row is completed.
>>
>> It is my understanding that work is presently underway (Reynold Xin and
>> others) on defining an external resources API to address this. What is
>> the recommended approach in the meantime?
>
>

