[ 
https://issues.apache.org/jira/browse/KUDU-1214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15183700#comment-15183700
 ] 

Todd Lipcon commented on KUDU-1214:
-----------------------------------

Definitely the CreateClient call is expensive enough that you wouldn't want to 
do it over and over again. But, why not define the client outside the map call, 
and figure out how to make it serializable so it can be passed along with the 
closure and re-instantiated? Or perhaps some kind of lazy initialization? Are 
there other examples that ship with Spark that look like the proposed API?

> Add Integration points for Spark, Spark Streaming, and Spark SQL
> ----------------------------------------------------------------
>
>                 Key: KUDU-1214
>                 URL: https://issues.apache.org/jira/browse/KUDU-1214
>             Project: Kudu
>          Issue Type: New Feature
>          Components: integration
>            Reporter: Ted Malaska
>         Attachments: KUDU-1214.1.patch
>
>
> This Jira will be broken up into four main jira:
> 1. Add Support for Spark RDD map and foreach integration with Kudu
> 2. Add Support for Spark DStream map and foreach integration with Kudu
> 3. Add Support for Spark SQL defaultSource and push down predicates
> 4. Add documentation for all Spark Integrations



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to