[ 
https://issues.apache.org/jira/browse/SPARK-2345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14050659#comment-14050659
 ] 

Hari Shreedharan commented on SPARK-2345:
-----------------------------------------

Currently, the job (like saveAsTextFile or saveAsHadoopFile) on the DStream 
will cause the rdd.save calls to be executed on sparkContext.runJob, which in 
turn will call the foreachfunc which is passed to the ForEachDStream. So a case 
where this DStream is saved off works fine. 

But if you simply do a register and have the foreachfunc do some processing and 
custom writes may cause the application to be run locally.

> ForEachDStream should have an option of running the foreachfunc on Spark
> ------------------------------------------------------------------------
>
>                 Key: SPARK-2345
>                 URL: https://issues.apache.org/jira/browse/SPARK-2345
>             Project: Spark
>          Issue Type: Bug
>            Reporter: Hari Shreedharan
>
> Today the Job generated simply calls the foreachfunc, but does not run it on 
> spark itself using the sparkContext.runJob method.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to