[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-26 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14292448#comment-14292448 ] Xuefu Zhang commented on SPARK-2688: Yeah. We don't need syntactic sugar, but a

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-26 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14292431#comment-14292431 ] Sean Owen commented on SPARK-2688: As [~irashid] says, #1 is just syntactic sugar on

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-26 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14292415#comment-14292415 ] Xuefu Zhang commented on SPARK-2688: #1 above is exactly what Hive needs badly. Need

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14291144#comment-14291144 ] Sean Owen commented on SPARK-2688: I am still not clear on what you are trying to do

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-25 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14291184#comment-14291184 ] Sandy Ryza commented on SPARK-2688: [~xuefuz] Spark already has transformations that

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14291193#comment-14291193 ] Sean Owen commented on SPARK-2688: (Heh, OK, well I would have closed this one instead

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14291192#comment-14291192 ] Sean Owen commented on SPARK-2688: [~sandyr] Yes I can appreciate the difference between

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-25 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14291214#comment-14291214 ] Imran Rashid commented on SPARK-2688: [~airhorns] I completely agree with your use

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-25 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14291134#comment-14291134 ] Xuefu Zhang commented on SPARK-2688: I think SPARK-3622 is related to this JIRA but

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-23 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289813#comment-14289813 ] Sandy Ryza commented on SPARK-2688: I agree that this is worth keeping open. Allowing

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-23 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289798#comment-14289798 ] Sean Owen commented on SPARK-2688: [~airhorns] Persisting does not mean hitting disk, if

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2015-01-23 Thread Harry Brundage (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289771#comment-14289771 ] Harry Brundage commented on SPARK-2688: I respectfully disagree :) Persist is one

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2014-07-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14075720#comment-14075720 ] Sean Owen commented on SPARK-2688: If you persist/cache rdd2, it is not recomputed. You

[jira] [Commented] (SPARK-2688) Need a way to run multiple data pipeline concurrently

2014-07-25 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14074368#comment-14074368 ] Xuefu Zhang commented on SPARK-2688: cc: [~rxin] [~sandyr] Need a way to run