[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-06-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15337728#comment-15337728 ] Sean Owen commented on SPARK-6817: -- No, the best thing is just bulk-changing the issues t

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-06-17 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15337525#comment-15337525 ] Shivaram Venkataraman commented on SPARK-6817: -- I think all the ones we need

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-04-29 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15264954#comment-15264954 ] Shivaram Venkataraman commented on SPARK-6817: -- I just merged https://issues.

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-04-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15264836#comment-15264836 ] Michael Armbrust commented on SPARK-6817: - [~shivaram] Still trying to get any of t

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-21 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15110359#comment-15110359 ] Sun Rui commented on SPARK-6817: for dapply(), user can call repartition() to set an appro

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15110299#comment-15110299 ] Felix Cheung commented on SPARK-6817: - Thanks for putting together the doc [~sunrui

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-19 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15108098#comment-15108098 ] Sun Rui commented on SPARK-6817: I wrote an implementation document at https://docs.googl

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-19 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15108097#comment-15108097 ] Sun Rui commented on SPARK-6817: Moved SQL UDF related stuff to SPARK-12918. > DataFrame

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-15 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15101805#comment-15101805 ] Sun Rui commented on SPARK-6817: Spark is now supporting vectorized execution via Columnar

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097723#comment-15097723 ] Sun Rui commented on SPARK-6817: OK. I will follow the design of the original design doc,

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097712#comment-15097712 ] Antonio Piccolboni commented on SPARK-6817: --- Thanks! > DataFrame UDFs in R > --

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097560#comment-15097560 ] Sun Rui commented on SPARK-6817: https://github.com/apache/spark/blob/master/sql/core/src/

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097552#comment-15097552 ] Antonio Piccolboni commented on SPARK-6817: --- I need to see the code to understan

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097458#comment-15097458 ] Sun Rui commented on SPARK-6817: projecting batching rows for UDF are implementation optimi

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15096738#comment-15096738 ] Antonio Piccolboni commented on SPARK-6817: --- So I am not sure row-orientation me

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095868#comment-15095868 ] Sun Rui commented on SPARK-6817: If we think that column-oriented UDF is more important, I

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095860#comment-15095860 ] Sun Rui commented on SPARK-6817: I agree R's efficiency comes from vectorization. Here UDF

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095776#comment-15095776 ] Antonio Piccolboni commented on SPARK-6817: --- My question made sense only wrt the

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Weiqiang Zhuang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095756#comment-15095756 ] Weiqiang Zhuang commented on SPARK-6817: We did see both apply use cases. But the

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095747#comment-15095747 ] Reynold Xin commented on SPARK-6817: Please take a look at the original design doc for

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095745#comment-15095745 ] Sun Rui commented on SPARK-6817: [~rxin] Row-oriented R UDF is for SQL and is similar to P

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095734#comment-15095734 ] Jeff Zhang commented on SPARK-6817: --- +1 on block based API, UDF would usually call other

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095721#comment-15095721 ] Reynold Xin commented on SPARK-6817: [~sunrui] Why are you focusing on a row-based AP

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095600#comment-15095600 ] Sun Rui commented on SPARK-6817: [~shivaram] I first focus on the row-based UDF functional

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095594#comment-15095594 ] Sun Rui commented on SPARK-6817: [~piccolbo] I am not sure if I understand your meaning. T

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095590#comment-15095590 ] Sun Rui commented on SPARK-6817: [~mpollock], this PR will support row-based UDF. UDF oper

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-12 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15095588#comment-15095588 ] Sun Rui commented on SPARK-6817: Attached the first draft design doc, please review and gi

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2015-12-18 Thread Matt Pollock (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15064664#comment-15064664 ] Matt Pollock commented on SPARK-6817: - Will this only support UDFs that operate on a f

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2015-12-16 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15061420#comment-15061420 ] Antonio Piccolboni commented on SPARK-6817: --- Will this form of partition-UDF ava

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2015-12-15 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15059658#comment-15059658 ] Sun Rui commented on SPARK-6817: Start working on it > DataFrame UDFs in R >

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2015-11-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14987101#comment-14987101 ] Michael Armbrust commented on SPARK-6817: - Should we bump this now that we are pas

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2015-08-28 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14720845#comment-14720845 ] Shivaram Venkataraman commented on SPARK-6817: -- The idea behind having `dappl

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2015-08-27 Thread Indrajit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14717079#comment-14717079 ] Indrajit commented on SPARK-6817: -- Here are some suggestions on the proposed API. If the

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2015-08-25 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14711972#comment-14711972 ] Shivaram Venkataraman commented on SPARK-6817: -- I've created a design doc for

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2015-06-19 Thread Aleksander Eskilson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14593823#comment-14593823 ] Aleksander Eskilson commented on SPARK-6817: Makes sense, thanks for the clari

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2015-06-18 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592087#comment-14592087 ] Shivaram Venkataraman commented on SPARK-6817: -- I think there are two separat

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2015-06-18 Thread Aleksander Eskilson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14591984#comment-14591984 ] Aleksander Eskilson commented on SPARK-6817: It looks like this issue also rel