[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/9222 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-26 Thread shivaram
Github user shivaram commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-151305314 Ok I see -- we used to make a Seq[Array[Any]] and then call toArray on it before and now we just have Array[Array[Any]]. Alright change LGTM. Merging this --- If your

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-26 Thread FRosner
Github user FRosner commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-151293332 @felixcheung the runtime performance tests I conducted came to the result that the performance gain is minimal if there is enough RAM available. It was faster all the ti

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-26 Thread FRosner
Github user FRosner commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-151236579 In my opinion we are good to go if there are no objections from your side. -- Sent from my phone. Please excuse my brevity. Shivaram Venkataraman wrote

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-26 Thread felixcheung
Github user felixcheung commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-151232905 @FRosner I assume this is faster than the original implementation? --- If your project is set up for it, you can reply to this email and have your reply appear on Gi

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-26 Thread shivaram
Github user shivaram commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-151194783 The class is already `private[r]` so it should be fine. @FRosner Is the change good to go or are you doing any more perf tests ? --- If your project is set up

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-26 Thread sun-rui
Github user sun-rui commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-151060081 @FRosner, I am not denying a unit test:). My point is that it seems not necessary to add a Scala unit case for dfToCols, which is dedicated for SparkR now and its logic

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-26 Thread FRosner
Github user FRosner commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-151056747 @sun-rui I am not sure how callJStatic works but I can imagine that making a method private does not allow it to be called - neither from the test, nor from R. Why would

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-24 Thread sun-rui
Github user sun-rui commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150803696 @FRosner, dfToCols is not a public API, and is now only a helper function for SparkR. Could you add a private modifier for it? --- If your project is set up for it, you

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-24 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150788661 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-24 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150788665 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-24 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150788399 **[Test build #44293 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44293/consoleFull)** for PR 9222 at commit [`7e34d30`](https://git

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-24 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150778791 **[Test build #44293 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44293/consoleFull)** for PR 9222 at commit [`7e34d30`](https://gith

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-24 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150778341 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-24 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150778337 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-24 Thread FRosner
Github user FRosner commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150778223 @shivaram I added the missing license header. Sorry for missing this one. --- If your project is set up for it, you can reply to this email and have your reply appear on

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-24 Thread FRosner
Github user FRosner commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150778118 @sun-rui thanks for your comments. You should not hesitate to add unit tests for functions that are not tested, yet. Tests help other developers (like me) to change stuf

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread sun-rui
Github user sun-rui commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150758802 @FRosner, it seems that you are investigating why the performance of the old implementation does not behave as you expect? No matter the investigation result, your new v

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread sun-rui
Github user sun-rui commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150757273 @FRosner, there is no test case dedicated for dfToCols in R now. I am still reluctant to add a new test case in Scala simply for dfToCols. I'd like to leave the decision

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150700637 **[Test build #44262 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44262/consoleFull)** for PR 9222 at commit [`f857546`](https://git

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150700643 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150700641 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150700586 **[Test build #44262 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/44262/consoleFull)** for PR 9222 at commit [`f857546`](https://gith

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150700188 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150700209 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread shivaram
Github user shivaram commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150699863 Jenkins, ok to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this f

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread Gerrrr
Github user Ge commented on the pull request: https://github.com/apache/spark/pull/9222#issuecomment-150659258 The problem might be related to GC as a current implementation of `dfToCols` allocates memory for the map result and the array while @FRosner implementation only allocate

[GitHub] spark pull request: SPARK-11258 Converting a Spark DataFrame into ...

2015-10-23 Thread FRosner
Github user FRosner commented on a diff in the pull request: https://github.com/apache/spark/pull/9222#discussion_r42884105 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala --- @@ -130,16 +130,18 @@ private[r] object SQLUtils { } def d