[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/10911 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is ena

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175381894 I'm going to merge this. Thanks. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175362788 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175362791 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175362482 **[Test build #50146 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50146/consoleFull)** for PR 10911 at commit [`3ff902a`](https://g

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175324830 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175324834 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175324264 **[Test build #50126 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50126/consoleFull)** for PR 10911 at commit [`4a40802`](https://g

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175322730 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175322727 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175319396 **[Test build #50146 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50146/consoleFull)** for PR 10911 at commit [`3ff902a`](https://gi

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50926992 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketch.java --- @@ -59,16 +59,17 @@ public enum Version { /**

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50926614 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala --- @@ -1270,4 +1270,37 @@ class DataFrameSuite extends QueryTest with SharedS

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50926598 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -325,27 +321,43 @@ public void writeTo(OutputStream out)

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50925871 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala --- @@ -1270,4 +1270,37 @@ class DataFrameSuite extends QueryTest with SharedSQLCon

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50925607 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -325,27 +321,43 @@ public void writeTo(OutputStream out) thro

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50925566 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala --- @@ -1270,4 +1270,37 @@ class DataFrameSuite extends QueryTest with SharedS

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50925449 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala --- @@ -1270,4 +1270,37 @@ class DataFrameSuite extends QueryTest with SharedS

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50925247 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -325,27 +321,43 @@ public void writeTo(OutputStream out)

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50925316 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -325,27 +321,43 @@ public void writeTo(OutputStream out)

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175305092 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175305090 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50919784 --- Diff: sql/core/pom.xml --- @@ -44,6 +44,11 @@ org.apache.spark + spark-sketch_2.10 --- End diff --

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50919715 --- Diff: sql/core/pom.xml --- @@ -44,6 +44,11 @@ org.apache.spark + spark-sketch_2.10 --- End diff --

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50918206 --- Diff: sql/core/pom.xml --- @@ -44,6 +44,11 @@ org.apache.spark + spark-sketch_2.10 --- End diff --

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50914282 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -348,4 +374,30 @@ public static CountMinSketchImpl readF

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread liancheng
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175275779 Josh is looking into the PySpark test failure. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175256282 **[Test build #50126 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50126/consoleFull)** for PR 10911 at commit [`4a40802`](https://gi

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175254946 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175254943 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175254410 **[Test build #50117 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50117/consoleFull)** for PR 10911 at commit [`fb23a24`](https://g

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50907571 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -348,4 +374,30 @@ public static CountMinSketchImpl readFrom(I

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50907599 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -348,4 +374,30 @@ public static CountMinSketchImpl readFrom(I

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50906771 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -17,17 +17,37 @@ package org.apache.spark.uti

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50906324 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -348,4 +374,30 @@ public static CountMinSketchImpl readF

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50893099 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -348,4 +374,30 @@ public static CountMinSketchImpl readF

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-175176766 **[Test build #50117 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50117/consoleFull)** for PR 10911 at commit [`fb23a24`](https://gi

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50880387 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -17,17 +17,37 @@ package org.apache.spark.uti

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50878077 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala --- @@ -309,4 +311,84 @@ final class DataFrameStatFunctions private[sql](df:

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-26 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50877569 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala --- @@ -309,4 +311,84 @@ final class DataFrameStatFunctions private[sql

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50802122 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala --- @@ -309,4 +311,84 @@ final class DataFrameStatFunctions private[sql

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50801910 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala --- @@ -309,4 +311,84 @@ final class DataFrameStatFunctions private[sql

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174848875 cc @JoshRosen is the python tests broken? ``` Running PySpark tests. Output is in /home/jenkins/workspace/SparkPullRequestBuilder/python/unit-tests.log Err

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174847369 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174847368 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174847279 **[Test build #50061 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50061/consoleFull)** for PR 10911 at commit [`32a9860`](https://g

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50797694 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -368,4 +379,30 @@ public static CountMinSketchImpl readFrom(I

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174816049 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174816044 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174815506 **[Test build #50055 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50055/consoleFull)** for PR 10911 at commit [`4e5d1af`](https://g

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174785197 **[Test build #50061 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50061/consoleFull)** for PR 10911 at commit [`32a9860`](https://gi

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174783482 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174783481 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50787971 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -52,6 +57,10 @@ private double eps; privat

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50786975 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala --- @@ -309,4 +311,88 @@ final class DataFrameStatFunctions private[sql](df:

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50786852 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala --- @@ -309,4 +311,88 @@ final class DataFrameStatFunctions private[sql](df:

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50786646 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala --- @@ -309,4 +311,88 @@ final class DataFrameStatFunctions private[sql](df:

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50786602 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala --- @@ -309,4 +311,88 @@ final class DataFrameStatFunctions private[sql](df:

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50786480 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -347,8 +356,10 @@ public void writeTo(OutputStream out) throw

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50786439 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -41,7 +46,7 @@ * - ... * - Row depth - 1 (wid

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50786449 --- Diff: common/sketch/src/main/java/org/apache/spark/util/sketch/CountMinSketchImpl.java --- @@ -52,6 +57,10 @@ private double eps; private dou

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174761421 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174761419 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174760913 **[Test build #50055 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/50055/consoleFull)** for PR 10911 at commit [`4e5d1af`](https://gi

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/10911#discussion_r50784455 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameStatFunctions.scala --- @@ -45,7 +48,6 @@ final class DataFrameStatFunctions private[sql](d

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread liancheng
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/10911#issuecomment-174756777 cc @cloud-fan @rxin @yhuai --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-12935][SQL] DataFrame API for Count-Min...

2016-01-25 Thread liancheng
GitHub user liancheng opened a pull request: https://github.com/apache/spark/pull/10911 [SPARK-12935][SQL] DataFrame API for Count-Min Sketch This PR integrates Count-Min Sketch from spark-sketch into DataFrame. This version resorts to `RDD.aggregate` for building the sketch. A more