[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-12-30 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/6416 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-12-30 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-168112838 I'm going to close this pull request. If this is still relevant and you are interested in pushing it forward, please open a new pull request. Thanks! --- If your project i

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-09-22 Thread hvanhovell
Github user hvanhovell commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-142309139 @MLnick I guess it depends. The other ```dataframe.stat``` functions have not been implemented as UDAFs, so this is not nessecary. However I do think that CMS is a us

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-09-22 Thread MLnick
Github user MLnick commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-142245752 @rxin @hvanhovell @yhuai should this be synced up with #8362 in terms of implementation approach? --- If your project is set up for it, you can reply to this email and

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-08-26 Thread viirya
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-135263455 @rxin Is it good to have this as an aggregation function too? If so, I will update it too. --- If your project is set up for it, you can reply to this email and have you

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-07-13 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-121072431 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-06-01 Thread viirya
Github user viirya commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-107348123 For input DataFrame with a column name "A": A 1 10 5 10 8 The returned DataFrame has two columns "A"

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-06-01 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-107342342 Can you give an example of what your returned DataFrame looks like? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub a

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-05-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-105657152 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-05-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-105657154 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/33

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-05-26 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-105657127 [Test build #33531 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/33531/consoleFull) for PR 6416 at commit [`9972421`](https://git

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-05-26 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-105629627 [Test build #33531 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/33531/consoleFull) for PR 6416 at commit [`9972421`](https://gith

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-05-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-105628542 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-05-26 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6416#issuecomment-105628479 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6763][SQL] Add CountMinSketch to DataFr...

2015-05-26 Thread viirya
GitHub user viirya opened a pull request: https://github.com/apache/spark/pull/6416 [SPARK-6763][SQL] Add CountMinSketch to DataFrame for estimating frequencies JIRA: https://issues.apache.org/jira/browse/SPARK-6763 You can merge this pull request into a Git repository by running: