[GitHub] spark issue #14948: [SPARK-17389] [MLLIB] Significant KMeans speedup with be...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14948 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14948: [SPARK-17389] [MLLIB] Significant KMeans speedup with be...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14948 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64896/ Test PASSed. ---

[GitHub] spark issue #14948: [SPARK-17389] [MLLIB] Significant KMeans speedup with be...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14948 **[Test build #64896 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64896/consoleFull)** for PR 14948 at commit

[GitHub] spark issue #14452: [SPARK-16849][SQL] Improve subquery execution by dedupli...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14452 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64895/ Test PASSed. ---

[GitHub] spark issue #14452: [SPARK-16849][SQL] Improve subquery execution by dedupli...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14452 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14452: [SPARK-16849][SQL] Improve subquery execution by dedupli...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14452 **[Test build #64895 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64895/consoleFull)** for PR 14452 at commit

[GitHub] spark issue #13767: [MINOR][SQL] Not dropping all necessary tables

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13767 **[Test build #64897 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64897/consoleFull)** for PR 13767 at commit

[GitHub] spark issue #13767: [MINOR][SQL] Not dropping all necessary tables

2016-09-03 Thread techaddict
Github user techaddict commented on the issue: https://github.com/apache/spark/pull/13767 @srowen yes, the issue is still there. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #13767: [MINOR][SQL] Not dropping all necessary tables

2016-09-03 Thread techaddict
GitHub user techaddict reopened a pull request: https://github.com/apache/spark/pull/13767 [MINOR][SQL] Not dropping all necessary tables ## What changes were proposed in this pull request? was not dropping table `parquet_t3` ## How was this patch tested? tested

[GitHub] spark pull request #14948: [SPARK-17389] [MLLIB] Significant KMeans speedup ...

2016-09-03 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/14948#discussion_r77435330 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/clustering/KMeansSuite.scala --- @@ -75,7 +75,7 @@ class KMeansSuite extends SparkFunSuite with

[GitHub] spark issue #14948: [SPARK-17389] [MLLIB] Significant KMeans speedup with be...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14948 **[Test build #64896 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64896/consoleFull)** for PR 14948 at commit

[GitHub] spark issue #13767: [MINOR][SQL] Not dropping all necessary tables

2016-09-03 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/13767 @techaddict I missed this, is it still relevant? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request #13767: [MINOR][SQL] Not dropping all necessary tables

2016-09-03 Thread techaddict
Github user techaddict closed the pull request at: https://github.com/apache/spark/pull/13767 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark issue #14452: [SPARK-16849][SQL] Improve subquery execution by dedupli...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14452 **[Test build #64895 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64895/consoleFull)** for PR 14452 at commit

[GitHub] spark pull request #14901: [SPARK-17347][SQL][Examples]Encoder in Dataset ex...

2016-09-03 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/14901 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #14901: [SPARK-17347][SQL][Examples]Encoder in Dataset example h...

2016-09-03 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/14901 merged to master/2.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so,

[GitHub] spark issue #14826: [SPARK-17311] [MLLIB] Standardize Python-Java MLlib API ...

2016-09-03 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/14826 Assuming my interpretation of the last comment was right, I think this is good to go. Even if somehow it 'worked' before this is a cleaner implementation of the same behavior in that case. --- If

[GitHub] spark pull request #14597: [SPARK-17017][MLLIB][ML] add a chiSquare Selector...

2016-09-03 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/14597#discussion_r77433980 --- Diff: python/pyspark/mllib/feature.py --- @@ -276,24 +276,64 @@ class ChiSqSelector(object): """ Creates a ChiSquared feature selector.

[GitHub] spark issue #14924: [SPARK-17299] TRIM/LTRIM/RTRIM should not strips charact...

2016-09-03 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/14924 LGTM. Seems like this should go into 2.0.x as well since it's really a bug fix. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If

[GitHub] spark issue #14948: [SPARK-17389] [MLLIB] Significant KMeans speedup with be...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14948 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64894/ Test FAILed. ---

[GitHub] spark issue #14948: [SPARK-17389] [MLLIB] Significant KMeans speedup with be...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14948 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14948: [SPARK-17389] [MLLIB] Significant KMeans speedup with be...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14948 **[Test build #64894 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64894/consoleFull)** for PR 14948 at commit

[GitHub] spark issue #14923: [SPARK-17363][ML][MLLib] fix MultivariantOnlineSummerize...

2016-09-03 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/14923 Merged to master too --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so,

[GitHub] spark pull request #14923: [SPARK-17363][ML][MLLib] fix MultivariantOnlineSu...

2016-09-03 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/14923 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request #14433: [SPARK-16829][SparkR]:sparkR sc.setLogLevel doesn...

2016-09-03 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/14433#discussion_r77433830 --- Diff: core/src/main/scala/org/apache/spark/internal/Logging.scala --- @@ -135,7 +135,8 @@ private[spark] trait Logging { val replLevel =

[GitHub] spark issue #14948: [SPARK-17389] [MLLIB] Significant KMeans speedup with be...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14948 **[Test build #64894 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64894/consoleFull)** for PR 14948 at commit

[GitHub] spark pull request #14948: [SPARK-17389] [MLLIB] Significant KMeans speedup ...

2016-09-03 Thread srowen
GitHub user srowen opened a pull request: https://github.com/apache/spark/pull/14948 [SPARK-17389] [MLLIB] Significant KMeans speedup with better choice of init steps, optimizing to remove 'runs' ## What changes were proposed in this pull request? - Deprecate KMeans 'runs'

[GitHub] spark issue #14947: [SPARK-17388][SQL] Support for inferring type date/times...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14947 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64893/ Test FAILed. ---

[GitHub] spark issue #14947: [SPARK-17388][SQL] Support for inferring type date/times...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14947 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14947: [SPARK-17388][SQL] Support for inferring type date/times...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14947 **[Test build #64893 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64893/consoleFull)** for PR 14947 at commit

[GitHub] spark issue #14919: [SPARK-17354][SQL] Partitioning by dates/timestamps shou...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14919 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14919: [SPARK-17354][SQL] Partitioning by dates/timestamps shou...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14919 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64891/ Test PASSed. ---

[GitHub] spark issue #14919: [SPARK-17354][SQL] Partitioning by dates/timestamps shou...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14919 **[Test build #64891 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64891/consoleFull)** for PR 14919 at commit

[GitHub] spark issue #14946: [SPARK-17353] [SPARK-16943] [SPARK-16942] [SPARK-16959] ...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14946 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14946: [SPARK-17353] [SPARK-16943] [SPARK-16942] [SPARK-16959] ...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14946 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64892/ Test FAILed. ---

[GitHub] spark issue #14946: [SPARK-17353] [SPARK-16943] [SPARK-16942] [SPARK-16959] ...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14946 **[Test build #64892 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64892/consoleFull)** for PR 14946 at commit

[GitHub] spark issue #11205: [SPARK-11334][Core] Handle maximum task failure situatio...

2016-09-03 Thread rustagi
Github user rustagi commented on the issue: https://github.com/apache/spark/pull/11205 I am seeing this issue quite frequently. Not sure what is causing it but frequently we will get a onTaskEnd event after a stage has ended. This will cause the numRunningTasks to become negative. If

[GitHub] spark issue #14947: [SPARK-17388][SQL] Support for inferring type date/times...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14947 **[Test build #64893 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64893/consoleFull)** for PR 14947 at commit

[GitHub] spark issue #14947: [SPARK-17388][SQL] Support for inferring type date/times...

2016-09-03 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/14947 Some tests might be failed due to https://github.com/apache/spark/pull/14919. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well.

[GitHub] spark pull request #14947: [SPARK-17388][SQL] Support for inferring type dat...

2016-09-03 Thread HyukjinKwon
GitHub user HyukjinKwon opened a pull request: https://github.com/apache/spark/pull/14947 [SPARK-17388][SQL] Support for inferring type date/timestamp/decimal for partition column ## What changes were proposed in this pull request? Currently, Spark only supports to infer

[GitHub] spark pull request #14888: [SPARK-17324] [SQL] Remove Direct Usage of HiveCl...

2016-09-03 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/14888#discussion_r77432512 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala --- @@ -613,21 +613,22 @@ private[hive] class HiveClientImpl(

[GitHub] spark issue #14946: [SPARK-17353] [SPARK-16943] [SPARK-16942] [SPARK-16959] ...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14946 **[Test build #64892 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64892/consoleFull)** for PR 14946 at commit

[GitHub] spark pull request #14946: [SPARK-17353] [SPARK-16943] [SPARK-16942] [SPARK-...

2016-09-03 Thread gatorsmile
GitHub user gatorsmile opened a pull request: https://github.com/apache/spark/pull/14946 [SPARK-17353] [SPARK-16943] [SPARK-16942] [SPARK-16959] [BACKPORT-2.0] [SQL] Fix multiple bugs in CREATE TABLE LIKE command ### What changes were proposed in this pull request? This PR is

[GitHub] spark issue #14937: [WIP] [SPARK-8519][SPARK-11560] [ML] [MLlib] Optimize KM...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14937 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14937: [WIP] [SPARK-8519][SPARK-11560] [ML] [MLlib] Optimize KM...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14937 **[Test build #64890 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64890/consoleFull)** for PR 14937 at commit

[GitHub] spark issue #14937: [WIP] [SPARK-8519][SPARK-11560] [ML] [MLlib] Optimize KM...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14937 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64890/ Test FAILed. ---

[GitHub] spark issue #14850: [SPARK-17279][SQL] better error message for exceptions d...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14850 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64889/ Test PASSed. ---

[GitHub] spark issue #14850: [SPARK-17279][SQL] better error message for exceptions d...

2016-09-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14850 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14919: [SPARK-17354][SQL] Partitioning by dates/timestamps shou...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14919 **[Test build #64891 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64891/consoleFull)** for PR 14919 at commit

[GitHub] spark issue #14850: [SPARK-17279][SQL] better error message for exceptions d...

2016-09-03 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14850 **[Test build #64889 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64889/consoleFull)** for PR 14850 at commit

[GitHub] spark issue #14919: [SPARK-17354][SQL] Partitioning by dates/timestamps shou...

2016-09-03 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/14919 @sameeragarwal Could you take another look please? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

<    1   2