[GitHub] spark pull request: [SPARK-10088] [sql] Add support for "stored as...

2015-08-18 Thread vanzin
GitHub user vanzin opened a pull request: https://github.com/apache/spark/pull/8282 [SPARK-10088] [sql] Add support for "stored as avro" in HiveQL parser. You can merge this pull request into a Git repository by running: $ git pull https://github.com/vanzin/spark SPARK-10088

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8197#issuecomment-132315683 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread MechCoder
Github user MechCoder commented on a diff in the pull request: https://github.com/apache/spark/pull/8197#discussion_r37337556 --- Diff: docs/ml-linear-methods.md --- @@ -118,12 +133,114 @@ lrModel = lr.fit(training) print("Weights: " + str(lrModel.weights)) print("Intercep

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8197#issuecomment-132315680 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-8473][SPARK-9889][ML] User guide and ex...

2015-08-18 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/8184#issuecomment-132314885 Looks good except for those items. I ran the Scala example.

[GitHub] spark pull request: [SPARK-8473][SPARK-9889][ML] User guide and ex...

2015-08-18 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/8184#discussion_r37337185 --- Diff: docs/ml-features.md --- @@ -649,6 +649,80 @@ for expanded in polyDF.select("polyFeatures").take(3): +## Discrete Cosine Trans

[GitHub] spark pull request: [SPARK-8473][SPARK-9889][ML] User guide and ex...

2015-08-18 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/8184#discussion_r37337180 --- Diff: docs/ml-features.md --- @@ -649,6 +649,80 @@ for expanded in polyDF.select("polyFeatures").take(3): +## Discrete Cosine Trans

[GitHub] spark pull request: [SPARK-9439] [yarn] External shuffle service r...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7943#issuecomment-132314508 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-9439] [yarn] External shuffle service r...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7943#issuecomment-132314505 Merged build finished. Test FAILed.

[GitHub] spark pull request: [SPARK-9439] [yarn] External shuffle service r...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7943#issuecomment-132314318 **[Test build #41136 timed out](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41136/console)** for PR 7943 at commit [`0d285d3`](https://github

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread MechCoder
Github user MechCoder commented on a diff in the pull request: https://github.com/apache/spark/pull/8197#discussion_r37336590 --- Diff: docs/ml-linear-methods.md --- @@ -118,12 +133,114 @@ lrModel = lr.fit(training) print("Weights: " + str(lrModel.weights)) print("Intercep

[GitHub] spark pull request: [SPARK-9944] [SQL] [WIP] Allow hive.metastore....

2015-08-18 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/8186#issuecomment-132313042 I don't think that this is the right semantic. Correct me if I'm wrong, but I think that we should use `hive.metastore.warehouse.dir` as the default location when crea

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/8197#discussion_r37336485 --- Diff: docs/ml-linear-methods.md --- @@ -118,12 +133,114 @@ lrModel = lr.fit(training) print("Weights: " + str(lrModel.weights)) print("Inter

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/8197#discussion_r37336212 --- Diff: docs/ml-linear-methods.md --- @@ -118,12 +133,114 @@ lrModel = lr.fit(training) print("Weights: " + str(lrModel.weights)) print("Inter

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread MechCoder
Github user MechCoder commented on a diff in the pull request: https://github.com/apache/spark/pull/8197#discussion_r37336156 --- Diff: docs/ml-linear-methods.md --- @@ -118,12 +133,114 @@ lrModel = lr.fit(training) print("Weights: " + str(lrModel.weights)) print("Intercep

[GitHub] spark pull request: [SPARK-7736] [core] Fix a race introduced in P...

2015-08-18 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/8258#issuecomment-132312001 pyspark fail is the same flaky test that has been failing on and off for a long time. I'm merging this.

[GitHub] spark pull request: [SPARK-10048][SPARKR] Support arbitrary nested...

2015-08-18 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/8276#issuecomment-132311929 @sun-rui Generally, the changes look good to me. Could you add unit tests for ArrayType? Do we want to support creating a DataFrame from ArrayType (could be another P

[GitHub] spark pull request: [SPARK-8167] Make tasks that fail from YARN pr...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8007#issuecomment-132311793 [Test build #41152 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41152/consoleFull) for PR 8007 at commit [`70d6a15`](https://gith

[GitHub] spark pull request: [SPARK-7736] [core] Fix a race introduced in P...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8258#issuecomment-132311430 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread MechCoder
Github user MechCoder commented on a diff in the pull request: https://github.com/apache/spark/pull/8197#discussion_r37335793 --- Diff: docs/ml-linear-methods.md --- @@ -118,12 +133,114 @@ lrModel = lr.fit(training) print("Weights: " + str(lrModel.weights)) print("Intercep

[GitHub] spark pull request: [SPARK-7736] [core] Fix a race introduced in P...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8258#issuecomment-132311429 Merged build finished. Test FAILed.

[GitHub] spark pull request: [SPARK-7736] [core] Fix a race introduced in P...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8258#issuecomment-132311307 [Test build #41137 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41137/console) for PR 8258 at commit [`d8831a2`](https://github.

[GitHub] spark pull request: [SPARK-8167] Make tasks that fail from YARN pr...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8007#issuecomment-132310857 Merged build triggered.

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8197#issuecomment-132311220 [Test build #41151 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41151/consoleFull) for PR 8197 at commit [`83d229f`](https://gith

[GitHub] spark pull request: [SPARK-8167] Make tasks that fail from YARN pr...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8007#issuecomment-132310936 Merged build started.

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread MechCoder
Github user MechCoder commented on a diff in the pull request: https://github.com/apache/spark/pull/8197#discussion_r37335316 --- Diff: docs/ml-linear-methods.md --- @@ -118,12 +133,114 @@ lrModel = lr.fit(training) print("Weights: " + str(lrModel.weights)) print("Intercep

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread MechCoder
Github user MechCoder commented on a diff in the pull request: https://github.com/apache/spark/pull/8197#discussion_r37335210 --- Diff: docs/ml-linear-methods.md --- @@ -118,12 +133,114 @@ lrModel = lr.fit(training) print("Weights: " + str(lrModel.weights)) print("Intercep

[GitHub] spark pull request: [SPARK-8167] Make tasks that fail from YARN pr...

2015-08-18 Thread mccheah
Github user mccheah commented on the pull request: https://github.com/apache/spark/pull/8007#issuecomment-132309535 Jenkins, retest this please

[GitHub] spark pull request: [SPARK-10080][SQL] Fix binary incompatibility ...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8281#issuecomment-132309228 [Test build #41150 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41150/consoleFull) for PR 8281 at commit [`541d9a0`](https://gith

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread feynmanliang
Github user feynmanliang commented on the pull request: https://github.com/apache/spark/pull/8197#issuecomment-132309366 Cool LGTM, @jkbradley for final pass

[GitHub] spark pull request: [SPARK-10082] [MLlib] Validate i, j in apply (...

2015-08-18 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/8271#discussion_r37335135 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/linalg/MatricesSuite.scala --- @@ -74,6 +74,17 @@ class MatricesSuite extends SparkFunSuite {

[GitHub] spark pull request: [SPARK-10082] [MLlib] Validate i, j in apply (...

2015-08-18 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/8271#discussion_r37335078 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/linalg/Matrices.scala --- @@ -281,7 +281,11 @@ class DenseMatrix( /** * @since 1.3.0

[GitHub] spark pull request: [SPARK-8167] Make tasks that fail from YARN pr...

2015-08-18 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/8007#issuecomment-132308808 > Should we try to merge this commit first and have your commit be rebased on top of this? I think that's the best approach; Mark can keep some (or all) of the co

[GitHub] spark pull request: [SPARK-10082] [MLlib] Validate i, j in apply (...

2015-08-18 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/8271#discussion_r37334947 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/linalg/Matrices.scala --- @@ -281,7 +281,11 @@ class DenseMatrix( /** * @since 1.3.0

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8197#issuecomment-132308575 Merged build triggered.

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8197#issuecomment-132308625 Merged build started.

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread MechCoder
Github user MechCoder commented on the pull request: https://github.com/apache/spark/pull/8197#issuecomment-132308590 I have merged your changes. Thanks!

[GitHub] spark pull request: [SPARK-10082] [MLlib] Validate i, j in apply (...

2015-08-18 Thread feynmanliang
Github user feynmanliang commented on the pull request: https://github.com/apache/spark/pull/8271#issuecomment-132308301 :+1: at first I was concerned that this might be unnecessary since `Array` checks for bounds, but this PR fixes a bug which occurs when a user provides an OOB row i
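The comment above is cut off, but the concern it raises can be illustrated with a small sketch. Assuming the matrix stores its entries column-major in one flat array (the class and method names below are hypothetical stand-ins, not Spark's actual `DenseMatrix` code), an out-of-bounds row index can still map to a valid flat index, so the backing array's own bounds check never fires and a wrong element comes back silently:

```python
class ColumnMajorMatrix:
    """Hypothetical stand-in for a dense matrix stored column-major in a flat list."""

    def __init__(self, num_rows, num_cols, values):
        if len(values) != num_rows * num_cols:
            raise ValueError("values must hold num_rows * num_cols entries")
        self.num_rows = num_rows
        self.num_cols = num_cols
        self.values = list(values)

    def get_unchecked(self, i, j):
        # Relies only on the flat list's bounds check, which validates
        # the combined offset, not i and j individually.
        return self.values[i + self.num_rows * j]

    def get_checked(self, i, j):
        # Validate each index against its own dimension before computing the offset.
        if not 0 <= i < self.num_rows:
            raise IndexError(f"row index {i} out of range [0, {self.num_rows})")
        if not 0 <= j < self.num_cols:
            raise IndexError(f"column index {j} out of range [0, {self.num_cols})")
        return self.values[i + self.num_rows * j]


# A 2 x 3 matrix whose columns are [1, 2], [3, 4], [5, 6].
m = ColumnMajorMatrix(2, 3, [1.0, 2.0, 3.0, 4.0, 5.0, 6.0])

# Row 2 does not exist, yet offset 2 + 2 * 0 = 2 is a valid list index,
# so the unchecked lookup quietly returns the element at (0, 1).
silently_wrong = m.get_unchecked(2, 0)
```

With the explicit per-dimension check, the same call `m.get_checked(2, 0)` raises an `IndexError` instead of returning an unrelated entry.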

[GitHub] spark pull request: [SPARK-8167] Make tasks that fail from YARN pr...

2015-08-18 Thread mccheah
Github user mccheah commented on the pull request: https://github.com/apache/spark/pull/8007#issuecomment-132307731 Cool - and once again, trying the new commit would be appreciated. Also @markgrover how do we want to resolve all of the duplicate work being done here and in #

[GitHub] spark pull request: [SPARK-10080][SQL] Fix binary incompatibility ...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8281#issuecomment-132307243 Merged build triggered.

[GitHub] spark pull request: [SPARK-10080][SQL] Fix binary incompatibility ...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8281#issuecomment-132307281 Merged build started.

[GitHub] spark pull request: [SPARK-10040][SQL] Use batch insert for JDBC w...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8273#issuecomment-132304931 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-10080][SQL] Fix binary incompatibility ...

2015-08-18 Thread marmbrus
GitHub user marmbrus opened a pull request: https://github.com/apache/spark/pull/8281 [SPARK-10080][SQL] Fix binary incompatibility for $ column interpolation Turns out that inner classes of inner objects are referenced directly, and thus moving it will break binary compatibility.

[GitHub] spark pull request: [CORE] Disable spark.shuffle.reduceLocality.en...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8280#issuecomment-132306708 [Test build #41149 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41149/consoleFull) for PR 8280 at commit [`f77e574`](https://gith

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread feynmanliang
Github user feynmanliang commented on the pull request: https://github.com/apache/spark/pull/8197#issuecomment-132306453 @MechCoder I sent you a PR, please merge if you like the changes.

[GitHub] spark pull request: [SPARK-9900][MLlib] User guide for Association...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8207#issuecomment-132302390 [Test build #41147 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41147/console) for PR 8207 at commit [`b038826`](https://github.

[GitHub] spark pull request: [CORE] Disable spark.shuffle.reduceLocality.en...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8280#issuecomment-132305977 Merged build started.

[GitHub] spark pull request: [CORE] Disable spark.shuffle.reduceLocality.en...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8280#issuecomment-132305959 Merged build triggered.

[GitHub] spark pull request: [CORE] Disable spark.shuffle.reduceLocality.en...

2015-08-18 Thread yhuai
GitHub user yhuai opened a pull request: https://github.com/apache/spark/pull/8280 [CORE] Disable spark.shuffle.reduceLocality.enabled by default. You can merge this pull request into a Git repository by running: $ git pull https://github.com/yhuai/spark disableReduceLocalityB

[GitHub] spark pull request: [SPARK-10040][SQL] Use batch insert for JDBC w...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8273#issuecomment-132304929 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread MechCoder
Github user MechCoder commented on a diff in the pull request: https://github.com/apache/spark/pull/8197#discussion_r37333263 --- Diff: docs/ml-guide.md --- @@ -801,6 +801,153 @@ jsc.stop(); +## Examples: Summaries for LogisticRegression. + +Once [`Log

[GitHub] spark pull request: [SPARK-10040][SQL] Use batch insert for JDBC w...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8273#issuecomment-132304504 [Test build #41140 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41140/console) for PR 8273 at commit [`1062cda`](https://github.

[GitHub] spark pull request: [SPARK-10048][SPARKR] Support arbitrary nested...

2015-08-18 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/8276#discussion_r37332791 --- Diff: core/src/main/scala/org/apache/spark/api/r/SerDe.scala --- @@ -210,22 +213,31 @@ private[spark] object SerDe { writeType(dos, "void")

[GitHub] spark pull request: [SPARK-9900][MLlib] User guide for Association...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8207#issuecomment-132302712 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-9900][MLlib] User guide for Association...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8207#issuecomment-132302709 Merged build finished. Test PASSed.

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-08-18 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/7388

[GitHub] spark pull request: [SPARK-9906] [ML] User guide for LogisticRegre...

2015-08-18 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/8197#discussion_r37331604 --- Diff: docs/ml-guide.md --- @@ -801,6 +801,153 @@ jsc.stop(); +## Examples: Summaries for LogisticRegression. + +Once [`Log

[GitHub] spark pull request: [PARK-6489][SQL] add column pruning for Genera...

2015-08-18 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/8268#discussion_r37331657 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala --- @@ -178,6 +179,10 @@ object ColumnPruning extends Rule[Log

[GitHub] spark pull request: [SPARK-8924] [MLLIB, DOCUMENTATION] Added @sin...

2015-08-18 Thread MechCoder
Github user MechCoder commented on the pull request: https://github.com/apache/spark/pull/7380#issuecomment-132299068 Apart from that minor comment, LGTM @mengxr Thanks for the PR @BryanCutler . And @manishamde I took the liberty of reviewing the PR since you were busy. Hope

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-08-18 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/7388#issuecomment-132297867 I'll merge this with master and branch-1.5. @hhbyyh Thanks a lot!

[GitHub] spark pull request: [SPARK-10048][SPARKR] Support arbitrary nested...

2015-08-18 Thread shivaram
Github user shivaram commented on the pull request: https://github.com/apache/spark/pull/8276#issuecomment-132297061 Thanks @sun-rui I'll take a look at this today cc @davies

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7388#issuecomment-132294920 Merged build finished. Test PASSed.

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7388#issuecomment-132294922 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7388#issuecomment-132294758 [Test build #41146 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41146/console) for PR 7388 at commit [`a370816`](https://github.

[GitHub] spark pull request: [SPARK-8542][MLlib]PMML export for Decision Tr...

2015-08-18 Thread JasmineGeorge
Github user JasmineGeorge commented on the pull request: https://github.com/apache/spark/pull/7842#issuecomment-132294464 formatted those files as well. Sorry I missed them last time.

[GitHub] spark pull request: [SPARK-8924] [MLLIB, DOCUMENTATION] Added @sin...

2015-08-18 Thread MechCoder
Github user MechCoder commented on a diff in the pull request: https://github.com/apache/spark/pull/7380#discussion_r37329998 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/tree/impurity/Variance.scala --- @@ -57,6 +60,7 @@ object Variance extends Impurity { /**

[GitHub] spark pull request: [SPARK-8542][MLlib]PMML export for Decision Tr...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7842#issuecomment-132294169 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-8542][MLlib]PMML export for Decision Tr...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7842#issuecomment-132294175 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-8542][MLlib]PMML export for Decision Tr...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7842#issuecomment-132293991 [Test build #41145 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41145/console) for PR 7842 at commit [`8b31f7d`](https://github.

[GitHub] spark pull request: [SPARK-10064] [ML] Parallelize decision tree b...

2015-08-18 Thread NathanHowell
Github user NathanHowell commented on the pull request: https://github.com/apache/spark/pull/8246#issuecomment-132293317 Will do, thanks.

[GitHub] spark pull request: [SPARK-9856][SparkR] Add expression functions ...

2015-08-18 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/8264#issuecomment-132292648 @yu-iskw Would you mind to add doc for every function?

[GitHub] spark pull request: [SPARK-8167] Make tasks that fail from YARN pr...

2015-08-18 Thread markgrover
Github user markgrover commented on the pull request: https://github.com/apache/spark/pull/8007#issuecomment-132292001 Yeah, for sure, let me upload the logs on gist and explain what I am talking about. Will do that soon.

[GitHub] spark pull request: [SPARK-10064] [ML] Parallelize decision tree b...

2015-08-18 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/8246#issuecomment-132291972 This sounds great, thanks! I'll need to finish up with QA for 1.5 before taking a look, but please ping me if I don't return to review before long.

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-08-18 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/7388#issuecomment-132291780 Modifying the LDA example sounds good to me. It does mean using spark.ml classes in spark.mllib examples, but I don't really see a problem with that for examples. (I

[GitHub] spark pull request: SPARK-9790: [YARN] Expose in WebUI if NodeMana...

2015-08-18 Thread markgrover
Github user markgrover commented on the pull request: https://github.com/apache/spark/pull/8093#issuecomment-132291876 Thanks Marcelo, the code more or less works but GetExecutorLossReason is getting called twice. So, I need to work that out with @mccheah. Once that's resolved, I will

[GitHub] spark pull request: [SPARK-10001] [CORE] Allow Ctrl-C in spark-she...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8216#issuecomment-132290482 [Test build #41148 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41148/consoleFull) for PR 8216 at commit [`d3eabf0`](https://gith

[GitHub] spark pull request: [SPARK-9856][SparkR] Add expression functions ...

2015-08-18 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/8264#issuecomment-132290516 @yu-iskw You add this patch, then you do not need the special version of rand/randn ``` --- a/core/src/main/scala/org/apache/spark/api/r/RBackendHandler.scala

[GitHub] spark pull request: [SPARK-10001] [CORE] Allow Ctrl-C in spark-she...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8216#issuecomment-132289797 Merged build started.

[GitHub] spark pull request: [SPARK-10001] [CORE] Allow Ctrl-C in spark-she...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8216#issuecomment-132289774 Merged build triggered.

[GitHub] spark pull request: [SPARK-10001] [CORE] Allow Ctrl-C in spark-she...

2015-08-18 Thread piaozhexiu
Github user piaozhexiu commented on the pull request: https://github.com/apache/spark/pull/8216#issuecomment-132289421 I believe the YarnClusterSuite failure is fixed by [SPARK-10059](https://issues.apache.org/jira/browse/SPARK-10059) because I see the same error message (`java.lang.N

[GitHub] spark pull request: [SPARK-9900][MLlib] User guide for Association...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8207#issuecomment-132288585 [Test build #41147 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41147/consoleFull) for PR 8207 at commit [`b038826`](https://gith

[GitHub] spark pull request: [SPARK-8115] [SQL] Remove TestData

2015-08-18 Thread BenFradet
Github user BenFradet commented on the pull request: https://github.com/apache/spark/pull/7406#issuecomment-132288239 @rxin @andrewor14 This patch doesn't really make sense with the introduction of [SQLTestData](https://github.com/apache/spark/blob/master/sql/core/src/test/scal

[GitHub] spark pull request: [SPARK-9900][MLlib] User guide for Association...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8207#issuecomment-132286553 Merged build started.

[GitHub] spark pull request: [SPARK-9900][MLlib] User guide for Association...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8207#issuecomment-132286529 Merged build triggered.

[GitHub] spark pull request: [SPARK-9900][MLlib] User guide for Association...

2015-08-18 Thread feynmanliang
Github user feynmanliang commented on the pull request: https://github.com/apache/spark/pull/8207#issuecomment-132286583 @mengxr Rebased (again...) :trollface:

[GitHub] spark pull request: [SPARK-10064] [ML] Parallelize decision tree b...

2015-08-18 Thread NathanHowell
Github user NathanHowell commented on the pull request: https://github.com/apache/spark/pull/8246#issuecomment-132283376 cc/ @jkbradley @manishamde @mengxr

[GitHub] spark pull request: [SPARK-9819][Streaming][Documentation] Clarify...

2015-08-18 Thread huitseeker
Github user huitseeker commented on a diff in the pull request: https://github.com/apache/spark/pull/8103#discussion_r37325256 --- Diff: python/pyspark/streaming/dstream.py --- @@ -454,7 +454,10 @@ def reduceByWindow(self, reduceFunc, invReduceFunc, windowDuration, slideDuratio

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7388#issuecomment-132282330 [Test build #41146 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41146/consoleFull) for PR 7388 at commit [`a370816`](https://gith

[GitHub] spark pull request: [SPARK-8542][MLlib]PMML export for Decision Tr...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7842#issuecomment-132281581 [Test build #41145 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41145/consoleFull) for PR 7842 at commit [`8b31f7d`](https://gith

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-08-18 Thread hhbyyh
Github user hhbyyh commented on the pull request: https://github.com/apache/spark/pull/7388#issuecomment-132281207 @mengxr @jkbradley Sent an update addressing the comment.

[GitHub] spark pull request: [SPARK-8542][MLlib]PMML export for Decision Tr...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7842#issuecomment-132280348 Merged build started.

[GitHub] spark pull request: [SPARK-9782] [YARN] Support YARN application t...

2015-08-18 Thread vanzin
Github user vanzin commented on a diff in the pull request: https://github.com/apache/spark/pull/8072#discussion_r37324265 --- Diff: yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala --- @@ -163,6 +163,23 @@ private[spark] class Client( appContext.setQueue(args.

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7388#issuecomment-132280407 Merged build started.

[GitHub] spark pull request: [Spark-9028] [ML] Add CountVectorizer as an es...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7388#issuecomment-132280307 Merged build triggered.

[GitHub] spark pull request: [SPARK-8542][MLlib]PMML export for Decision Tr...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7842#issuecomment-132280277 Merged build triggered.

[GitHub] spark pull request: [SPARK-9939] [SQL] Resorts to Java process API...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8168#issuecomment-132278490 [Test build #41144 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41144/consoleFull) for PR 8168 at commit [`7f20e19`](https://gith

[GitHub] spark pull request: [SPARK-9782] [YARN] Support YARN application t...

2015-08-18 Thread dennishuo
Github user dennishuo commented on a diff in the pull request: https://github.com/apache/spark/pull/8072#discussion_r37323794 --- Diff: yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala --- @@ -163,6 +163,23 @@ private[spark] class Client( appContext.setQueue(ar

[GitHub] spark pull request: [SPARK-9939] [SQL] Resorts to Java process API...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8168#issuecomment-132277739 Merged build triggered.

[GitHub] spark pull request: [SPARK-9939] [SQL] Resorts to Java process API...

2015-08-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8168#issuecomment-132277784 Merged build started.

[GitHub] spark pull request: [SPARK-10083] [SQL] CaseWhen should support ty...

2015-08-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8270#issuecomment-132275259 [Test build #41142 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41142/consoleFull) for PR 8270 at commit [`1487543`](https://gith
