[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135910154 [Test build #41768 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41768/console) for PR 8503 at commit [`3c74245`](https://github.

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread yu-iskw
Github user yu-iskw commented on a diff in the pull request: https://github.com/apache/spark/pull/8503#discussion_r38253474 --- Diff: R/pkg/R/generics.R --- @@ -507,6 +507,10 @@ setGeneric("saveAsTable", function(df, tableName, source, mode, ...) { standardGeneric("saveAsTab

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8515#issuecomment-135909544 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8515#issuecomment-135909493 Merged build triggered.

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread yu-iskw
Github user yu-iskw commented on a diff in the pull request: https://github.com/apache/spark/pull/8503#discussion_r38253381 --- Diff: R/pkg/R/generics.R --- @@ -467,7 +467,7 @@ setGeneric("merge") #' @rdname withColumn #' @export -setGeneric("mutate", function(x,

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/8515#issuecomment-135908415 test this please

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/8503#discussion_r38253290 --- Diff: R/pkg/R/DataFrame.R --- @@ -1034,6 +1068,10 @@ setMethod("[", signature(x = "DataFrame", i = "Column"), #' @return A new DataFrame with sel

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8515#issuecomment-135908274 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8515#issuecomment-135908273 Merged build finished. Test FAILed.

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8515#issuecomment-135907994 [Test build #41770 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41770/consoleFull) for PR 8515 at commit [`b509bee`](https://gith

[GitHub] spark pull request: [SPARK-10326] [yarn] Fix app submission on win...

2015-08-28 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/8493#issuecomment-135907721 (I think Andrew deleted his question, but this is not a problem in 1.4 nor 1.3.)

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread yu-iskw
Github user yu-iskw commented on a diff in the pull request: https://github.com/apache/spark/pull/8503#discussion_r38252954 --- Diff: R/pkg/R/DataFrame.R --- @@ -1034,6 +1068,10 @@ setMethod("[", signature(x = "DataFrame", i = "Column"), #' @return A new DataFrame with selecte

[GitHub] spark pull request: [SPARK-10326] [yarn] Fix app submission on win...

2015-08-28 Thread vanzin
Github user vanzin closed the pull request at: https://github.com/apache/spark/pull/8493

[GitHub] spark pull request: SPARK-9545, SPARK-9547: Use Maven in PRB if ti...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7878#issuecomment-135907494 Merged build finished. Test PASSed.

[GitHub] spark pull request: SPARK-9545, SPARK-9547: Use Maven in PRB if ti...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7878#issuecomment-135907495 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: SPARK-9545, SPARK-9547: Use Maven in PRB if ti...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7878#issuecomment-135907411 [Test build #41756 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41756/console) for PR 7878 at commit [`cf58c49`](https://github.

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135907268 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8515#issuecomment-135907247 Merged build triggered.

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135907201 [Test build #41767 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41767/console) for PR 8503 at commit [`90916d1`](https://github.

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135907267 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8515#issuecomment-135907261 Merged build started.

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135907074 [Test build #41768 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41768/consoleFull) for PR 8503 at commit [`3c74245`](https://gith

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/8515#discussion_r38252484 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/CatalystRowConverter.scala --- @@ -196,6 +196,13 @@ private[parquet] class

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/8515#discussion_r38252461 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/CatalystRowConverter.scala --- @@ -196,6 +196,13 @@ private[parquet] class

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8515#issuecomment-135906740 Merged build started.

[GitHub] spark pull request: [SPARK-10326] [yarn] Fix app submission on win...

2015-08-28 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/8493#issuecomment-135906746 I've merged this. @vanzin can you close the pull request?

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8515#issuecomment-135906726 Merged build triggered.

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135906727 Merged build triggered.

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135906741 Merged build started.

[GitHub] spark pull request: [SPARK-10339] [SPARK-10334] [SQL]Partitioned t...

2015-08-28 Thread yhuai
GitHub user yhuai opened a pull request: https://github.com/apache/spark/pull/8515 [SPARK-10339] [SPARK-10334] [SQL]Partitioned table scan This fixes the problem that scanning a partitioned table puts the driver under high memory pressure and takes down the cluster. Also, with this fix

[GitHub] spark pull request: [SPARK-10326] [yarn] Fix app submission on win...

2015-08-28 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/8493#issuecomment-135906019 In case others are wondering where this line comes from: #8053. LGTM feel free to merge it.

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread shivaram
Github user shivaram commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135905963 @felixcheung for the column being replaced part, I'd say we can open another jira / discuss it for later. I prefer the existing behavior of appending columns and users

[GitHub] spark pull request: [SPARK-8951][SparkR] support Unicode character...

2015-08-28 Thread shivaram
Github user shivaram commented on the pull request: https://github.com/apache/spark/pull/7494#issuecomment-135905717 Thanks @CHOIJAEHONG1 and @sun-rui -- I just want to test this / go through this a bit carefully once more as it's a pretty fundamental change in how we handle strings. W

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135905741 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135905739 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135905677 [Test build #41764 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41764/console) for PR 8118 at commit [`345bde2`](https://github.

[GitHub] spark pull request: [SPARK-5753] [SQL] add JDBCRDD support for pos...

2015-08-28 Thread lepfhty
Github user lepfhty commented on the pull request: https://github.com/apache/spark/pull/4549#issuecomment-135905598 sorry i haven't gotten around to this. they changed their base classes. it might be easier to do as a spark-package... i could give it another try in a week or two

[GitHub] spark pull request: [SPARK-10326] [yarn] Fix app submission on win...

2015-08-28 Thread andrewor14
Github user andrewor14 commented on the pull request: https://github.com/apache/spark/pull/8493#issuecomment-135905532 @vanzin was this a regression from 1.4 or 1.3?

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread shivaram
Github user shivaram commented on a diff in the pull request: https://github.com/apache/spark/pull/8503#discussion_r38251860 --- Diff: R/pkg/inst/tests/test_sparkSQL.R --- @@ -1042,6 +1046,19 @@ test_that("mutate(), rename() and names()", { names(newDF2) <- c("newerName", "ev

[GitHub] spark pull request: [SPARK-10330] Use SparkHadoopUtil TaskAttemptC...

2015-08-28 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/8499#issuecomment-135905364 I'll add a Scalastyle rule to catch these patterns and will fix these cases.

[GitHub] spark pull request: [Spark-7780][MLLIB] Intercept in logisticregre...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6386#issuecomment-135905329 Merged build finished. Test PASSed.

[GitHub] spark pull request: [Spark-7780][MLLIB] Intercept in logisticregre...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6386#issuecomment-135905331 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-10299][ML][WIP] word2vec should allow u...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8513#issuecomment-135905208 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-10330] Use SparkHadoopUtil TaskAttemptC...

2015-08-28 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/8499#issuecomment-135905246 ``` davies@localhost:~/work/spark$ git grep getConfiguration sql/ | grep -v getConfigurationFromJobContext sql/core/src/main/scala/org/apache/spark/sql/execution/

[GitHub] spark pull request: [Spark-7780][MLLIB] Intercept in logisticregre...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6386#issuecomment-135905240 [Test build #41765 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41765/console) for PR 6386 at commit [`8ca0fa9`](https://github.

[GitHub] spark pull request: [SPARK-10299][ML][WIP] word2vec should allow u...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8513#issuecomment-135905206 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-10299][ML][WIP] word2vec should allow u...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8513#issuecomment-135905104 [Test build #41762 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41762/console) for PR 8513 at commit [`f0fd13c`](https://github.

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135904835 [Test build #41767 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41767/consoleFull) for PR 8503 at commit [`90916d1`](https://gith

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135903860 Merged build started.

[GitHub] spark pull request: Removed code duplication in ShuffleBlockFetche...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8514#issuecomment-135903805 Can one of the admins verify this patch?

[GitHub] spark pull request: [SPARK-9803] [SPARKR] Add subset and transform...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8503#issuecomment-135903817 Merged build triggered.

[GitHub] spark pull request: [SPARK-10301] [SQL] Fixes schema merging for n...

2015-08-28 Thread marmbrus
Github user marmbrus commented on the pull request: https://github.com/apache/spark/pull/8509#issuecomment-135903763 This seems risky to include in branch-1.5 given how far along in the process we are. I'd propose we instead merge a small patch that checks that the things being zippe

[GitHub] spark pull request: Removed code duplication in ShuffleBlockFetche...

2015-08-28 Thread eracah
GitHub user eracah opened a pull request: https://github.com/apache/spark/pull/8514 Removed code duplication in ShuffleBlockFetcherIterator Added fetchUpToMaxBytes() to prevent having to update both code blocks when a change is made. You can merge this pull request into a Git repos

[GitHub] spark pull request: [SPARK-10341] [SQL] fix memory starving in uns...

2015-08-28 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/8511#issuecomment-135902719 ping @JoshRosen

[GitHub] spark pull request: [SPARK-9043] Serialize key, value and combiner...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7403#issuecomment-135901298 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-9043] Serialize key, value and combiner...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7403#issuecomment-135901296 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-9043] Serialize key, value and combiner...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7403#issuecomment-135901205 [Test build #41754 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41754/console) for PR 7403 at commit [`c8d5fa8`](https://github.

[GitHub] spark pull request: [SPARK-10064] [ML] Parallelize decision tree b...

2015-08-28 Thread manishamde
Github user manishamde commented on a diff in the pull request: https://github.com/apache/spark/pull/8246#discussion_r38249912 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/tree/DecisionTree.scala --- @@ -1056,6 +988,70 @@ object DecisionTree extends Serializable with Loggi

[GitHub] spark pull request: [SPARK-9654][ML][PYSPARK] Add IndexToString to...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7976#issuecomment-135900262 [Test build #41763 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41763/console) for PR 7976 at commit [`2316a90`](https://github.

[GitHub] spark pull request: [SPARK-9654][ML][PYSPARK] Add IndexToString to...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7976#issuecomment-135900354 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-9654][ML][PYSPARK] Add IndexToString to...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7976#issuecomment-135900353 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-10340] [SQL] Use S3 bulk listing for S3...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8512#issuecomment-135899939 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-10340] [SQL] Use S3 bulk listing for S3...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8512#issuecomment-135899937 Merged build finished. Test FAILed.

[GitHub] spark pull request: [SPARK-10340] [SQL] Use S3 bulk listing for S3...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8512#issuecomment-135899934 [Test build #41766 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41766/console) for PR 8512 at commit [`b713ad9`](https://github.

[GitHub] spark pull request: [SPARK-10064] [ML] Parallelize decision tree b...

2015-08-28 Thread manishamde
Github user manishamde commented on a diff in the pull request: https://github.com/apache/spark/pull/8246#discussion_r38249715 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/tree/DecisionTree.scala --- @@ -1056,6 +988,70 @@ object DecisionTree extends Serializable with Loggi

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135899228 [Test build #41764 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41764/consoleFull) for PR 8118 at commit [`345bde2`](https://gith

[GitHub] spark pull request: [SPARK-10340] [SQL] Use S3 bulk listing for S3...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8512#issuecomment-135899015 Merged build started.

[GitHub] spark pull request: [SPARK-10340] [SQL] Use S3 bulk listing for S3...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8512#issuecomment-135899122 [Test build #41766 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41766/consoleFull) for PR 8512 at commit [`b713ad9`](https://gith

[GitHub] spark pull request: [SPARK-10330] Use SparkHadoopUtil TaskAttemptC...

2015-08-28 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/8499#issuecomment-135899030 (Just to clarify: pushing to branch-1.5 now means the change will make it to 1.5.0 if there's a new rc. If it's not really meant to go into 1.5.0, we should wait until th

[GitHub] spark pull request: [SPARK-10340] [SQL] Use S3 bulk listing for S3...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8512#issuecomment-135898992 Merged build triggered.

[GitHub] spark pull request: [SPARK-10330] Use SparkHadoopUtil TaskAttemptC...

2015-08-28 Thread vanzin
Github user vanzin commented on the pull request: https://github.com/apache/spark/pull/8499#issuecomment-135898883 The code looks fine, but do you want to pull this into 1.5.0? Wouldn't that mean a "-1" vote and a new rc?

[GitHub] spark pull request: [SPARK-5259][CORE] don't submit stage until it...

2015-08-28 Thread squito
Github user squito commented on the pull request: https://github.com/apache/spark/pull/7699#issuecomment-135898730 @mateiz thanks, I think I see now -- I should be able to add a unit test for that without much work. I'm not sure whether or not that was broken before this fix, but I'l

[GitHub] spark pull request: [Spark-7780][MLLIB] Intercept in logisticregre...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/6386#issuecomment-135898763 [Test build #41765 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41765/consoleFull) for PR 6386 at commit [`8ca0fa9`](https://gith

[GitHub] spark pull request: [SPARK-10330] Use SparkHadoopUtil TaskAttemptC...

2015-08-28 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/8499#issuecomment-135898488 I'd like to pull this into branch-1.5 since this change will simplify certain compatibility checks for the `spark-avro` library. Does anyone have review comments conce

[GitHub] spark pull request: [SPARK-9654][ML][PYSPARK] Add IndexToString to...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7976#issuecomment-135898448 [Test build #41763 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41763/consoleFull) for PR 7976 at commit [`2316a90`](https://gith

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135898321 Merged build triggered.

[GitHub] spark pull request: [Spark-7780][MLLIB] Intercept in logisticregre...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6386#issuecomment-135898354 Merged build started.

[GitHub] spark pull request: [Spark-7780][MLLIB] Intercept in logisticregre...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/6386#issuecomment-135898336 Merged build triggered.

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135898341 Merged build started.

[GitHub] spark pull request: [SPARK-10299][ML][WIP] word2vec should allow u...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8513#issuecomment-135898085 [Test build #41762 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41762/consoleFull) for PR 8513 at commit [`f0fd13c`](https://gith

[GitHub] spark pull request: [SPARK-9654][ML][PYSPARK] Add IndexToString to...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7976#issuecomment-135897666 Merged build started.

[GitHub] spark pull request: [SPARK-9654][ML][PYSPARK] Add IndexToString to...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7976#issuecomment-135897649 Merged build triggered.

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135897205 [Test build #41761 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41761/console) for PR 8118 at commit [`7767df0`](https://github.

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135897216 Merged build finished. Test FAILed.

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135897221 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-9654][ML][PYSPARK] Add IndexToString to...

2015-08-28 Thread holdenk
Github user holdenk commented on a diff in the pull request: https://github.com/apache/spark/pull/7976#discussion_r38248302 --- Diff: python/pyspark/ml/feature.py --- @@ -817,6 +822,64 @@ class StringIndexerModel(JavaModel): """ Model fitted by StringIndexer.

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135896813 [Test build #41761 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41761/consoleFull) for PR 8118 at commit [`7767df0`](https://gith

[GitHub] spark pull request: [SPARK-10299][ML][WIP] word2vec should allow u...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8513#issuecomment-135896661 Merged build triggered.

[GitHub] spark pull request: [SPARK-10299][ML][WIP] word2vec should allow u...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8513#issuecomment-135896672 Merged build started.

[GitHub] spark pull request: [SPARK-10299][ML][WIP] word2vec should allow u...

2015-08-28 Thread holdenk
GitHub user holdenk opened a pull request: https://github.com/apache/spark/pull/8513 [SPARK-10299][ML][WIP] word2vec should allow users to specify the window size Currently word2vec has the window hard coded at 5, some users may want different sizes (for example if using on n-gram

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135895948 Merged build started.

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8118#issuecomment-135895929 Merged build triggered.

[GitHub] spark pull request: [SPARK-9679][ML][PYSPARK] Add Python API for S...

2015-08-28 Thread holdenk
Github user holdenk commented on a diff in the pull request: https://github.com/apache/spark/pull/8118#discussion_r38247509 --- Diff: python/pyspark/ml/tests.py --- @@ -263,6 +263,23 @@ def test_ngram(self): transformedDF = ngram0.transform(dataset) self.as

[GitHub] spark pull request: [SPARK-10017] [MLlib]: ML model broadcasts sho...

2015-08-28 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/8241#issuecomment-135895566 @sabhyankar Yes, the first issue also applies to the current approach. But I was expecting to solve it with this effort (broadcast less and only once). It is awkward to d

[GitHub] spark pull request: [SPARK-10323] [SQL] fix nullability of In/InSe...

2015-08-28 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/8492

[GitHub] spark pull request: [SPARK-10323] [SQL] fix nullability of In/InSe...

2015-08-28 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/8492#issuecomment-135894603 Merged into master and 1.5 branch.

[GitHub] spark pull request: [SPARK-10323] [SQL] fix nullability of In/InSe...

2015-08-28 Thread yhuai
Github user yhuai commented on the pull request: https://github.com/apache/spark/pull/8492#issuecomment-135894175 LGTM

[GitHub] spark pull request: [SPARK-10340] [SQL] Use S3 bulk listing for S3...

2015-08-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/8512#issuecomment-135891267 [Test build #41760 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41760/console) for PR 8512 at commit [`982f0dd`](https://github.

[GitHub] spark pull request: [SPARK-10340] [SQL] Use S3 bulk listing for S3...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8512#issuecomment-135891286 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/41

[GitHub] spark pull request: [SPARK-10340] [SQL] Use S3 bulk listing for S3...

2015-08-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/8512#issuecomment-135891282 Merged build finished. Test FAILed.
