date:20160818

[GitHub] spark issue #14709: [SPARK-17150][SQL] Support SQL generation for inline tab...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14709 **[Test build #64038 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64038/consoleFull)** for PR 14709 at commit

[GitHub] spark pull request #14709: [SPARK-17150][SQL] Support SQL generation for inl...

2016-08-18 Thread petermaxlee

GitHub user petermaxlee opened a pull request: https://github.com/apache/spark/pull/14709 [SPARK-17150][SQL] Support SQL generation for inline tables ## What changes were proposed in this pull request? This patch adds support for SQL generation for inline tables. With this, it

[GitHub] spark issue #14682: [SPARK-17104][SQL] LogicalRelation.newInstance should fo...

2016-08-18 Thread viirya

Github user viirya commented on the issue: https://github.com/apache/spark/pull/14682 cc @cloud-fan @hvanhovell --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #14452: [SPARK-16849][SQL] Improve subquery execution by dedupli...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14452 Merged build finished. Test FAILed.

[GitHub] spark issue #14452: [SPARK-16849][SQL] Improve subquery execution by dedupli...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14452 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64032/ Test FAILed. ---

[GitHub] spark issue #14452: [SPARK-16849][SQL] Improve subquery execution by dedupli...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14452 **[Test build #64032 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64032/consoleFull)** for PR 14452 at commit

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424988 --- Diff: R/pkg/R/functions.R --- @@ -2276,9 +2276,8 @@ setMethod("n_distinct", signature(x = "Column"), countDistinct(x, ...)

[GitHub] spark issue #13796: [SPARK-7159][ML] Add multiclass logistic regression to S...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13796 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64033/ Test PASSed. ---

[GitHub] spark issue #13796: [SPARK-7159][ML] Add multiclass logistic regression to S...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13796 Merged build finished. Test PASSed.

[GitHub] spark issue #13796: [SPARK-7159][ML] Add multiclass logistic regression to S...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13796 **[Test build #64033 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64033/consoleFull)** for PR 13796 at commit

[GitHub] spark issue #14452: [SPARK-16849][SQL] Improve subquery execution by dedupli...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14452 **[Test build #64037 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64037/consoleFull)** for PR 14452 at commit

[GitHub] spark pull request #14155: [SPARK-16498][SQL] move hive hack for data source...

2016-08-18 Thread gatorsmile

Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/14155#discussion_r75424795 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/createDataSourceTables.scala --- @@ -97,16 +92,17 @@ case class

[GitHub] spark issue #14639: [SPARK-17054][SPARKR] SparkR can not run in yarn-cluster...

2016-08-18 Thread zjffdu

Github user zjffdu commented on the issue: https://github.com/apache/spark/pull/14639 @shivaram @felixcheung @sun-rui My previous commit didn't resolve the issue. It succeeded only because Spark had already been downloaded into the cache dir. I pushed another commit to fix the issue. Overall,

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424491 --- Diff: R/pkg/R/functions.R --- @@ -832,7 +827,10 @@ setMethod("kurtosis", #' The function by default returns the last values it sees. It will

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424452 --- Diff: R/pkg/R/SQLContext.R --- @@ -727,6 +730,7 @@ dropTempView <- function(viewName) { #' @param source The name of external data source

[GitHub] spark issue #14639: [SPARK-17054][SPARKR] SparkR can not run in yarn-cluster...

2016-08-18 Thread sun-rui

Github user sun-rui commented on the issue: https://github.com/apache/spark/pull/14639 I think there may be a simpler solution. As I commented in the JIRA, the "EXISTING_SPARKR_BACKEND_PORT" env variable can be checked, instead of getting the whole spark conf from JVM into R. ---
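
A minimal sketch of the check suggested in the comment above, written in Python purely for illustration (SparkR itself is R, and this is not code from the PR). Only the variable name `EXISTING_SPARKR_BACKEND_PORT` comes from the comment; the helper name `has_existing_backend` is hypothetical, and what a set variable implies about the launch mode is an assumption.

```python
import os

# Variable name quoted in the comment above; everything else is illustrative.
BACKEND_PORT_VAR = "EXISTING_SPARKR_BACKEND_PORT"


def has_existing_backend(env=None):
    """Return True when the backend-port variable is set to a non-empty value.

    Hypothetical helper: it inspects only one environment variable rather
    than fetching a whole configuration object from another process.
    """
    env = os.environ if env is None else env
    return bool(env.get(BACKEND_PORT_VAR, ""))
```

The point of the sketch is the design choice: a single environment lookup is cheaper and simpler than transferring the full configuration just to answer a yes/no question.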

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424331 --- Diff: R/pkg/R/DataFrame.R --- @@ -514,9 +519,10 @@ setMethod("registerTempTable", #' #' Insert the contents of a SparkDataFrame into a

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424346 --- Diff: R/pkg/R/DataFrame.R --- @@ -999,9 +1008,10 @@ setMethod("dim", #' Collects all the elements of a SparkDataFrame and coerces them

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424338 --- Diff: R/pkg/R/DataFrame.R --- @@ -603,8 +611,9 @@ setMethod("persist", #' Mark this SparkDataFrame as non-persistent, and remove all blocks for

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424376 --- Diff: R/pkg/R/DataFrame.R --- @@ -2759,31 +2793,27 @@ setMethod("dropna", dataFrame(sdf) }) +#' @param

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424317 --- Diff: R/pkg/R/DataFrame.R --- @@ -120,8 +120,9 @@ setMethod("schema", #' #' Print the logical and physical Catalyst plans to the console

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424303 --- Diff: R/pkg/R/functions.R --- @@ -1273,12 +1271,15 @@ setMethod("round", #' bround #' #' Returns the value of the column `e` rounded

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424295 --- Diff: R/pkg/R/functions.R --- @@ -1335,7 +1336,7 @@ setMethod("rtrim", #' @note sd since 1.6.0 setMethod("sd", signature(x =

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424253 --- Diff: R/pkg/R/functions.R --- @@ -1848,7 +1850,7 @@ setMethod("upper", #' @note var since 1.6.0 setMethod("var", signature(x

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424182 --- Diff: R/pkg/R/functions.R --- @@ -2114,20 +2116,22 @@ setMethod("pmod", signature(y = "Column"), #' @rdname approxCountDistinct #' @name

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424170 --- Diff: R/pkg/R/functions.R --- @@ -2676,6 +2679,11 @@ setMethod("format_string", signature(format = "character", x = "Column"), #' representing

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424161 --- Diff: R/pkg/R/functions.R --- @@ -2702,19 +2710,21 @@ setMethod("from_unixtime", signature(x = "Column"), #' [12:05,12:10) but not in

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424152 --- Diff: R/pkg/R/functions.R --- @@ -2766,6 +2776,10 @@ setMethod("window", signature(x = "Column"), #' NOTE: The position is not zero based, but 1

[GitHub] spark issue #14639: [SPARK-17054][SPARKR] SparkR can not run in yarn-cluster...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14639 Merged build finished. Test FAILed.

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75424138 --- Diff: R/pkg/R/functions.R --- @@ -3115,6 +3166,11 @@ setMethod("dense_rank", #' #' This is equivalent to the LAG function in SQL. #'

[GitHub] spark issue #14639: [SPARK-17054][SPARKR] SparkR can not run in yarn-cluster...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14639 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64036/ Test FAILed. ---

[GitHub] spark issue #14639: [SPARK-17054][SPARKR] SparkR can not run in yarn-cluster...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14639 **[Test build #64036 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64036/consoleFull)** for PR 14639 at commit

[GitHub] spark issue #14639: [SPARK-17054][SPARKR] SparkR can not run in yarn-cluster...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14639 **[Test build #64036 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64036/consoleFull)** for PR 14639 at commit

[GitHub] spark issue #14694: [SPARK-17121][SPARKSUBMIT] Support _HOST replacement for...

2016-08-18 Thread zjffdu

Github user zjffdu commented on the issue: https://github.com/apache/spark/pull/14694 @vanzin Could you help review it?

[GitHub] spark issue #14699: [SPARK-17125][SPARKR] Allow to specify spark config usin...

2016-08-18 Thread zjffdu

Github user zjffdu commented on the issue: https://github.com/apache/spark/pull/14699 @felixcheung In what case would a user expect `1 -> 1.0 (numeric)`? Internally, SparkConf uses strings for both key and value.

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75423807 --- Diff: R/pkg/R/mllib.R --- @@ -620,11 +625,12 @@ setMethod("predict", signature(object = "KMeansModel"), #' predictions on new data, and

[GitHub] spark issue #8880: [SPARK-5682][Core] Add encrypted shuffle in spark

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/8880 **[Test build #64035 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64035/consoleFull)** for PR 8880 at commit

[GitHub] spark issue #14708: [SPARK-17149][SQL] array.sql for testing array related f...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14708 **[Test build #64034 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64034/consoleFull)** for PR 14708 at commit

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75423748 --- Diff: R/pkg/R/DataFrame.R --- @@ -3187,6 +3221,7 @@ setMethod("histogram", #' @param x A SparkDataFrame #' @param url JDBC database url of

[GitHub] spark issue #14697: [SPARK-17124][SQL] RelationalGroupedDataset.agg should p...

2016-08-18 Thread petermaxlee

Github user petermaxlee commented on the issue: https://github.com/apache/spark/pull/14697 cc @cloud-fan too

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75423733 --- Diff: R/pkg/R/DataFrame.R --- @@ -3003,9 +3036,10 @@ setMethod("str", #' Returns a new SparkDataFrame with columns dropped. #' This is a

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75423717 --- Diff: R/pkg/R/DataFrame.R --- @@ -2464,8 +2489,10 @@ setMethod("unionAll", #' Union two or more SparkDataFrames. This is equivalent to `UNION

[GitHub] spark pull request #14708: [SPARK-17149][SQL] array.sql for testing array re...

2016-08-18 Thread petermaxlee

Github user petermaxlee commented on a diff in the pull request: https://github.com/apache/spark/pull/14708#discussion_r75423714 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/FunctionRegistry.scala --- @@ -161,7 +161,6 @@ object FunctionRegistry {

[GitHub] spark pull request #14708: [SPARK-17149][SQL] array.sql for testing array re...

2016-08-18 Thread petermaxlee

GitHub user petermaxlee opened a pull request: https://github.com/apache/spark/pull/14708 [SPARK-17149][SQL] array.sql for testing array related functions ## What changes were proposed in this pull request? This patch creates array.sql in SQLQueryTestSuite for testing array

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75423643 --- Diff: R/pkg/R/mllib.R --- @@ -917,14 +922,14 @@ setMethod("spark.lda", signature(data = "SparkDataFrame"), # Returns a summary of the AFT

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75422922 --- Diff: R/pkg/R/generics.R --- @@ -735,6 +752,8 @@ setGeneric("between", function(x, bounds) { standardGeneric("between") }) setGeneric("cast",

[GitHub] spark issue #14467: [SPARK-16861][PYSPARK][CORE] Refactor PySpark accumulato...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14467 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64030/ Test FAILed. ---

[GitHub] spark issue #14467: [SPARK-16861][PYSPARK][CORE] Refactor PySpark accumulato...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14467 Merged build finished. Test FAILed.

[GitHub] spark issue #14467: [SPARK-16861][PYSPARK][CORE] Refactor PySpark accumulato...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14467 **[Test build #64030 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64030/consoleFull)** for PR 14467 at commit

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75422686 --- Diff: R/pkg/R/mllib.R --- @@ -504,14 +504,15 @@ setMethod("summary", signature(object = "IsotonicRegressionModel"), #' Users can call

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread felixcheung

Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75422499 --- Diff: R/pkg/R/mllib.R --- @@ -917,14 +922,14 @@ setMethod("spark.lda", signature(data = "SparkDataFrame"), # Returns a summary of the AFT

[GitHub] spark issue #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/duplicated a...

2016-08-18 Thread HyukjinKwon

Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/14705 @shivaram BTW do you mind if I ask if we have R style guidelines somewhere please? I remember I made a PR for R referencing only the other codes. It seems it is also missing in

[GitHub] spark issue #14699: [SPARK-17125][SPARKR] Allow to specify spark config usin...

2016-08-18 Thread felixcheung

Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/14699 I haven't exhaustively tested the configs, so I'm not sure whether always coercing to string might be limiting in certain cases (e.g. user was expecting `1` -> `1.0` (numeric) but now `1` -> `"1"` (character)

[GitHub] spark pull request #14705: [SPARK-16508][SparkR] Fix CRAN undocumented/dupli...

2016-08-18 Thread HyukjinKwon

Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/14705#discussion_r75421912 --- Diff: R/pkg/R/DataFrame.R --- @@ -1202,6 +1215,7 @@ setMethod("toRDD", #' Groups the SparkDataFrame using the specified columns, so we can run

[GitHub] spark issue #14707: [SPARK-17144] [SQL] Removal of useless CreateHiveTableAs...

2016-08-18 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/14707 cc @cloud-fan

[GitHub] spark issue #13796: [SPARK-7159][ML] Add multiclass logistic regression to S...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13796 **[Test build #64033 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64033/consoleFull)** for PR 13796 at commit

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread sethah

Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75421906 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/MultinomialLogisticRegression.scala --- @@ -0,0 +1,611 @@ +/* + * Licensed to the

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread sethah

Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75421870 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/MultinomialLogisticRegression.scala --- @@ -0,0 +1,619 @@ +/* + * Licensed to the

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread sethah

Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75421878 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/MultinomialLogisticRegression.scala --- @@ -0,0 +1,619 @@ +/* + * Licensed to the

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread sethah

Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75421847 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1001 @@ +/* + * Licensed

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread sethah

Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75421818 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1016 @@ +/* + * Licensed

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread sethah

Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75421784 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1016 @@ +/* + * Licensed

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread sethah

Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75421793 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1016 @@ +/* + * Licensed

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread sethah

Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75421785 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1016 @@ +/* + * Licensed

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread sethah

Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75421763 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1016 @@ +/* + * Licensed

[GitHub] spark issue #14703: [SparkR][Example]change example APP name

2016-08-18 Thread felixcheung

Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/14703 LGTM Thanks - I'll merge if no more comments in a bit.

[GitHub] spark issue #14452: [SPARK-16849][SQL] Improve subquery execution by dedupli...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14452 **[Test build #64032 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64032/consoleFull)** for PR 14452 at commit

[GitHub] spark issue #13950: [SPARK-15487] [Web UI] Spark Master UI to reverse proxy ...

2016-08-18 Thread gurvindersingh

Github user gurvindersingh commented on the issue: https://github.com/apache/spark/pull/13950 @ajbozarth I have changed the information part when `reverseproxyUrl` is defined. Regarding accessing worker logs it is still working for me when worker, master and application are on

[GitHub] spark issue #13950: [SPARK-15487] [Web UI] Spark Master UI to reverse proxy ...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13950 **[Test build #64031 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64031/consoleFull)** for PR 13950 at commit

[GitHub] spark pull request #14597: [SPARK-17017][MLLIB][ML] add a chiSquare Selector...

2016-08-18 Thread mpjlu

Github user mpjlu commented on a diff in the pull request: https://github.com/apache/spark/pull/14597#discussion_r75419052 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/feature/ChiSqSelector.scala --- @@ -189,11 +228,35 @@ class ChiSqSelector @Since("1.3.0") ( */

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread dbtsai

Github user dbtsai commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75419042 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1016 @@ +/* + * Licensed

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread dbtsai

Github user dbtsai commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75418682 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1016 @@ +/* + * Licensed

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread dbtsai

Github user dbtsai commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75418614 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1016 @@ +/* + * Licensed

[GitHub] spark issue #14452: [SPARK-16849][SQL] Improve subquery execution by dedupli...

2016-08-18 Thread viirya

Github user viirya commented on the issue: https://github.com/apache/spark/pull/14452 @davies Thanks for the comment. This is proposed to reuse common subquery results in the query plan. For example, WITH cte as (SELECT * FROM src) SELECT * FROM cte a JOIN cte b

[GitHub] spark issue #14311: [SPARK-16550] [core] Certain classes fail to deserialize...

2016-08-18 Thread ericl

Github user ericl commented on the issue: https://github.com/apache/spark/pull/14311 On jenkins, it seems sometimes one block manager will fail to register. The only clue from the logs is ``` 16/08/18 18:54:25.796 netty-rpc-connection-0 INFO TransportClientFactory:

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread dbtsai

Github user dbtsai commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75418377 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1016 @@ +/* + * Licensed

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread dbtsai

Github user dbtsai commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75418198 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/MultinomialLogisticRegressionSuite.scala --- @@ -0,0 +1,1016 @@ +/* + * Licensed

[GitHub] spark issue #14384: [Spark-16443][SparkR] Alternating Least Squares (ALS) wr...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14384 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64029/ Test FAILed. ---

[GitHub] spark issue #14384: [Spark-16443][SparkR] Alternating Least Squares (ALS) wr...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14384 Build finished. Test FAILed.

[GitHub] spark issue #14384: [Spark-16443][SparkR] Alternating Least Squares (ALS) wr...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14384 **[Test build #64029 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64029/consoleFull)** for PR 14384 at commit

[GitHub] spark issue #14707: [SPARK-17144] [SQL] Removal of useless CreateHiveTableAs...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14707 Merged build finished. Test PASSed.

[GitHub] spark issue #14707: [SPARK-17144] [SQL] Removal of useless CreateHiveTableAs...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14707 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64028/ Test PASSed. ---

[GitHub] spark issue #14707: [SPARK-17144] [SQL] Removal of useless CreateHiveTableAs...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14707 **[Test build #64028 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64028/consoleFull)** for PR 14707 at commit

[GitHub] spark issue #14155: [SPARK-16498][SQL] move hive hack for data source table ...

2016-08-18 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/14155 : ) Then, it becomes very straightforward to combine `CreateDataSourceTableCommand` and `CreateDataSourceTableAsSelectCommand` into the same node. Now, let me check the data flow of

[GitHub] spark issue #14693: [SPARK-17113][Shuffle] Job failure due to Executor OOM i...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14693 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64027/ Test PASSed. ---

[GitHub] spark issue #14693: [SPARK-17113][Shuffle] Job failure due to Executor OOM i...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14693 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread dbtsai

Github user dbtsai commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75417445 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/MultinomialLogisticRegression.scala --- @@ -0,0 +1,619 @@ +/* + * Licensed to the

[GitHub] spark issue #14693: [SPARK-17113][Shuffle] Job failure due to Executor OOM i...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14693 **[Test build #64027 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64027/consoleFull)** for PR 14693 at commit

[GitHub] spark issue #14384: [Spark-16443][SparkR] Alternating Least Squares (ALS) wr...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14384 Build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #14384: [Spark-16443][SparkR] Alternating Least Squares (ALS) wr...

2016-08-18 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14384 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64026/ Test PASSed. ---

[GitHub] spark issue #14384: [Spark-16443][SparkR] Alternating Least Squares (ALS) wr...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14384 **[Test build #64026 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64026/consoleFull)** for PR 14384 at commit

[GitHub] spark issue #14700: [SPARK-17127]Make unaligned access in unsafe available f...

2016-08-18 Thread yimuxi

Github user yimuxi commented on the issue: https://github.com/apache/spark/pull/14700 @hvanhovell yes, I have tested it. I'm now running the Spark unit tests on our server with the AArch64 architecture, and maybe due to a problem in OpenJDK (the unaligned field in the

[GitHub] spark issue #14676: [SPARK-16947][SQL] Support type coercion and foldable ex...

2016-08-18 Thread rxin

Github user rxin commented on the issue: https://github.com/apache/spark/pull/14676 I've backported this (along with range function). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread sethah

Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75416875 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/MultinomialLogisticRegression.scala --- @@ -0,0 +1,619 @@ +/* + * Licensed to the

[GitHub] spark issue #14690: [SPARK-16980][SQL] Load only catalog table partition met...

2016-08-18 Thread mallman

Github user mallman commented on the issue: https://github.com/apache/spark/pull/14690 That looks great! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so,

[GitHub] spark pull request #13796: [SPARK-7159][ML] Add multiclass logistic regressi...

2016-08-18 Thread dbtsai

Github user dbtsai commented on a diff in the pull request: https://github.com/apache/spark/pull/13796#discussion_r75416580 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/MultinomialLogisticRegression.scala --- @@ -0,0 +1,619 @@ +/* + * Licensed to the

[GitHub] spark issue #14690: [SPARK-16980][SQL] Load only catalog table partition met...

2016-08-18 Thread cloud-fan

Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/14690 not yet, but I can make one right after https://github.com/apache/spark/pull/14155 is merged --- If your project is set up for it, you can reply to this email and have your reply appear on

[GitHub] spark issue #14676: [SPARK-16947][SQL] Support type coercion and foldable ex...

2016-08-18 Thread cloud-fan

Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/14676 thanks, merging to master! @petermaxlee do you mind sending a new PR for 2.0? thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] spark issue #14690: [SPARK-16980][SQL] Load only catalog table partition met...

2016-08-18 Thread mallman

Github user mallman commented on the issue: https://github.com/apache/spark/pull/14690 @cloud-fan O... how exciting! Is there a PR? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #14467: [SPARK-16861][PYSPARK][CORE] Refactor PySpark accumulato...

2016-08-18 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14467 **[Test build #64030 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64030/consoleFull)** for PR 14467 at commit
