[GitHub] spark issue #17742: [Spark-11968][ML][MLLIB]Optimize MLLIB ALS recommendForA...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17742 **[Test build #76258 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76258/testReport)** for PR 17742 at commit

[GitHub] spark issue #17793: [SPARK-20484][MLLIB] Add documentation to ALS code

2017-04-27 Thread sethah
Github user sethah commented on the issue: https://github.com/apache/spark/pull/17793 +1 for this change. I'll try to take a look sometime, but maybe after the QA period. Also cc @MLnick. --- If your project is set up for it, you can reply to this email and have your reply appear on

[GitHub] spark issue #17556: [SPARK-16957][MLlib] Use weighted midpoints for split va...

2017-04-27 Thread sethah
Github user sethah commented on the issue: https://github.com/apache/spark/pull/17556 I don't mind the weighted midpoints. However, if for a continuous feature we find that many points have the exact same value, we are assuming we may find data points in the test set that are close

[GitHub] spark issue #17540: [SPARK-20213][SQL][UI] Fix DataFrameWriter operations in...

2017-04-27 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/17540 Personally, I'm fine with this patch, the only concern is we should have a follow up for nested query execution ASAP. And we should revert

[GitHub] spark pull request #17556: [SPARK-16957][MLlib] Use weighted midpoints for s...

2017-04-27 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/17556#discussion_r113855186 --- Diff: mllib/src/main/scala/org/apache/spark/ml/tree/impl/RandomForest.scala --- @@ -1009,10 +1009,24 @@ private[spark] object RandomForest extends

[GitHub] spark pull request #17556: [SPARK-16957][MLlib] Use weighted midpoints for s...

2017-04-27 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/17556#discussion_r113855243 --- Diff: mllib/src/test/scala/org/apache/spark/ml/tree/impl/RandomForestSuite.scala --- @@ -138,9 +169,10 @@ class RandomForestSuite extends SparkFunSuite

[GitHub] spark pull request #17556: [SPARK-16957][MLlib] Use weighted midpoints for s...

2017-04-27 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/17556#discussion_r113854473 --- Diff: mllib/src/test/scala/org/apache/spark/ml/tree/impl/RandomForestSuite.scala --- @@ -112,9 +138,11 @@ class RandomForestSuite extends SparkFunSuite

[GitHub] spark pull request #17556: [SPARK-16957][MLlib] Use weighted midpoints for s...

2017-04-27 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/17556#discussion_r113855209 --- Diff: mllib/src/main/scala/org/apache/spark/ml/tree/impl/RandomForest.scala --- @@ -1037,7 +1051,10 @@ private[spark] object RandomForest extends

[GitHub] spark pull request #17785: [SPARK-20493][R] De-deuplicate parse logics for D...

2017-04-27 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/17785#discussion_r113855222 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala --- @@ -92,48 +93,8 @@ private[sql] object SQLUtils extends Logging {

[GitHub] spark pull request #17797: [SparkR][DOC]:Document LinearSVC in R programming...

2017-04-27 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/17797 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #17797: [SparkR][DOC]:Document LinearSVC in R programming guide

2017-04-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/17797 merged to master/2.2 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request #17785: [SPARK-20493][R] De-deuplicate parse logics for D...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17785#discussion_r113854501 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala --- @@ -92,48 +93,8 @@ private[sql] object SQLUtils extends Logging {

[GitHub] spark issue #17797: [SparkR][DOC]:Document LinearSVC in R programming guide

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17797 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76257/ Test PASSed. ---

[GitHub] spark issue #17797: [SparkR][DOC]:Document LinearSVC in R programming guide

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17797 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17797: [SparkR][DOC]:Document LinearSVC in R programming guide

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17797 **[Test build #76257 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76257/testReport)** for PR 17797 at commit

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-27 Thread sethah
Github user sethah commented on the issue: https://github.com/apache/spark/pull/17503 I think the benefit of this would be for speed at predict time or for model storage. @srowen the nodes don't have to be equal to be merged, they just have to output the same prediction. Since this a

[GitHub] spark pull request #17640: [SPARK-17608][SPARKR]:Long type has incorrect ser...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17640#discussion_r113853971 --- Diff: R/pkg/R/serialize.R --- @@ -83,6 +83,7 @@ writeObject <- function(con, object, writeType = TRUE) { Date = writeDate(con,

[GitHub] spark issue #17797: [SparkR][DOC]:Document LinearSVC in R programming guide

2017-04-27 Thread wangmiao1981
Github user wangmiao1981 commented on the issue: https://github.com/apache/spark/pull/17797 @felixcheung As I checked the SparkR programming guide, it seems that all machine learning parts are links to existing documents. So I just add the link to Linear SVM document and tested it.

[GitHub] spark pull request #17640: [SPARK-17608][SPARKR]:Long type has incorrect ser...

2017-04-27 Thread wangmiao1981
Github user wangmiao1981 commented on a diff in the pull request: https://github.com/apache/spark/pull/17640#discussion_r113853483 --- Diff: R/pkg/R/serialize.R --- @@ -83,6 +83,7 @@ writeObject <- function(con, object, writeType = TRUE) { Date = writeDate(con,

[GitHub] spark issue #17797: [SparkR][DOC]:Document LinearSVC in R programming guide

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17797 **[Test build #76257 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76257/testReport)** for PR 17797 at commit

[GitHub] spark pull request #17797: [SparkR][DOC]:Document LinearSVC in R programming...

2017-04-27 Thread wangmiao1981
GitHub user wangmiao1981 opened a pull request: https://github.com/apache/spark/pull/17797 [SparkR][DOC]:Document LinearSVC in R programming guide ## What changes were proposed in this pull request? add link to svmLinear in the SparkR programming document. ## How

[GitHub] spark issue #16609: [SPARK-8480] [CORE] [PYSPARK] [SPARKR] Add setName for D...

2017-04-27 Thread phatak-dev
Github user phatak-dev commented on the issue: https://github.com/apache/spark/pull/16609 @gatorsmile sure. I will give a PR. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17303: [SPARK-19112][CORE] add codec for ZStandard

2017-04-27 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/17303 I did quick benchmarks by using a TPCDS query (Q4) (I just referred the previous work in #10342) Based on the result, it seems it's a bit earlier to implement this; ``` scaleFactor: 4

[GitHub] spark pull request #17785: [SPARK-20493][R] De-deuplicate parse logics for D...

2017-04-27 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/17785#discussion_r113851957 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala --- @@ -92,48 +93,8 @@ private[sql] object SQLUtils extends Logging {

[GitHub] spark pull request #17785: [SPARK-20493][R] De-deuplicate parse logics for D...

2017-04-27 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/17785#discussion_r113851718 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala --- @@ -92,48 +93,8 @@ private[sql] object SQLUtils extends Logging {

[GitHub] spark pull request #17640: [SPARK-17608][SPARKR]:Long type has incorrect ser...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17640#discussion_r113851363 --- Diff: R/pkg/R/serialize.R --- @@ -83,6 +83,7 @@ writeObject <- function(con, object, writeType = TRUE) { Date = writeDate(con,

[GitHub] spark issue #17781: [SPARK-20476] [SQL] Block users to create a table that u...

2017-04-27 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/17781 cc @cloud-fan @sameeragarwal --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #17774: [SPARK-18371][Streaming] Spark Streaming backpressure ge...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17774 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76256/ Test PASSed. ---

[GitHub] spark issue #17774: [SPARK-18371][Streaming] Spark Streaming backpressure ge...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17774 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17774: [SPARK-18371][Streaming] Spark Streaming backpressure ge...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17774 **[Test build #76256 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76256/testReport)** for PR 17774 at commit

[GitHub] spark pull request #17783: [SPARK-20490][SPARKR][WIP] Add R wrappers for eqN...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17783#discussion_r113849536 --- Diff: R/pkg/inst/tests/testthat/test_sparkSQL.R --- @@ -1478,6 +1481,13 @@ test_that("column functions", { lapply( list(list(x =

[GitHub] spark pull request #17783: [SPARK-20490][SPARKR][WIP] Add R wrappers for eqN...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17783#discussion_r113849200 --- Diff: R/pkg/R/column.R --- @@ -67,8 +67,7 @@ operators <- list( "+" = "plus", "-" = "minus", "*" = "multiply", "/" = "divide", "%%" = "mod",

[GitHub] spark pull request #17783: [SPARK-20490][SPARKR][WIP] Add R wrappers for eqN...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17783#discussion_r113849582 --- Diff: R/pkg/inst/tests/testthat/test_sparkSQL.R --- @@ -1965,6 +1975,16 @@ test_that("filter() on a DataFrame", { # Test stats::filter

[GitHub] spark pull request #17783: [SPARK-20490][SPARKR][WIP] Add R wrappers for eqN...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17783#discussion_r113849285 --- Diff: R/pkg/R/column.R --- @@ -302,3 +301,65 @@ setMethod("otherwise", jc <- callJMethod(x@jc, "otherwise", value)

[GitHub] spark issue #17786: [SPARK-20483] Mesos Coarse mode may starve other Mesos f...

2017-04-27 Thread dbtsai
Github user dbtsai commented on the issue: https://github.com/apache/spark/pull/17786 @mgummelt We tested this in our production env, and it solves our issue. Since it seems to be a trivial change, I made my judgement. Gonna wait for more feedback. Thanks. --- If your project is

[GitHub] spark pull request #17783: [SPARK-20490][SPARKR][WIP] Add R wrappers for eqN...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17783#discussion_r113849107 --- Diff: R/pkg/R/column.R --- @@ -302,3 +301,65 @@ setMethod("otherwise", jc <- callJMethod(x@jc, "otherwise", value)

[GitHub] spark issue #17786: [SPARK-20483] Mesos Coarse mode may starve other Mesos f...

2017-04-27 Thread dbtsai
Github user dbtsai commented on the issue: https://github.com/apache/spark/pull/17786 @mgummelt We tested this in our production env, and it solves our issue. Since it seems to be a trivial change, I made my judgement. Gonna wait for more feedback. --- If your project is set up

[GitHub] spark pull request #17785: [SPARK-20493][R] De-deuplicate parse logics for D...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17785#discussion_r113847972 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala --- @@ -92,48 +93,8 @@ private[sql] object SQLUtils extends Logging {

[GitHub] spark issue #17774: [SPARK-18371][Streaming] Spark Streaming backpressure ge...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17774 **[Test build #76256 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76256/testReport)** for PR 17774 at commit

[GitHub] spark issue #17774: [SPARK-18371][Streaming] Spark Streaming backpressure ge...

2017-04-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/17774 Jenkins, ok to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #17774: [SPARK-18371][Streaming] Spark Streaming backpressure ge...

2017-04-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/17774 Jenkins, test this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #17785: [SPARK-20493][R] De-deuplicate parse logics for DDL-like...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17785 **[Test build #76255 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76255/testReport)** for PR 17785 at commit

[GitHub] spark issue #17786: [SPARK-20483] Mesos Coarse mode may starve other Mesos f...

2017-04-27 Thread dgshep
Github user dgshep commented on the issue: https://github.com/apache/spark/pull/17786 Fair point. This felt like a succinct way to handle this corner case, but if it makes sense to harden the offer refusal code instead, I can update. --- If your project is set up for it, you can

[GitHub] spark pull request #17785: [SPARK-20493][R] De-deuplicate parse logics for D...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17785#discussion_r113847076 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/api/r/SQLUtils.scala --- @@ -92,48 +93,8 @@ private[sql] object SQLUtils extends Logging {

[GitHub] spark pull request #17785: [SPARK-20493][R] De-deuplicate parse logics for D...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17785#discussion_r113846935 --- Diff: R/pkg/R/utils.R --- @@ -864,6 +864,14 @@ captureJVMException <- function(e, method) { # Extract the first message of JVM exception.

[GitHub] spark issue #17742: [Spark-11968][ML][MLLIB]Optimize MLLIB ALS recommendForA...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17742 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76254/ Test PASSed. ---

[GitHub] spark issue #17742: [Spark-11968][ML][MLLIB]Optimize MLLIB ALS recommendForA...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17742 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17742: [Spark-11968][ML][MLLIB]Optimize MLLIB ALS recommendForA...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17742 **[Test build #76254 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76254/testReport)** for PR 17742 at commit

[GitHub] spark pull request #17130: [SPARK-19791] [ML] Add doc and example for fpgrow...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17130#discussion_r113846602 --- Diff: mllib/src/main/scala/org/apache/spark/ml/fpm/FPGrowth.scala --- @@ -82,8 +81,8 @@ private[fpm] trait FPGrowthParams extends Params with

[GitHub] spark pull request #17130: [SPARK-19791] [ML] Add doc and example for fpgrow...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17130#discussion_r113846530 --- Diff: mllib/src/main/scala/org/apache/spark/ml/fpm/FPGrowth.scala --- @@ -268,12 +269,8 @@ class FPGrowthModel private[ml] ( val predictUDF

[GitHub] spark pull request #17130: [SPARK-19791] [ML] Add doc and example for fpgrow...

2017-04-27 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/17130#discussion_r113846563 --- Diff: docs/ml-frequent-pattern-mining.md --- @@ -0,0 +1,87 @@ +--- +layout: global +title: Frequent Pattern Mining +displayTitle:

[GitHub] spark issue #17796: [SPARK-20519][SQL][CORE]Modify to prevent some possible ...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17796 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request #17796: [SPARK-20519][SQL][CORE]Modify to prevent some po...

2017-04-27 Thread 10110346
GitHub user 10110346 opened a pull request: https://github.com/apache/spark/pull/17796 [SPARK-20519][SQL][CORE]Modify to prevent some possible runtime exceptions Signed-off-by: liuxian ## What changes were proposed in this pull request? For

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17765 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76251/ Test PASSed. ---

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17765 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17765 **[Test build #76251 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76251/testReport)** for PR 17765 at commit

[GitHub] spark issue #17459: [SPARK-20109][MLlib] Rewrote toBlockMatrix method on Ind...

2017-04-27 Thread johnc1231
Github user johnc1231 commented on the issue: https://github.com/apache/spark/pull/17459 @viirya Any more feedback on this? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17742: [Spark-11968][ML][MLLIB]Optimize MLLIB ALS recommendForA...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17742 **[Test build #76254 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76254/testReport)** for PR 17742 at commit

[GitHub] spark issue #17781: [SPARK-20476] [SQL] Block users to create a table that u...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17781 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17781: [SPARK-20476] [SQL] Block users to create a table that u...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17781 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76252/ Test PASSed. ---

[GitHub] spark issue #17781: [SPARK-20476] [SQL] Block users to create a table that u...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17781 **[Test build #76252 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76252/testReport)** for PR 17781 at commit

[GitHub] spark pull request #17596: [SPARK-12837][CORE] Do not send the name of inter...

2017-04-27 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/17596 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #17596: [SPARK-12837][CORE] Do not send the name of internal acc...

2017-04-27 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17596 LGTM - merging to master/2.2 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #17795: [SPARK-20517][UI] Fix broken history UI download ...

2017-04-27 Thread jerryshao
GitHub user jerryshao opened a pull request: https://github.com/apache/spark/pull/17795 [SPARK-20517][UI] Fix broken history UI download link ## What changes were proposed in this pull request? The download link in history server UI is concatenated with: ```

[GitHub] spark issue #17795: [SPARK-20517][UI] Fix broken history UI download link

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17795 **[Test build #76253 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76253/testReport)** for PR 17795 at commit

[GitHub] spark issue #17702: [SPARK-20408][SQL] Get the glob path in parallel to redu...

2017-04-27 Thread xuanyuanking
Github user xuanyuanking commented on the issue: https://github.com/apache/spark/pull/17702 @HyukjinKwon Can you help me to find a appropriate reviewer about this? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark issue #17794: Supplement the new blockidsuite unit tests

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17794 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17765 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17765 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76250/ Test PASSed. ---

[GitHub] spark pull request #17794: Supplement the new blockidsuite unit tests

2017-04-27 Thread heary-cao
GitHub user heary-cao opened a pull request: https://github.com/apache/spark/pull/17794 Supplement the new blockidsuite unit tests ## What changes were proposed in this pull request? This PR adds the new unit tests to support ShuffleDataBlockId , ShuffleIndexBlockId ,

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17765 **[Test build #76250 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76250/testReport)** for PR 17765 at commit

[GitHub] spark issue #17793: [SPARK-20484][MLLIB] Add documentation to ALS code

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17793 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #17736: [SPARK-20399][SQL] Can't use same regex pattern between ...

2017-04-27 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/17736 @cloud-fan Do you mean `SELECT \\abc`? Spark 2.x: sql("select '\\abc'").show() +---+ |abc| +---+ |abc| +---+

[GitHub] spark pull request #17793: [SPARK-20484][MLLIB] Add documentation to ALS cod...

2017-04-27 Thread danielyli
GitHub user danielyli opened a pull request: https://github.com/apache/spark/pull/17793 [SPARK-20484][MLLIB] Add documentation to ALS code ## What changes were proposed in this pull request? This PR adds documentation to the ALS code. ## How was this patch tested?

[GitHub] spark issue #17789: [SPARK-19525][CORE]Add RDD checkpoint compression suppor...

2017-04-27 Thread mridulm
Github user mridulm commented on the issue: https://github.com/apache/spark/pull/17789 To add, for non streaming usecases, this will definitely help - but was this a recent change for streaming ? (probably after @aramesh117 make the PR ?) --- If your project is set up for it, you

[GitHub] spark issue #17436: [SPARK-20101][SQL] Use OffHeapColumnVector when "spark.m...

2017-04-27 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/17436 When shall we free a column vector? One is when the iterator is consumed up, another one is when we have a `LIMIT n` in the query and stop reading the iterator at some point. Is there any other

[GitHub] spark issue #17789: [SPARK-19525][CORE]Add RDD checkpoint compression suppor...

2017-04-27 Thread mridulm
Github user mridulm commented on the issue: https://github.com/apache/spark/pull/17789 I thought the main reason @aramesh117 did this PR was for compression to be enabled for spark streaming usecase. If compression is already enabled, then am I missing something here ? --- If

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17765 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17765 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76249/ Test FAILed. ---

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17765 **[Test build #76249 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76249/testReport)** for PR 17765 at commit

[GitHub] spark issue #17771: [SPARK-20471]Remove AggregateBenchmark testsuite warning...

2017-04-27 Thread heary-cao
Github user heary-cao commented on the issue: https://github.com/apache/spark/pull/17771 @gatorsmile ok, please review it again. thanks. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request #17792: [SPARK-20496][SS] Bug in KafkaWriter Looks at Una...

2017-04-27 Thread anabranch
Github user anabranch closed the pull request at: https://github.com/apache/spark/pull/17792 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request #17645: [SPARK-20348] [ML] Support squared hinge loss (L2...

2017-04-27 Thread hhbyyh
Github user hhbyyh commented on a diff in the pull request: https://github.com/apache/spark/pull/17645#discussion_r113836900 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/LinearSVC.scala --- @@ -42,15 +44,35 @@ import org.apache.spark.sql.functions.{col, lit}

[GitHub] spark pull request #17792: [SPARK-20496][SS] Bug in KafkaWriter Looks at Una...

2017-04-27 Thread anabranch
GitHub user anabranch opened a pull request: https://github.com/apache/spark/pull/17792 [SPARK-20496][SS] Bug in KafkaWriter Looks at Unanalyzed Plans ## What changes were proposed in this pull request? We didn't enforce analyzed plans in Spark 2.1 when writing out to

[GitHub] spark pull request #17787: [SPARK-20496][SS] Bug in KafkaWriter Looks at Una...

2017-04-27 Thread anabranch
Github user anabranch closed the pull request at: https://github.com/apache/spark/pull/17787 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #17742: [Spark-11968][ML][MLLIB]Optimize MLLIB ALS recommendForA...

2017-04-27 Thread jtengyp
Github user jtengyp commented on the issue: https://github.com/apache/spark/pull/17742 I did some tests with the PR. Here is the cluster configure: 3 workers, each has 10 cores and 30G memory. With the netflix dataset (480,189 users and 17770 movies), the

[GitHub] spark issue #17781: [SPARK-20476] [SQL] Block users to create a table that u...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17781 **[Test build #76252 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76252/testReport)** for PR 17781 at commit

[GitHub] spark issue #17735: [SPARK-20441][SPARK-20432][SS] Within the same streaming...

2017-04-27 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/17735 @brkyvz please take a another look --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #17540: [SPARK-20213][SQL][UI] Fix DataFrameWriter operations in...

2017-04-27 Thread rdblue
Github user rdblue commented on the issue: https://github.com/apache/spark/pull/17540 @zsxwing and @cloud-fan, can you have another look at this? I'd really like to get it in. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17765 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17765 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76246/ Test PASSed. ---

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17765 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76245/ Test PASSed. ---

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17765 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17765 **[Test build #76245 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76245/testReport)** for PR 17765 at commit

[GitHub] spark issue #17765: [SPARK-20464][SS] Add a job group and description for st...

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17765 **[Test build #76246 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76246/testReport)** for PR 17765 at commit

[GitHub] spark pull request #17765: [SPARK-20464][SS] Add a job group and description...

2017-04-27 Thread kunalkhamar
Github user kunalkhamar commented on a diff in the pull request: https://github.com/apache/spark/pull/17765#discussion_r113831182 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamExecution.scala --- @@ -825,6 +832,11 @@ class StreamExecution(

[GitHub] spark issue #17790: [SPARK-20514][CORE] Upgrade Jetty to 9.3.11.v20160721

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17790 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17790: [SPARK-20514][CORE] Upgrade Jetty to 9.3.11.v20160721

2017-04-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17790 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76244/ Test PASSed. ---

[GitHub] spark pull request #17765: [SPARK-20464][SS] Add a job group and description...

2017-04-27 Thread kunalkhamar
Github user kunalkhamar commented on a diff in the pull request: https://github.com/apache/spark/pull/17765#discussion_r113830998 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamExecution.scala --- @@ -825,6 +832,11 @@ class StreamExecution(

[GitHub] spark issue #17790: [SPARK-20514][CORE] Upgrade Jetty to 9.3.11.v20160721

2017-04-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17790 **[Test build #76244 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76244/testReport)** for PR 17790 at commit

  1   2   3   4   5   >