[GitHub] spark issue #19631: [SPARK-22372][core, yarn] Make cluster submission use Sp...

2017-11-01 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/19631 retest this please --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19565: [SPARK-22111][MLLIB] OnlineLDAOptimizer should filter ou...

2017-11-01 Thread akopich
Github user akopich commented on the issue: https://github.com/apache/spark/pull/19565 @WeichenXu123, in a case of large dataset this "adjustment" would have infinitesimal effect. (IMO, no adjustment is needed -- the expected number of non-empty docs in the same and does not depend

[GitHub] spark pull request #19632: Added description

2017-11-01 Thread animenon
GitHub user animenon opened a pull request: https://github.com/apache/spark/pull/19632 Added description ## What changes were proposed in this pull request? Description added to better understand example. ## How was this patch tested? Not required as only

[GitHub] spark issue #19623: [SPARK-22078][SQL] clarify exception behaviors for all d...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19623 **[Test build #83300 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83300/testReport)** for PR 19623 at commit

[GitHub] spark issue #19633: [SPARK-22411][SQL] Disable the heuristic to calculate ma...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19633 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19635: [SPARK-22413][SQL] Type coercion for IN is not coherent ...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19635 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19635: [SPARK-22413][SQL] Type coercion for IN is not coherent ...

2017-11-01 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/19635 ok to test --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19631: [SPARK-22372][core, yarn] Make cluster submission use Sp...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19631 **[Test build #83299 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83299/testReport)** for PR 19631 at commit

[GitHub] spark issue #19624: [SPARKR][SPARK-22315] Warn if SparkR package version doe...

2017-11-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/19624 Maybe but it will say Version mismatch between Spark JVM and SparkR package. JVM version was 2.3.0-SNAPSHOT, while R package version was 2.1.2 I think it will be clear the

[GitHub] spark issue #19208: [SPARK-21087] [ML] CrossValidator, TrainValidationSplit ...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19208 **[Test build #3972 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3972/testReport)** for PR 19208 at commit

[GitHub] spark issue #19633: [SPARK-22411][SQL] Disable the heuristic to calculate ma...

2017-11-01 Thread vgankidi
Github user vgankidi commented on the issue: https://github.com/apache/spark/pull/19633 ping @davies --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark pull request #19634: [SPARK-22412][SQL] Fix incorrect comment in DataS...

2017-11-01 Thread vgankidi
GitHub user vgankidi opened a pull request: https://github.com/apache/spark/pull/19634 [SPARK-22412][SQL] Fix incorrect comment in DataSourceScanExec ## What changes were proposed in this pull request? Next fit decreasing bin packing algorithm is used to combine splits in

[GitHub] spark issue #16578: [SPARK-4502][SQL] Parquet nested column pruning

2017-11-01 Thread CodingCat
Github user CodingCat commented on the issue: https://github.com/apache/spark/pull/16578 made a simple test in a single-node spark environment I used a synthetic dataset which is generated as: (that’s 20M) ```scala import spark.implicits._ import

[GitHub] spark pull request #19635: [SPARK-22413][SQL] Type coercion for IN is not co...

2017-11-01 Thread mgaido91
GitHub user mgaido91 opened a pull request: https://github.com/apache/spark/pull/19635 [SPARK-22413][SQL] Type coercion for IN is not coherent between Literals and subquery ## What changes were proposed in this pull request? Now, type coercion for IN is not coherent

[GitHub] spark issue #19635: [SPARK-22413][SQL] Type coercion for IN is not coherent ...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19635 **[Test build #83304 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83304/testReport)** for PR 19635 at commit

[GitHub] spark issue #19631: [SPARK-22372][core, yarn] Make cluster submission use Sp...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19631 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19629: [SPARK-22408][SQL] RelationalGroupedDataset's distinct p...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19629 **[Test build #3971 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3971/testReport)** for PR 19629 at commit

[GitHub] spark issue #19565: [SPARK-22111][MLLIB] OnlineLDAOptimizer should filter ou...

2017-11-01 Thread akopich
Github user akopich commented on the issue: https://github.com/apache/spark/pull/19565 Okay... any idea why tests failed? It says ```ERROR: Step ?Publish JUnit test result report? failed: No test report files were found. Configuration error?``` ---

[GitHub] spark issue #19208: [SPARK-21087] [ML] CrossValidator, TrainValidationSplit ...

2017-11-01 Thread jkbradley
Github user jkbradley commented on the issue: https://github.com/apache/spark/pull/19208 taking a look... --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19631: [SPARK-22372][core, yarn] Make cluster submission use Sp...

2017-11-01 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/19631 retest this please --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19631: [SPARK-22372][core, yarn] Make cluster submission use Sp...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19631 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/83302/ Test FAILed. ---

[GitHub] spark issue #19439: [SPARK-21866][ML][PySpark] Adding spark image reader

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19439 **[Test build #83303 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83303/testReport)** for PR 19439 at commit

[GitHub] spark issue #19632: Added description

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19632 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19631: [SPARK-22372][core, yarn] Make cluster submission use Sp...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19631 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark pull request #19636: [SPARK-22416][SQL] Move OrcOptions from `sql/hive...

2017-11-01 Thread dongjoon-hyun
GitHub user dongjoon-hyun opened a pull request: https://github.com/apache/spark/pull/19636 [SPARK-22416][SQL] Move OrcOptions from `sql/hive` to `sql/core` ## What changes were proposed in this pull request? According to the

[GitHub] spark pull request #19633: [SPARK-22411][SQL] Disable the heuristic to calcu...

2017-11-01 Thread vgankidi
GitHub user vgankidi opened a pull request: https://github.com/apache/spark/pull/19633 [SPARK-22411][SQL] Disable the heuristic to calculate max partition size when dynamic allocation is enabled and use the value specified by the property spark.sql.files.maxPartitionBytes instead

[GitHub] spark pull request #19632: Added description to python spark Pi example

2017-11-01 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/19632#discussion_r148370955 --- Diff: examples/src/main/python/pi.py --- @@ -27,12 +27,16 @@ if __name__ == "__main__": """ Usage: pi [partitions] +

[GitHub] spark issue #19633: [SPARK-22411][SQL] Disable the heuristic to calculate ma...

2017-11-01 Thread vgankidi
Github user vgankidi commented on the issue: https://github.com/apache/spark/pull/19633 How about using spark.dynamicAllocation.maxExecutors for calculating bytesPerCore when dynamic allocation is enabled? --- - To

[GitHub] spark issue #19634: [SPARK-22412][SQL] Fix incorrect comment in DataSourceSc...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19634 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19635: [SPARK-22413][SQL] Type coercion for IN is not coherent ...

2017-11-01 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/19635 ok to test --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19635: [SPARK-22413][SQL] Type coercion for IN is not coherent ...

2017-11-01 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/19635 @mgaido91 Can you update the PR and describe there what you exactly changed? --- - To unsubscribe, e-mail:

[GitHub] spark issue #19582: [SPARK-20644][core] Initial ground work for kvstore UI b...

2017-11-01 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/19582 @squito --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19439: [SPARK-21866][ML][PySpark] Adding spark image reader

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19439 **[Test build #83303 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83303/testReport)** for PR 19439 at commit

[GitHub] spark issue #19439: [SPARK-21866][ML][PySpark] Adding spark image reader

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19439 Merged build finished. Test PASSed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19468: [SPARK-18278] [Scheduler] Spark on Kubernetes - Basic Sc...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19468 **[Test build #83309 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83309/testReport)** for PR 19468 at commit

[GitHub] spark pull request #19636: [SPARK-22416][SQL] Move OrcOptions from `sql/hive...

2017-11-01 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/19636#discussion_r148420534 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcOptions.scala --- @@ -27,7 +27,7 @@ import

[GitHub] spark issue #19636: [SPARK-22416][SQL] Move OrcOptions from `sql/hive` to `s...

2017-11-01 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/19636 Retest this please. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark pull request #19439: [SPARK-21866][ML][PySpark] Adding spark image rea...

2017-11-01 Thread dakirsa
Github user dakirsa commented on a diff in the pull request: https://github.com/apache/spark/pull/19439#discussion_r148404466 --- Diff: mllib/src/main/scala/org/apache/spark/ml/image/ImageSchema.scala --- @@ -0,0 +1,236 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] spark issue #19628: [MINOR][DOC] automatic type inference supports also Date...

2017-11-01 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19628 ok to test --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19208: [SPARK-21087] [ML] CrossValidator, TrainValidationSplit ...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19208 **[Test build #3972 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3972/testReport)** for PR 19208 at commit

[GitHub] spark issue #19433: [SPARK-3162] [MLlib] Add local tree training for decisio...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19433 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/83310/ Test FAILed. ---

[GitHub] spark issue #19433: [SPARK-3162] [MLlib] Add local tree training for decisio...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19433 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19629: [SPARK-22408][SQL] RelationalGroupedDataset's distinct p...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19629 **[Test build #3971 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3971/testReport)** for PR 19629 at commit

[GitHub] spark issue #19635: [SPARK-22413][SQL] Type coercion for IN is not coherent ...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19635 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/83304/ Test FAILed. ---

[GitHub] spark issue #19635: [SPARK-22413][SQL] Type coercion for IN is not coherent ...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19635 **[Test build #83304 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83304/testReport)** for PR 19635 at commit

[GitHub] spark issue #19635: [SPARK-22413][SQL] Type coercion for IN is not coherent ...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19635 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19439: [SPARK-21866][ML][PySpark] Adding spark image reader

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19439 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/83303/ Test PASSed. ---

[GitHub] spark issue #19628: [MINOR][DOC] automatic type inference supports also Date...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19628 **[Test build #83311 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83311/testReport)** for PR 19628 at commit

[GitHub] spark issue #19628: [MINOR][DOC] automatic type inference supports also Date...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19628 **[Test build #83312 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83312/testReport)** for PR 19628 at commit

[GitHub] spark pull request #19636: [SPARK-22416][SQL] Move OrcOptions from `sql/hive...

2017-11-01 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/19636#discussion_r148423575 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/orc/OrcOptions.scala --- @@ -27,7 +27,7 @@ import

[GitHub] spark issue #19623: [SPARK-22078][SQL] clarify exception behaviors for all d...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19623 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/83300/ Test PASSed. ---

[GitHub] spark issue #19623: [SPARK-22078][SQL] clarify exception behaviors for all d...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19623 Merged build finished. Test PASSed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19459: [SPARK-20791][PYSPARK] Use Arrow to create Spark DataFra...

2017-11-01 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/19459 I made [SPARK-22417](https://issues.apache.org/jira/browse/SPARK-22417) for fixing reading from timestamps without arrow ---

[GitHub] spark issue #19625: [SPARK-22407][WEB-UI] Add rdd id column on storage page ...

2017-11-01 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19625 ok to test --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #16578: [SPARK-4502][SQL] Parquet nested column pruning

2017-11-01 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/16578 Thanks @CodingCat +1 on config switch. I think that would be a good idea. --- - To unsubscribe, e-mail:

[GitHub] spark issue #19439: [SPARK-21866][ML][PySpark] Adding spark image reader

2017-11-01 Thread jkbradley
Github user jkbradley commented on the issue: https://github.com/apache/spark/pull/19439 Quick comment: I see that data are being added under mllib/src/test/resources/ That appears to be a new directory, created recently. The standard directory is

[GitHub] spark issue #18538: [SPARK-14516][ML] Adding ClusteringEvaluator with the im...

2017-11-01 Thread jkbradley
Github user jkbradley commented on the issue: https://github.com/apache/spark/pull/18538 @yanboliang @mgaido91 I just saw this PR. It creates a new test data directory. Could you please send a quite update to move the data to the existing data directory:

[GitHub] spark issue #19439: [SPARK-21866][ML][PySpark] Adding spark image reader

2017-11-01 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19439 @jkbradley, BTW, mind checking the API structure please? I reviewed this to be consistent with other components and codes at my best but, to be honest, my ML knowledge and familiarity are

[GitHub] spark issue #19637: [SPARK-22243][DStream]spark.yarn.jars should reload from...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19637 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19636: [SPARK-22416][SQL] Move OrcOptions from `sql/hive` to `s...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19636 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19636: [SPARK-22416][SQL] Move OrcOptions from `sql/hive` to `s...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19636 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/83307/ Test FAILed. ---

[GitHub] spark pull request #19439: [SPARK-21866][ML][PySpark] Adding spark image rea...

2017-11-01 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/19439#discussion_r148405401 --- Diff: mllib/src/main/scala/org/apache/spark/ml/image/ImageSchema.scala --- @@ -0,0 +1,236 @@ +/* + * Licensed to the Apache Software

[GitHub] spark issue #19628: [MINOR][DOC] automatic type inference supports also Date...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19628 Merged build finished. Test PASSed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19628: [MINOR][DOC] automatic type inference supports also Date...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19628 **[Test build #83311 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83311/testReport)** for PR 19628 at commit

[GitHub] spark issue #19628: [MINOR][DOC] automatic type inference supports also Date...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19628 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/83311/ Test PASSed. ---

[GitHub] spark issue #19628: [MINOR][DOC] automatic type inference supports also Date...

2017-11-01 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19628 BTW, the test was passed for 5e1bbf0. It triggered again against the same commit after adding it to whitelist somehow. ---

[GitHub] spark issue #18118: [SPARK-20199][ML] : Provided featureSubsetStrategy to GB...

2017-11-01 Thread pralabhkumar
Github user pralabhkumar commented on the issue: https://github.com/apache/spark/pull/18118 @sethah please find some time to look into the changes . Please let me know if further changes are required. ---

[GitHub] spark issue #19636: [SPARK-22416][SQL] Move OrcOptions from `sql/hive` to `s...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19636 **[Test build #83306 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83306/testReport)** for PR 19636 at commit

[GitHub] spark issue #19636: [SPARK-22416][SQL] Move OrcOptions from `sql/hive` to `s...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19636 **[Test build #83308 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83308/testReport)** for PR 19636 at commit

[GitHub] spark issue #19623: [SPARK-22078][SQL] clarify exception behaviors for all d...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19623 **[Test build #83300 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83300/testReport)** for PR 19623 at commit

[GitHub] spark issue #19628: [MINOR][DOC] automatic type inference supports also Date...

2017-11-01 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19628 add to whitelist --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark issue #19628: [MINOR][DOC] automatic type inference supports also Date...

2017-11-01 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/19628 Merged to mater, branch-2.2 and branch-2.1. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For

[GitHub] spark pull request #19628: [MINOR][DOC] automatic type inference supports al...

2017-11-01 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19628 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #19637: [SPARK-22243][DStream]spark.yarn.jars should reload from...

2017-11-01 Thread ChenjunZou
Github user ChenjunZou commented on the issue: https://github.com/apache/spark/pull/19637 add spark.yarn.jars to the checkpoint reload configs. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For

[GitHub] spark pull request #19637: [SPARK-22243][DStream]spark.yarn.jars should relo...

2017-11-01 Thread ChenjunZou
GitHub user ChenjunZou opened a pull request: https://github.com/apache/spark/pull/19637 [SPARK-22243][DStream]spark.yarn.jars should reload from config when checkpoint recovery the previous pr branch is deleted by mistake You can merge this pull request into a Git repository by

[GitHub] spark issue #19637: [SPARK-22243][DStream]spark.yarn.jars should reload from...

2017-11-01 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/19637 ok to test --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark pull request #19527: [SPARK-13030][ML] Create OneHotEncoderEstimator f...

2017-11-01 Thread MLnick
Github user MLnick commented on a diff in the pull request: https://github.com/apache/spark/pull/19527#discussion_r148193102 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/OneHotEncoderEstimator.scala --- @@ -0,0 +1,456 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request #19624: [SPARKR][SPARK-22315] Warn if SparkR package vers...

2017-11-01 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/19624#discussion_r148192869 --- Diff: R/pkg/R/sparkR.R --- @@ -420,6 +420,18 @@ sparkR.session <- function( enableHiveSupport)

[GitHub] spark issue #19607: [SPARK-22395][SQL][PYTHON] Fix the behavior of timestamp...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19607 **[Test build #83286 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83286/testReport)** for PR 19607 at commit

[GitHub] spark issue #19618: [SPARK-5484][Followup] PeriodicRDDCheckpointer doc clean...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19618 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/83281/ Test PASSed. ---

[GitHub] spark issue #18805: [SPARK-19112][CORE] Support for ZStandard codec

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18805 **[Test build #83282 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83282/testReport)** for PR 18805 at commit

[GitHub] spark pull request #19618: [SPARK-5484][Followup] PeriodicRDDCheckpointer do...

2017-11-01 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19618 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #19218: [SPARK-21786][SQL] The 'spark.sql.parquet.compression.co...

2017-11-01 Thread fjh100456
Github user fjh100456 commented on the issue: https://github.com/apache/spark/pull/19218 cc @gatorsmile @dongjoon-hyun Is it ok now? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For

[GitHub] spark issue #19622: [SPARK-22306][SQL][2.2] alter table schema should not er...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19622 **[Test build #83287 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83287/testReport)** for PR 19622 at commit

[GitHub] spark issue #19607: [SPARK-22395][SQL][PYTHON] Fix the behavior of timestamp...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19607 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #16722: [SPARK-19591][ML][MLlib] Add sample weights to decision ...

2017-11-01 Thread zengxy
Github user zengxy commented on the issue: https://github.com/apache/spark/pull/16722 @sethah Hi thanks for your working on it and would you have any plan to make it on 2.3 ? --- - To unsubscribe, e-mail:

[GitHub] spark issue #19607: [SPARK-22395][SQL][PYTHON] Fix the behavior of timestamp...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19607 **[Test build #83280 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83280/testReport)** for PR 19607 at commit

[GitHub] spark issue #19396: [SPARK-22172][CORE] Worker hangs when the external shuff...

2017-11-01 Thread jerryshao
Github user jerryshao commented on the issue: https://github.com/apache/spark/pull/19396 OK, let me merge to master branch. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands,

[GitHub] spark issue #19586: [SPARK-22367][WIP][CORE] Separate the serialization of c...

2017-11-01 Thread ConeyLiu
Github user ConeyLiu commented on the issue: https://github.com/apache/spark/pull/19586 Currently, I use it directly. Maybe this is suitable for some special case which has same type data, such as ml or else. ---

[GitHub] spark pull request #19623: [SPARK-22078][SQL] clarify exception behaviors fo...

2017-11-01 Thread steveloughran
Github user steveloughran commented on a diff in the pull request: https://github.com/apache/spark/pull/19623#discussion_r148227459 --- Diff: sql/core/src/main/java/org/apache/spark/sql/sources/v2/reader/DataReader.java --- @@ -34,11 +34,17 @@ /** * Proceed

[GitHub] spark issue #19396: [SPARK-22172][CORE] Worker hangs when the external shuff...

2017-11-01 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/19396 we should update PR description too, but it's too late now... --- - To unsubscribe, e-mail:

[GitHub] spark issue #19618: [SPARK-5484][Followup] PeriodicRDDCheckpointer doc clean...

2017-11-01 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19618 **[Test build #83281 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/83281/testReport)** for PR 19618 at commit

[GitHub] spark issue #19618: [SPARK-5484][Followup] PeriodicRDDCheckpointer doc clean...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19618 Merged build finished. Test PASSed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #19618: [SPARK-5484][Followup] PeriodicRDDCheckpointer doc clean...

2017-11-01 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/19618 Merged to master --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail:

[GitHub] spark pull request #19396: [SPARK-22172][CORE] Worker hangs when the externa...

2017-11-01 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19396 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #19586: [SPARK-22367][WIP][CORE] Separate the serialization of c...

2017-11-01 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/19586 For these cases, they can write their own serializer and set it via `spark.serializer`. I don't think Spark should have built-in support for them because it's not general. ---

[GitHub] spark issue #19583: [WIP][SPARK-22339] [CORE] [NETWORK-SHUFFLE] Push epoch u...

2017-11-01 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/19583 instead of piggybacking the heartbeat, can we just let the driver send an event to all executors asynchronously about the node failure? ---

[GitHub] spark issue #19479: [SPARK-17074] [SQL] Generate equi-height histogram in co...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19479 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/83285/ Test FAILed. ---

[GitHub] spark issue #19479: [SPARK-17074] [SQL] Generate equi-height histogram in co...

2017-11-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19479 Merged build finished. Test FAILed. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark pull request #19622: [SPARK-22306][SQL][2.2] alter table schema should...

2017-11-01 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/19622#discussion_r148219682 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveExternalCatalog.scala --- @@ -295,7 +297,7 @@ private[spark] class

  1   2   3   >