svn commit: r29563 - in /dev/spark/2.4.1-SNAPSHOT-2018_09_20_22_02-5d74449-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Fri Sep 21 05:17:10 2018 New Revision: 29563 Log: Apache Spark 2.4.1-SNAPSHOT-2018_09_20_22_02-5d74449 docs [This commit notification would consist of 1476 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25494][SQL] Upgrade Spark's use of Janino to 3.0.10

2018-09-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 5d25e1544 -> 596af211a [SPARK-25494][SQL] Upgrade Spark's use of Janino to 3.0.10 ## What changes were proposed in this pull request? This PR upgrades Spark's use of Janino from 3.0.9 to 3.0.10. Note that 3.0.10 is a out-of-band release

svn commit: r29562 - in /dev/spark/2.5.0-SNAPSHOT-2018_09_20_20_02-5d25e15-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Fri Sep 21 03:17:06 2018 New Revision: 29562 Log: Apache Spark 2.5.0-SNAPSHOT-2018_09_20_20_02-5d25e15 docs [This commit notification would consist of 1486 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: Revert "[SPARK-23715][SQL] the input of to/from_utc_timestamp can not have timezone

2018-09-20 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.4 51f3659b7 -> 5d7444996 Revert "[SPARK-23715][SQL] the input of to/from_utc_timestamp can not have timezone ## What changes were proposed in this pull request? This reverts commit 417ad92502e714da71552f64d0e1257d2fd5d3d0. We decided

spark git commit: Revert "[SPARK-23715][SQL] the input of to/from_utc_timestamp can not have timezone

2018-09-20 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 950ab7995 -> 5d25e1544 Revert "[SPARK-23715][SQL] the input of to/from_utc_timestamp can not have timezone ## What changes were proposed in this pull request? This reverts commit 417ad92502e714da71552f64d0e1257d2fd5d3d0. We decided to

svn commit: r29561 - in /dev/spark/2.4.1-SNAPSHOT-2018_09_20_18_02-51f3659-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Fri Sep 21 01:17:13 2018 New Revision: 29561 Log: Apache Spark 2.4.1-SNAPSHOT-2018_09_20_18_02-51f3659 docs [This commit notification would consist of 1476 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-24777][SQL] Add write benchmark for AVRO

2018-09-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.4 43c62e797 -> 51f3659b7 [SPARK-24777][SQL] Add write benchmark for AVRO ## What changes were proposed in this pull request? Refactor `DataSourceWriteBenchmark` and add write benchmark for AVRO. ## How was this patch tested? Build and

spark git commit: [SPARK-24777][SQL] Add write benchmark for AVRO

2018-09-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 77e52448e -> 950ab7995 [SPARK-24777][SQL] Add write benchmark for AVRO ## What changes were proposed in this pull request? Refactor `DataSourceWriteBenchmark` and add write benchmark for AVRO. ## How was this patch tested? Build and run

svn commit: r29559 - in /dev/spark/2.5.0-SNAPSHOT-2018_09_20_16_02-77e5244-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 23:16:53 2018 New Revision: 29559 Log: Apache Spark 2.5.0-SNAPSHOT-2018_09_20_16_02-77e5244 docs [This commit notification would consist of 1486 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25472][SS] Don't have legitimate stops of streams cause stream exceptions

2018-09-20 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 4d114fc9a -> 77e52448e [SPARK-25472][SS] Don't have legitimate stops of streams cause stream exceptions ## What changes were proposed in this pull request? Legitimate stops of streams may actually cause an exception to be captured by

spark git commit: [SPARK-25366][SQL] Zstd and brotli CompressionCodec are not supported for parquet files

2018-09-20 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2f51e7235 -> 4d114fc9a [SPARK-25366][SQL] Zstd and brotli CompressionCodec are not supported for parquet files ## What changes were proposed in this pull request? Hadoop2.6 and hadoop2.7 do not contain zstd and brotli compressioncodec

svn commit: r29557 - in /dev/spark/2.4.1-SNAPSHOT-2018_09_20_14_03-43c62e7-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 21:19:08 2018 New Revision: 29557 Log: Apache Spark 2.4.1-SNAPSHOT-2018_09_20_14_03-43c62e7 docs [This commit notification would consist of 1476 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

svn commit: r29556 - in /dev/spark/2.3.3-SNAPSHOT-2018_09_20_14_02-7edfdfc-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 21:17:38 2018 New Revision: 29556 Log: Apache Spark 2.3.3-SNAPSHOT-2018_09_20_14_02-7edfdfc docs [This commit notification would consist of 1443 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

svn commit: r29554 - in /dev/spark/2.5.0-SNAPSHOT-2018_09_20_12_03-2f51e72-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 19:17:26 2018 New Revision: 29554 Log: Apache Spark 2.5.0-SNAPSHOT-2018_09_20_12_03-2f51e72 docs [This commit notification would consist of 1486 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-24918][CORE] Executor Plugin API

2018-09-20 Thread vanzin
Repository: spark Updated Branches: refs/heads/branch-2.4 c67c597b6 -> 43c62e797 [SPARK-24918][CORE] Executor Plugin API ## What changes were proposed in this pull request? A continuation of squito's executor plugin task. By his request I took his code and added testing and moved the plugin

spark git commit: [SPARK-24918][CORE] Executor Plugin API

2018-09-20 Thread vanzin
Repository: spark Updated Branches: refs/heads/master a86f84102 -> 2f51e7235 [SPARK-24918][CORE] Executor Plugin API ## What changes were proposed in this pull request? A continuation of squito's executor plugin task. By his request I took his code and added testing and moved the plugin

svn commit: r29550 - in /dev/spark/2.3.3-SNAPSHOT-2018_09_20_10_03-7edfdfc-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 17:19:34 2018 New Revision: 29550 Log: Apache Spark 2.3.3-SNAPSHOT-2018_09_20_10_03-7edfdfc docs [This commit notification would consist of 1443 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

svn commit: r29548 - in /dev/spark/2.4.1-SNAPSHOT-2018_09_20_10_03-c67c597-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 17:18:39 2018 New Revision: 29548 Log: Apache Spark 2.4.1-SNAPSHOT-2018_09_20_10_03-c67c597 docs [This commit notification would consist of 1475 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25381][SQL] Stratified sampling by Column argument

2018-09-20 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 88446b6ad -> a86f84102 [SPARK-25381][SQL] Stratified sampling by Column argument ## What changes were proposed in this pull request? In the PR, I propose to add an overloaded method for `sampleBy` which accepts the first argument of the

spark git commit: [SPARK-25450][SQL] PushProjectThroughUnion rule uses the same exprId for project expressions in each Union child, causing mistakes in constant propagation

2018-09-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 dad5c48b2 -> 7edfdfcec [SPARK-25450][SQL] PushProjectThroughUnion rule uses the same exprId for project expressions in each Union child, causing mistakes in constant propagation ## What changes were proposed in this pull request?

spark git commit: [SPARK-25450][SQL] PushProjectThroughUnion rule uses the same exprId for project expressions in each Union child, causing mistakes in constant propagation

2018-09-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.4 fc036729c -> c67c597b6 [SPARK-25450][SQL] PushProjectThroughUnion rule uses the same exprId for project expressions in each Union child, causing mistakes in constant propagation ## What changes were proposed in this pull request?

spark git commit: [SPARK-25450][SQL] PushProjectThroughUnion rule uses the same exprId for project expressions in each Union child, causing mistakes in constant propagation

2018-09-20 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 88e7e87bd -> 88446b6ad [SPARK-25450][SQL] PushProjectThroughUnion rule uses the same exprId for project expressions in each Union child, causing mistakes in constant propagation ## What changes were proposed in this pull request? The

spark git commit: [MINOR][PYTHON] Use a helper in `PythonUtils` instead of direct accessing Scala package

2018-09-20 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.4 78dd1d859 -> fc036729c [MINOR][PYTHON] Use a helper in `PythonUtils` instead of direct accessing Scala package ## What changes were proposed in this pull request? This PR proposes to use add a helper in `PythonUtils` instead of

spark git commit: [MINOR][PYTHON] Use a helper in `PythonUtils` instead of direct accessing Scala package

2018-09-20 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 e319a624e -> dad5c48b2 [MINOR][PYTHON] Use a helper in `PythonUtils` instead of direct accessing Scala package ## What changes were proposed in this pull request? This PR proposes to use add a helper in `PythonUtils` instead of

spark git commit: [MINOR][PYTHON] Use a helper in `PythonUtils` instead of direct accessing Scala package

2018-09-20 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 67f2c6a55 -> 88e7e87bd [MINOR][PYTHON] Use a helper in `PythonUtils` instead of direct accessing Scala package ## What changes were proposed in this pull request? This PR proposes to use add a helper in `PythonUtils` instead of direct

svn commit: r29544 - in /dev/spark/2.5.0-SNAPSHOT-2018_09_20_08_02-67f2c6a-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 15:16:53 2018 New Revision: 29544 Log: Apache Spark 2.5.0-SNAPSHOT-2018_09_20_08_02-67f2c6a docs [This commit notification would consist of 1485 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

svn commit: r29541 - in /dev/spark/2.4.1-SNAPSHOT-2018_09_20_06_03-78dd1d8-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 13:17:25 2018 New Revision: 29541 Log: Apache Spark 2.4.1-SNAPSHOT-2018_09_20_06_03-78dd1d8 docs [This commit notification would consist of 1475 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25417][SQL] ArrayContains function may return incorrect result when right expression is implicitly down casted

2018-09-20 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.4 b3bdfd7f1 -> 78dd1d859 [SPARK-25417][SQL] ArrayContains function may return incorrect result when right expression is implicitly down casted ## What changes were proposed in this pull request? In ArrayContains, we currently cast the

spark git commit: [SPARK-25417][SQL] ArrayContains function may return incorrect result when right expression is implicitly down casted

2018-09-20 Thread wenchen
Repository: spark Updated Branches: refs/heads/master edf5cc64e -> 67f2c6a55 [SPARK-25417][SQL] ArrayContains function may return incorrect result when right expression is implicitly down casted ## What changes were proposed in this pull request? In ArrayContains, we currently cast the right

spark git commit: [SPARK-25460][SS] DataSourceV2: SS sources do not respect SessionConfigSupport

2018-09-20 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 89671a27e -> edf5cc64e [SPARK-25460][SS] DataSourceV2: SS sources do not respect SessionConfigSupport ## What changes were proposed in this pull request? This PR proposes to respect `SessionConfigSupport` in SS datasources as well.

spark git commit: Revert [SPARK-19355][SPARK-25352]

2018-09-20 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 7ff5386ed -> 89671a27e Revert [SPARK-19355][SPARK-25352] ## What changes were proposed in this pull request? This goes to revert sequential PRs based on some discussion and comments at

svn commit: r29540 - in /dev/spark/2.5.0-SNAPSHOT-2018_09_20_04_02-7ff5386-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 11:16:49 2018 New Revision: 29540 Log: Apache Spark 2.5.0-SNAPSHOT-2018_09_20_04_02-7ff5386 docs [This commit notification would consist of 1485 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

svn commit: r29536 - in /dev/spark/2.4.1-SNAPSHOT-2018_09_20_02_02-e07042a-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 09:16:44 2018 New Revision: 29536 Log: Apache Spark 2.4.1-SNAPSHOT-2018_09_20_02_02-e07042a docs [This commit notification would consist of 1475 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

svn commit: r29534 - in /dev/spark/2.5.0-SNAPSHOT-2018_09_20_00_02-0e31a6f-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-09-20 Thread pwendell
Author: pwendell Date: Thu Sep 20 07:17:52 2018 New Revision: 29534 Log: Apache Spark 2.5.0-SNAPSHOT-2018_09_20_00_02-0e31a6f docs [This commit notification would consist of 1485 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [MINOR][PYTHON][TEST] Use collect() instead of show() to make the output silent

2018-09-20 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.4 dfcff3839 -> e07042a35 [MINOR][PYTHON][TEST] Use collect() instead of show() to make the output silent ## What changes were proposed in this pull request? This PR replace an effective `show()` to `collect()` to make the output silent.

spark git commit: [MINOR][PYTHON][TEST] Use collect() instead of show() to make the output silent

2018-09-20 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 0e31a6f25 -> 7ff5386ed [MINOR][PYTHON][TEST] Use collect() instead of show() to make the output silent ## What changes were proposed in this pull request? This PR replace an effective `show()` to `collect()` to make the output silent.