spark git commit: [SPARK-25736][SQL][TEST] add tests to verify the behavior of multi-column count

2018-10-16 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.4 8bc7ab03d -> 77156f8c8 [SPARK-25736][SQL][TEST] add tests to verify the behavior of multi-column count ## What changes were proposed in this pull request? AFAIK multi-column count is not widely supported by the mainstream databases(po

spark git commit: [SPARK-25736][SQL][TEST] add tests to verify the behavior of multi-column count

2018-10-16 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 5c7f6b663 -> e028fd3ae [SPARK-25736][SQL][TEST] add tests to verify the behavior of multi-column count ## What changes were proposed in this pull request? AFAIK multi-column count is not widely supported by the mainstream databases(postgr

svn commit: r30086 - in /dev/spark/3.0.0-SNAPSHOT-2018_10_16_00_02-5c7f6b6-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-10-16 Thread pwendell
Author: pwendell Date: Tue Oct 16 07:16:59 2018 New Revision: 30086 Log: Apache Spark 3.0.0-SNAPSHOT-2018_10_16_00_02-5c7f6b6 docs [This commit notification would consist of 1481 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.] ---

svn commit: r30089 - in /dev/spark/2.4.1-SNAPSHOT-2018_10_16_02_02-77156f8-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-10-16 Thread pwendell
Author: pwendell Date: Tue Oct 16 09:22:20 2018 New Revision: 30089 Log: Apache Spark 2.4.1-SNAPSHOT-2018_10_16_02_02-77156f8 docs [This commit notification would consist of 1472 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.] ---

svn commit: r30090 - in /dev/spark/3.0.0-SNAPSHOT-2018_10_16_04_02-e028fd3-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-10-16 Thread pwendell
Author: pwendell Date: Tue Oct 16 11:20:45 2018 New Revision: 30090 Log: Apache Spark 3.0.0-SNAPSHOT-2018_10_16_04_02-e028fd3 docs [This commit notification would consist of 1481 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.] ---

spark git commit: [SPARK-25579][SQL] Use quoted attribute names if needed in pushed ORC predicates

2018-10-16 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master e028fd3ae -> 2c664edc0 [SPARK-25579][SQL] Use quoted attribute names if needed in pushed ORC predicates ## What changes were proposed in this pull request? This PR aims to fix an ORC performance regression at Spark 2.4.0 RCs from Spark 2.

spark git commit: [SPARK-25579][SQL] Use quoted attribute names if needed in pushed ORC predicates

2018-10-16 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.4 77156f8c8 -> 144cb949d [SPARK-25579][SQL] Use quoted attribute names if needed in pushed ORC predicates ## What changes were proposed in this pull request? This PR aims to fix an ORC performance regression at Spark 2.4.0 RCs from Spark

svn commit: r30091 - in /dev/spark/2.4.1-SNAPSHOT-2018_10_16_06_02-144cb94-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-10-16 Thread pwendell
Author: pwendell Date: Tue Oct 16 13:20:30 2018 New Revision: 30091 Log: Apache Spark 2.4.1-SNAPSHOT-2018_10_16_06_02-144cb94 docs [This commit notification would consist of 1472 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.] ---

[3/4] spark git commit: [SPARK-25705][BUILD][STREAMING][TEST-MAVEN] Remove Kafka 0.8 integration

2018-10-16 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/703e6da1/external/kafka-0-8/src/main/scala/org/apache/spark/streaming/kafka/KafkaCluster.scala -- diff --git a/external/kafka-0-8/src/main/scala/org/apache/spark/streaming/kafka/Ka

[4/4] spark git commit: [SPARK-25705][BUILD][STREAMING][TEST-MAVEN] Remove Kafka 0.8 integration

2018-10-16 Thread srowen
[SPARK-25705][BUILD][STREAMING][TEST-MAVEN] Remove Kafka 0.8 integration ## What changes were proposed in this pull request? Remove Kafka 0.8 integration ## How was this patch tested? Existing tests, build scripts Closes #22703 from srowen/SPARK-25705. Authored-by: Sean Owen Signed-off-by: S

[1/4] spark git commit: [SPARK-25705][BUILD][STREAMING][TEST-MAVEN] Remove Kafka 0.8 integration

2018-10-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2c664edc0 -> 703e6da1e http://git-wip-us.apache.org/repos/asf/spark/blob/703e6da1/python/pyspark/streaming/kafka.py -- diff --git a/python/pyspark/streaming/kafka.py b/pyth

[2/4] spark git commit: [SPARK-25705][BUILD][STREAMING][TEST-MAVEN] Remove Kafka 0.8 integration

2018-10-16 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/703e6da1/external/kafka-0-8/src/main/scala/org/apache/spark/streaming/kafka/ReliableKafkaReceiver.scala -- diff --git a/external/kafka-0-8/src/main/scala/org/apache/spark/streaming

svn commit: r30094 - in /dev/spark/3.0.0-SNAPSHOT-2018_10_16_08_02-703e6da-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-10-16 Thread pwendell
Author: pwendell Date: Tue Oct 16 15:20:45 2018 New Revision: 30094 Log: Apache Spark 3.0.0-SNAPSHOT-2018_10_16_08_02-703e6da docs [This commit notification would consist of 1478 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.] ---

spark git commit: [SPARK-25394][CORE] Add an application status metrics source

2018-10-16 Thread vanzin
Repository: spark Updated Branches: refs/heads/master 703e6da1e -> bd2c44713 [SPARK-25394][CORE] Add an application status metrics source - Exposes several metrics regarding application status as a source, useful to scrape them via jmx instead of mining the metrics rest api. Example use case

spark git commit: [SPARK-25631][SPARK-25632][SQL][TEST] Improve the test runtime of KafkaRDDSuite

2018-10-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master bd2c44713 -> 9d4dd7992 [SPARK-25631][SPARK-25632][SQL][TEST] Improve the test runtime of KafkaRDDSuite ## What changes were proposed in this pull request? Set a reasonable poll timeout thats used while consuming topics/partitions from kafk

svn commit: r30104 - in /dev/spark/3.0.0-SNAPSHOT-2018_10_16_16_03-9d4dd79-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-10-16 Thread pwendell
Author: pwendell Date: Tue Oct 16 23:20:50 2018 New Revision: 30104 Log: Apache Spark 3.0.0-SNAPSHOT-2018_10_16_16_03-9d4dd79 docs [This commit notification would consist of 1478 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.] ---

[2/2] spark git commit: [SPARK-25393][SQL] Adding new function from_csv()

2018-10-16 Thread gurwls223
[SPARK-25393][SQL] Adding new function from_csv() ## What changes were proposed in this pull request? The PR adds new function `from_csv()` similar to `from_json()` to parse columns with CSV strings. I added the following methods: ```Scala def from_csv(e: Column, schema: StructType, options: Map

[1/2] spark git commit: [SPARK-25393][SQL] Adding new function from_csv()

2018-10-16 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 9d4dd7992 -> e9af9460b http://git-wip-us.apache.org/repos/asf/spark/blob/e9af9460/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/csv/CSVOptions.scala --

spark git commit: [SPARK-25734][SQL] Literal should have a value corresponding to dataType

2018-10-16 Thread wenchen
Repository: spark Updated Branches: refs/heads/master e9af9460b -> a9f685bb7 [SPARK-25734][SQL] Literal should have a value corresponding to dataType ## What changes were proposed in this pull request? `Literal.value` should have a value a value corresponding to `dataType`. This pr added code

svn commit: r30106 - in /dev/spark/3.0.0-SNAPSHOT-2018_10_16_20_02-e9af946-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-10-16 Thread pwendell
Author: pwendell Date: Wed Oct 17 03:20:33 2018 New Revision: 30106 Log: Apache Spark 3.0.0-SNAPSHOT-2018_10_16_20_02-e9af946 docs [This commit notification would consist of 1478 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.] ---

spark git commit: [SQL][CATALYST][MINOR] update some error comments

2018-10-16 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master a9f685bb7 -> e9332f600 [SQL][CATALYST][MINOR] update some error comments ## What changes were proposed in this pull request? this PR correct some comment error: 1. change from "as low a possible" to "as low as possible" in RewriteDistinct

spark git commit: [SQL][CATALYST][MINOR] update some error comments

2018-10-16 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.4 144cb949d -> 3591bd229 [SQL][CATALYST][MINOR] update some error comments ## What changes were proposed in this pull request? this PR correct some comment error: 1. change from "as low a possible" to "as low as possible" in RewriteDist

svn commit: r30107 - in /dev/spark/2.4.1-SNAPSHOT-2018_10_16_22_02-3591bd2-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-10-16 Thread pwendell
Author: pwendell Date: Wed Oct 17 05:20:25 2018 New Revision: 30107 Log: Apache Spark 2.4.1-SNAPSHOT-2018_10_16_22_02-3591bd2 docs [This commit notification would consist of 1472 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.] ---