svn commit: r30903 - in /dev/spark/3.0.0-SNAPSHOT-2018_11_14_20_33-03306a6-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-11-14 Thread pwendell
Author: pwendell Date: Thu Nov 15 04:45:51 2018 New Revision: 30903 Log: Apache Spark 3.0.0-SNAPSHOT-2018_11_14_20_33-03306a6 docs [This commit notification would consist of 1755 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

[3/4] spark git commit: [SPARK-26036][PYTHON] Break large tests.py files into smaller files

2018-11-14 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark/blob/03306a6d/python/pyspark/tests.py -- diff --git a/python/pyspark/tests.py b/python/pyspark/tests.py deleted file mode 100644 index 131c51e..000 --- a/python/pyspark/tests.py +++

[1/4] spark git commit: [SPARK-26036][PYTHON] Break large tests.py files into smaller files

2018-11-14 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master f6255d7b7 -> 03306a6df http://git-wip-us.apache.org/repos/asf/spark/blob/03306a6d/python/pyspark/tests/test_readwrite.py -- diff --git

[2/4] spark git commit: [SPARK-26036][PYTHON] Break large tests.py files into smaller files

2018-11-14 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark/blob/03306a6d/python/pyspark/tests/__init__.py -- diff --git a/python/pyspark/tests/__init__.py b/python/pyspark/tests/__init__.py new file mode 100644 index 000..12bdf0d ---

svn commit: r30902 - in /dev/spark/3.0.0-SNAPSHOT-2018_11_14_16_25-ad853c5-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-11-14 Thread pwendell
Author: pwendell Date: Thu Nov 15 00:38:01 2018 New Revision: 30902 Log: Apache Spark 3.0.0-SNAPSHOT-2018_11_14_16_25-ad853c5 docs [This commit notification would consist of 1755 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [MINOR][SQL] Add disable bucketedRead workaround when throw RuntimeException

2018-11-14 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master ad853c567 -> f6255d7b7 [MINOR][SQL] Add disable bucketedRead workaround when throw RuntimeException ## What changes were proposed in this pull request? It will throw `RuntimeException` when read from bucketed table(about 1.7G per bucket

spark git commit: [SPARK-25956] Make Scala 2.12 as default Scala version in Spark 3.0

2018-11-14 Thread dongjoon
Repository: spark Updated Branches: refs/heads/master 2977e2312 -> ad853c567 [SPARK-25956] Make Scala 2.12 as default Scala version in Spark 3.0 ## What changes were proposed in this pull request? This PR makes Spark's default Scala version as 2.12, and Scala 2.11 will be the alternative

svn commit: r30901 - in /dev/spark/2.4.1-SNAPSHOT-2018_11_14_14_19-ba638a7-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-11-14 Thread pwendell
Author: pwendell Date: Wed Nov 14 22:33:54 2018 New Revision: 30901 Log: Apache Spark 2.4.1-SNAPSHOT-2018_11_14_14_19-ba638a7 docs [This commit notification would consist of 1476 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25986][BUILD] Add rules to ban throw Errors in application code

2018-11-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2b671e729 -> 2977e2312 [SPARK-25986][BUILD] Add rules to ban throw Errors in application code ## What changes were proposed in this pull request? Add scala and java lint check rules to ban the usage of `throw new xxxErrors` and fix up

spark git commit: [SPARK-25778] WriteAheadLogBackedBlockRDD in YARN Cluster Mode Fails …

2018-11-14 Thread vanzin
Repository: spark Updated Branches: refs/heads/branch-2.4 ca426bfa5 -> ba638a783 [SPARK-25778] WriteAheadLogBackedBlockRDD in YARN Cluster Mode Fails … …due lack of access to tmpDir from $PWD to HDFS WriteAheadLogBackedBlockRDD usage of java.io.tmpdir will fail if $PWD resolves to a

spark git commit: [SPARK-25778] WriteAheadLogBackedBlockRDD in YARN Cluster Mode Fails …

2018-11-14 Thread vanzin
Repository: spark Updated Branches: refs/heads/master 722369ee5 -> 2b671e729 [SPARK-25778] WriteAheadLogBackedBlockRDD in YARN Cluster Mode Fails … …due lack of access to tmpDir from $PWD to HDFS WriteAheadLogBackedBlockRDD usage of java.io.tmpdir will fail if $PWD resolves to a folder

spark git commit: [SPARK-24421][BUILD][CORE] Accessing sun.misc.Cleaner in JDK11

2018-11-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master 922dfe486 -> 722369ee5 [SPARK-24421][BUILD][CORE] Accessing sun.misc.Cleaner in JDK11 …. Other related changes to get JDK 11 working, to test ## What changes were proposed in this pull request? - Access `sun.misc.Cleaner` (Java 8) and

svn commit: r30897 - in /dev/spark/3.0.0-SNAPSHOT-2018_11_14_12_12-922dfe4-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-11-14 Thread pwendell
Author: pwendell Date: Wed Nov 14 20:27:01 2018 New Revision: 30897 Log: Apache Spark 3.0.0-SNAPSHOT-2018_11_14_12_12-922dfe4 docs [This commit notification would consist of 1471 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25965][SQL][TEST] Add avro read benchmark

2018-11-14 Thread dongjoon
Repository: spark Updated Branches: refs/heads/master 4035c98a0 -> 922dfe486 [SPARK-25965][SQL][TEST] Add avro read benchmark Add read benchmark for Avro, which is missing for a period. The benchmark is similar to `DataSourceReadBenchmark` and `OrcReadBenchmark` Manually run benchmark

spark git commit: [SPARK-26042][SS][TESTS] Fix a potential hang in KafkaContinuousSourceTopicDeletionSuite

2018-11-14 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 5f11e8c4c -> 4035c98a0 [SPARK-26042][SS][TESTS] Fix a potential hang in KafkaContinuousSourceTopicDeletionSuite ## What changes were proposed in this pull request? As initializing lazy vals shares the same lock, a thread is trying to

spark git commit: [SPARK-26042][SS][TESTS] Fix a potential hang in KafkaContinuousSourceTopicDeletionSuite

2018-11-14 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.4 e2e1f0ad8 -> ca426bfa5 [SPARK-26042][SS][TESTS] Fix a potential hang in KafkaContinuousSourceTopicDeletionSuite ## What changes were proposed in this pull request? As initializing lazy vals shares the same lock, a thread is trying to

spark git commit: [SPARK-25118][CORE] Persist Driver Logs in Client mode to Hdfs

2018-11-14 Thread vanzin
Repository: spark Updated Branches: refs/heads/master e503065fd -> 5f11e8c4c [SPARK-25118][CORE] Persist Driver Logs in Client mode to Hdfs Currently, we do not have a mechanism to collect driver logs if a user chooses to run their application in client mode. This is a big issue as admin

svn commit: r30892 - in /dev/spark/3.0.0-SNAPSHOT-2018_11_14_07_44-e503065-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-11-14 Thread pwendell
Author: pwendell Date: Wed Nov 14 15:58:48 2018 New Revision: 30892 Log: Apache Spark 3.0.0-SNAPSHOT-2018_11_14_07_44-e503065 docs [This commit notification would consist of 1471 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25868][MLLIB] One part of Spark MLlib Kmean Logic Performance problem

2018-11-14 Thread srowen
Repository: spark Updated Branches: refs/heads/master a7a331df6 -> e503065fd [SPARK-25868][MLLIB] One part of Spark MLlib Kmean Logic Performance problem ## What changes were proposed in this pull request? Fix fastSquaredDistance to calculate dense-dense situation calculation performance