spark git commit: [SPARK-25178][SQL] Directly ship the StructType objects of the keySchema / valueSchema for xxxHashMapGenerator

2018-08-23 Thread ueshin
Repository: spark Updated Branches: refs/heads/master 9b6baeb7b -> ab3302895 [SPARK-25178][SQL] Directly ship the StructType objects of the keySchema / valueSchema for xxxHashMapGenerator ## What changes were proposed in this pull request? This PR generates the code that to refer a

svn commit: r28925 - in /dev/spark/2.3.3-SNAPSHOT-2018_08_23_22_01-fcc9bd6-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-23 Thread pwendell
Author: pwendell Date: Fri Aug 24 05:15:41 2018 New Revision: 28925 Log: Apache Spark 2.3.3-SNAPSHOT-2018_08_23_22_01-fcc9bd6 docs [This commit notification would consist of 1443 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25029][BUILD][CORE] Janino "Two non-abstract methods ..." errors

2018-08-23 Thread lixiao
Repository: spark Updated Branches: refs/heads/master f2d35427e -> 9b6baeb7b [SPARK-25029][BUILD][CORE] Janino "Two non-abstract methods ..." errors ## What changes were proposed in this pull request? Update to janino 3.0.9 to address Java 8 + Scala 2.12 incompatibility. The error manifests

spark git commit: [SPARK-4502][SQL] Parquet nested column pruning - foundation

2018-08-23 Thread lixiao
Repository: spark Updated Branches: refs/heads/master cd6dff78b -> f2d35427e [SPARK-4502][SQL] Parquet nested column pruning - foundation (Link to Jira: https://issues.apache.org/jira/browse/SPARK-4502) _N.B. This is a restart of PR #16578 which includes a subset of that code. Relevant

svn commit: r28923 - in /dev/spark/2.4.0-SNAPSHOT-2018_08_23_20_01-cd6dff7-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-23 Thread pwendell
Author: pwendell Date: Fri Aug 24 03:16:10 2018 New Revision: 28923 Log: Apache Spark 2.4.0-SNAPSHOT-2018_08_23_20_01-cd6dff7 docs [This commit notification would consist of 1478 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25209][SQL] Avoid deserializer check in Dataset.apply when Dataset is actually DataFrame

2018-08-23 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master b88ddb8a8 -> cd6dff78b [SPARK-25209][SQL] Avoid deserializer check in Dataset.apply when Dataset is actually DataFrame ## What changes were proposed in this pull request? Dataset.apply calls dataset.deserializer (to provide an early

spark git commit: [SPARK-23425][SQL] Support wildcard in HDFS path for load table command

2018-08-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 0ce09ec54 -> b88ddb8a8 [SPARK-23425][SQL] Support wildcard in HDFS path for load table command ## What changes were proposed in this pull request? **Problem statement** load data command with hdfs file paths consists of wild card

spark git commit: [SPARK-25205][CORE] Fix typo in spark.network.crypto.keyFactoryIterations

2018-08-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master b5e118808 -> 0ce09ec54 [SPARK-25205][CORE] Fix typo in spark.network.crypto.keyFactoryIterations Closes #22195 from squito/SPARK-25205. Authored-by: Imran Rashid Signed-off-by: hyukjinkwon Project:

spark git commit: [SPARK-25205][CORE] Fix typo in spark.network.crypto.keyFactoryIterations

2018-08-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 9cb9d7201 -> fcc9bd632 [SPARK-25205][CORE] Fix typo in spark.network.crypto.keyFactoryIterations Closes #22195 from squito/SPARK-25205. Authored-by: Imran Rashid Signed-off-by: hyukjinkwon (cherry picked from commit

spark git commit: [SPARK-25124][ML] VectorSizeHint setSize and getSize don't return values

2018-08-23 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 8ed044928 -> b5e118808 [SPARK-25124][ML] VectorSizeHint setSize and getSize don't return values ## What changes were proposed in this pull request? In feature.py, VectorSizeHint setSize and getSize don't return value. Add return. ## How

svn commit: r28921 - in /dev/spark/2.4.0-SNAPSHOT-2018_08_23_16_02-8ed0449-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-23 Thread pwendell
Author: pwendell Date: Thu Aug 23 23:16:22 2018 New Revision: 28921 Log: Apache Spark 2.4.0-SNAPSHOT-2018_08_23_16_02-8ed0449 docs [This commit notification would consist of 1478 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-23207][SPARK-22905][SPARK-24564][SPARK-25114][SQL][BACKPORT-2.2] Shuffle+Repartition on a DataFrame could lead to incorrect answers

2018-08-23 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.2 124789b62 -> d7c3aae20 [SPARK-23207][SPARK-22905][SPARK-24564][SPARK-25114][SQL][BACKPORT-2.2] Shuffle+Repartition on a DataFrame could lead to incorrect answers ## What changes were proposed in this pull request? Back port of

spark git commit: [SPARK-25204][SS] Fix race in rate source test.

2018-08-23 Thread tdas
Repository: spark Updated Branches: refs/heads/master a9aacdf1c -> 8ed044928 [SPARK-25204][SS] Fix race in rate source test. ## What changes were proposed in this pull request? Fix a race in the rate source tests. We need a better way of testing restart behavior. ## How was this patch

svn commit: r28916 - in /dev/spark/2.4.0-SNAPSHOT-2018_08_23_08_02-a9aacdf-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-23 Thread pwendell
Author: pwendell Date: Thu Aug 23 15:16:16 2018 New Revision: 28916 Log: Apache Spark 2.4.0-SNAPSHOT-2018_08_23_08_02-a9aacdf docs [This commit notification would consist of 1478 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25208][SQL] Loosen Cast.forceNullable for DecimalType.

2018-08-23 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 5d572fc7c -> a9aacdf1c [SPARK-25208][SQL] Loosen Cast.forceNullable for DecimalType. ## What changes were proposed in this pull request? Casting to `DecimalType` is not always needed to force nullable. If the decimal type to cast is wider

spark git commit: [SPARK-25126][SQL] Avoid creating Reader for all orc files

2018-08-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master e3b7bb413 -> 5d572fc7c [SPARK-25126][SQL] Avoid creating Reader for all orc files ## What changes were proposed in this pull request? [SPARK-25126] (https://issues.apache.org/jira/browse/SPARK-25126) reports loading a large number of orc

svn commit: r28905 - in /dev/spark/2.4.0-SNAPSHOT-2018_08_23_04_02-e3b7bb4-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-23 Thread pwendell
Author: pwendell Date: Thu Aug 23 11:18:00 2018 New Revision: 28905 Log: Apache Spark 2.4.0-SNAPSHOT-2018_08_23_04_02-e3b7bb4 docs [This commit notification would consist of 1478 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

svn commit: r28903 - in /dev/spark/2.4.0-SNAPSHOT-2018_08_23_00_01-8cc591c-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-23 Thread pwendell
Author: pwendell Date: Thu Aug 23 07:16:22 2018 New Revision: 28903 Log: Apache Spark 2.4.0-SNAPSHOT-2018_08_23_00_01-8cc591c docs [This commit notification would consist of 1478 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-24811][FOLLOWUP][SQL] Revise package of AvroDataToCatalyst and CatalystDataToAvro

2018-08-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 8cc591c91 -> e3b7bb413 [SPARK-24811][FOLLOWUP][SQL] Revise package of AvroDataToCatalyst and CatalystDataToAvro ## What changes were proposed in this pull request? In https://github.com/apache/spark/pull/21838, the class

spark git commit: [SPARK-25164][SQL] Avoid rebuilding column and path list for each column in parquet reader

2018-08-23 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 2a0a8f753 -> 8cc591c91 [SPARK-25164][SQL] Avoid rebuilding column and path list for each column in parquet reader ## What changes were proposed in this pull request? VectorizedParquetRecordReader::initializeInternal rebuilds the column

spark git commit: [SPARK-23034][SQL] Show RDD/relation names in RDD/Hive table scan nodes

2018-08-23 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 49720906c -> 2a0a8f753 [SPARK-23034][SQL] Show RDD/relation names in RDD/Hive table scan nodes ## What changes were proposed in this pull request? This pr proposed to show RDD/relation names in RDD/Hive table scan nodes. This change made

spark git commit: [SPARK-23932][SQL][FOLLOW-UP] Fix an example of zip_with function.

2018-08-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 05974f943 -> 49720906c [SPARK-23932][SQL][FOLLOW-UP] Fix an example of zip_with function. ## What changes were proposed in this pull request? This is a follow-up pr of #22031 which added `zip_with` function to fix an example. ## How was