spark git commit: [SPARK-22721] BytesToBytesMap peak memory usage not accurate after reset()

2017-12-07 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master 8ae004b46 -> d32337b1e [SPARK-22721] BytesToBytesMap peak memory usage not accurate after reset() ## What changes were proposed in this pull request? BytesToBytesMap doesn't update peak memory usage before shrinking back to initial capaci
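
The problem described above (a peak-usage counter that misses the high-water mark because the structure shrinks first) can be illustrated with a minimal sketch. This is not the `BytesToBytesMap` code; `TrackedBuffer`, its fields, and the slot-size arithmetic are hypothetical names chosen for the example. The only point it makes is that the peak has to be recorded before `reset()` shrinks the capacity.

```python
class TrackedBuffer:
    """Toy stand-in (hypothetical) for a growable map that reports peak memory."""

    def __init__(self, initial_capacity: int = 64, bytes_per_slot: int = 8):
        self.initial_capacity = initial_capacity
        self.bytes_per_slot = bytes_per_slot
        self.capacity = initial_capacity
        self.peak_memory_used = 0

    def memory_used(self) -> int:
        # Current footprint, assumed proportional to capacity for this sketch.
        return self.capacity * self.bytes_per_slot

    def _update_peak(self) -> None:
        self.peak_memory_used = max(self.peak_memory_used, self.memory_used())

    def grow(self) -> None:
        self.capacity *= 2

    def reset(self) -> None:
        # Record the high-water mark *before* shrinking; otherwise the
        # largest footprint ever reached would be lost.
        self._update_peak()
        self.capacity = self.initial_capacity

    def get_peak_memory_used(self) -> int:
        self._update_peak()
        return self.peak_memory_used
```

If `reset()` skipped the `_update_peak()` call, a caller that only asked for the peak after the reset would see the footprint of the initial capacity instead of the largest one reached.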

[2/2] spark git commit: [SPARK-22672][SQL][TEST] Refactor ORC Tests

2017-12-07 Thread wenchen
[SPARK-22672][SQL][TEST] Refactor ORC Tests ## What changes were proposed in this pull request? Since SPARK-20682, we have two `OrcFileFormat`s. This PR refactors ORC tests with three principles (with a few exceptions) 1. Move test suite into `sql/core`. 2. Create `HiveXXX` test suite in `sql/hi

[1/2] spark git commit: [SPARK-22672][SQL][TEST] Refactor ORC Tests

2017-12-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master d32337b1e -> c1e5688d1 http://git-wip-us.apache.org/repos/asf/spark/blob/c1e5688d/sql/hive/src/test/scala/org/apache/spark/sql/hive/orc/HiveOrcQuerySuite.scala -- diff --git

spark git commit: [SPARK-22703][SQL] make ColumnarRow an immutable view

2017-12-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master c1e5688d1 -> e103adf45 [SPARK-22703][SQL] make ColumnarRow an immutable view ## What changes were proposed in this pull request? Similar to https://github.com/apache/spark/pull/19842 , we should also make `ColumnarRow` an immutable view,

spark git commit: [SPARK-22705][SQL] Case, Coalesce, and In use less global variables

2017-12-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master e103adf45 -> ea2fbf419 [SPARK-22705][SQL] Case, Coalesce, and In use less global variables ## What changes were proposed in this pull request? This PR accomplishes the following two items. 1. Reduce # of global variables from two to one f

spark git commit: [SPARK-22452][SQL] Add getInt, getLong, getBoolean to DataSourceV2Options

2017-12-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master ea2fbf419 -> 2be448260 [SPARK-22452][SQL] Add getInt, getLong, getBoolean to DataSourceV2Options - Implemented methods getInt, getLong, getBoolean for DataSourceV2Options - Added new unit tests to exercise these methods Author: Sunitha Kam
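
As a rough illustration of what typed getters over a string-valued options map look like, here is a small sketch. It is not the `DataSourceV2Options` implementation: the `Options` class, the `get_int` and `get_boolean` names, the case-insensitive key handling, and the rule that only the string "true" counts as true are all assumptions made for the example. Python integers are unbounded, so a separate long getter is omitted.

```python
class Options:
    """Hypothetical string-to-string options map with typed accessors."""

    def __init__(self, raw: dict):
        # Assumption: keys are matched case-insensitively.
        self._raw = {key.lower(): value for key, value in raw.items()}

    def get_int(self, key: str, default: int) -> int:
        value = self._raw.get(key.lower())
        # Fall back to the default only when the key is absent;
        # a malformed number still raises ValueError.
        return int(value) if value is not None else default

    def get_boolean(self, key: str, default: bool) -> bool:
        value = self._raw.get(key.lower())
        if value is None:
            return default
        # Assumption: only "true" (any letter case) is treated as True.
        return value.lower() == "true"
```

Returning the default only for missing keys, and letting malformed values raise, keeps configuration typos visible instead of silently replacing them.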

spark git commit: [SPARK-22618][CORE] Catch exception in removeRDD to stop jobs from dying

2017-12-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 2be448260 -> beb717f64 [SPARK-22618][CORE] Catch exception in removeRDD to stop jobs from dying ## What changes were proposed in this pull request? I propose that BlockManagerMasterEndpoint.removeRdd() should catch and log any IOException
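
The idea of catching and logging an I/O failure during cleanup, rather than letting it propagate and kill the caller, can be sketched as follows. This is not the `BlockManagerMasterEndpoint` code; `remove_rdd_everywhere`, the executor callables, and the logger name are hypothetical, and Python's `OSError` stands in for Java's `IOException`.

```python
import logging

logger = logging.getLogger("remove_rdd_sketch")


def remove_rdd_everywhere(rdd_id, executors):
    """Ask every executor to drop its blocks for rdd_id; return the total removed.

    `executors` maps a name to a callable returning the number of blocks removed.
    """
    removed = 0
    for name, remove_fn in executors.items():
        try:
            removed += remove_fn(rdd_id)
        except OSError as exc:
            # Cleanup is best effort: log the failure and continue with
            # the remaining executors instead of failing the whole call.
            logger.warning("Error removing RDD %s from %s: %s", rdd_id, name, exc)
    return removed


def healthy_executor(rdd_id):
    # Hypothetical executor that successfully removes three blocks.
    return 3


def lost_executor(rdd_id):
    # Hypothetical executor whose connection has already gone away.
    raise ConnectionResetError("executor lost")
```

Only `OSError` is caught here, so unrelated bugs (for example a `TypeError`) still surface rather than being swallowed.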

spark git commit: [SPARK-22712][SQL] Use `buildReaderWithPartitionValues` in native OrcFileFormat

2017-12-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master beb717f64 -> dd59a4be3 [SPARK-22712][SQL] Use `buildReaderWithPartitionValues` in native OrcFileFormat ## What changes were proposed in this pull request? To support vectorization in native OrcFileFormat later, we need to use `buildReader

spark git commit: [SPARK-22699][SQL] GenerateSafeProjection should not use global variables for struct

2017-12-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master dd59a4be3 -> fc2944630 [SPARK-22699][SQL] GenerateSafeProjection should not use global variables for struct ## What changes were proposed in this pull request? GenerateSafeProjection is defining a mutable state for each struct, which is

spark git commit: [SPARK-22696][SQL] objects functions should not use unneeded global variables

2017-12-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master fc2944630 -> b79071910 [SPARK-22696][SQL] objects functions should not use unneeded global variables ## What changes were proposed in this pull request? Some objects functions are using global variables which are not needed. This can gene

spark git commit: [SPARK-22660][BUILD] Use position() and limit() to fix ambiguity issue in scala-2.12

2017-12-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master b79071910 -> f41c0a93f [SPARK-22660][BUILD] Use position() and limit() to fix ambiguity issue in scala-2.12 …a-2.12 and JDK9 ## What changes were proposed in this pull request? Some compile error after upgrading to scala-2.12 ```javascr

svn commit: r23431 - /release/spark/KEYS

2017-12-07 Thread srowen
Author: srowen Date: Thu Dec 7 16:47:55 2017 New Revision: 23431 Log: Add Felix Cheung signing key for Spark Modified: release/spark/KEYS Modified: release/spark/KEYS == --- release/spark/KEYS (original) +++ release

spark git commit: [SPARK-22719][SQL] Refactor ConstantPropagation

2017-12-07 Thread lixiao
Repository: spark Updated Branches: refs/heads/master f41c0a93f -> 18b75d465 [SPARK-22719][SQL] Refactor ConstantPropagation ## What changes were proposed in this pull request? The current time complexity of ConstantPropagation is O(n^2), which can be slow when the query is complex. Refactor
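
For readers unfamiliar with the optimization, the sketch below shows the general idea of constant propagation over a conjunction of predicates in a fixed number of linear passes. It is not Catalyst's `ConstantPropagation` rule and does not reproduce the refactoring in this commit; the tuple encoding of predicates and the `propagate_constants` name are assumptions made purely for illustration.

```python
def _is_binding(op, left, right):
    # A binding is an equality between a column name (str) and an int literal.
    return op == "=" and (
        (isinstance(left, str) and isinstance(right, int))
        or (isinstance(left, int) and isinstance(right, str))
    )


def propagate_constants(conjuncts):
    """conjuncts: list of (op, left, right); operands are column names or ints."""
    # Pass 1: collect every `column = literal` binding in a single scan.
    constants = {}
    for op, left, right in conjuncts:
        if _is_binding(op, left, right):
            column, literal = (left, right) if isinstance(left, str) else (right, left)
            constants.setdefault(column, literal)

    def substitute(operand):
        if isinstance(operand, str):
            return constants.get(operand, operand)
        return operand

    # Pass 2: rewrite the remaining predicates with a dictionary lookup each,
    # leaving the binding predicates themselves untouched.
    rewritten = []
    for op, left, right in conjuncts:
        if _is_binding(op, left, right):
            rewritten.append((op, left, right))
        else:
            rewritten.append((op, substitute(left), substitute(right)))
    return rewritten
```

Because the bindings are gathered once into a dictionary, each predicate is visited a constant number of times, instead of re-scanning the whole conjunction for every equality found.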

spark git commit: [SPARK-22688][SQL] Upgrade Janino version to 3.0.8

2017-12-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.2 7fd6d535d -> 20846758b [SPARK-22688][SQL] Upgrade Janino version to 3.0.8 This PR upgrades the Janino version to 3.0.8. [Janino 3.0.8](https://janino-compiler.github.io/janino/changelog.html) includes an important fix to reduce the number

spark git commit: [SPARK-22688][SQL][HOTFIX] Upgrade Janino version to 3.0.8

2017-12-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.2 20846758b -> 9e2d96d1d [SPARK-22688][SQL][HOTFIX] Upgrade Janino version to 3.0.8 ## What changes were proposed in this pull request? Hotfix inadvertent change to xmlbuilder dep when updating Janino. See backport of https://github.com/

spark git commit: [SPARK-22279][SQL] Turn on spark.sql.hive.convertMetastoreOrc by default

2017-12-07 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 18b75d465 -> aa1764ba1 [SPARK-22279][SQL] Turn on spark.sql.hive.convertMetastoreOrc by default ## What changes were proposed in this pull request? Like Parquet, this PR aims to turn on `spark.sql.hive.convertMetastoreOrc` by default. ##

spark git commit: [SPARK-21787][SQL] Support for pushing down filters for DateType in native OrcFileFormat

2017-12-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master aa1764ba1 -> 0ba8f4b21 [SPARK-21787][SQL] Support for pushing down filters for DateType in native OrcFileFormat ## What changes were proposed in this pull request? This PR adds support for pushing down filters for DateType in ORC ## How was t

spark git commit: [SPARK-22688][SQL] Upgrade Janino version to 3.0.8

2017-12-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.1 768b34a2e -> 0f6862106 [SPARK-22688][SQL] Upgrade Janino version to 3.0.8 This PR upgrades the Janino version to 3.0.8. [Janino 3.0.8](https://janino-compiler.github.io/janino/changelog.html) includes an important fix to reduce the number

spark git commit: [SPARK-22187][SS][REVERT] Revert change in state row format for mapGroupsWithState

2017-12-07 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 0ba8f4b21 -> b11869bc3 [SPARK-22187][SS][REVERT] Revert change in state row format for mapGroupsWithState ## What changes were proposed in this pull request? #19416 changed the format in which rows were encoded in the state store. Howeve

spark git commit: [SPARK-22452][SQL] Add getDouble to DataSourceV2Options

2017-12-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master b11869bc3 -> f88a67bf0 [SPARK-22452][SQL] Add getDouble to DataSourceV2Options - Implemented getDouble method in DataSourceV2Options - Add unit test Author: Sunitha Kambhampati Closes #19921 from skambha/ds2. Project: http://git-wip-us