spark git commit: [SPARK-17010][MINOR][DOC] Wrong description in memory management document

2016-08-10 Thread rxin
Repository: spark Updated Branches: refs/heads/master 665e17532 -> 7a6a3c3fb [SPARK-17010][MINOR][DOC] Wrong description in memory management document ## What changes were proposed in this pull request? Change the remaining percentage to the correct one. ## How was this patch tested? Manual review

spark git commit: [SPARK-17010][MINOR][DOC] Wrong description in memory management document

2016-08-10 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 d3a30d2f0 -> 1e4013571 [SPARK-17010][MINOR][DOC] Wrong description in memory management document ## What changes were proposed in this pull request? Change the remaining percentage to the correct one. ## How was this patch tested? Manual review

spark git commit: [SPARK-17007][SQL] Move test data files into a test-data folder

2016-08-10 Thread rxin
Repository: spark Updated Branches: refs/heads/master 425c7c2db -> 665e17532 [SPARK-17007][SQL] Move test data files into a test-data folder ## What changes were proposed in this pull request? This patch moves all the test data files in sql/core/src/test/resources to

spark git commit: [SPARK-17008][SPARK-17009][SQL] Normalization and isolation in SQLQueryTestSuite.

2016-08-10 Thread rxin
Repository: spark Updated Branches: refs/heads/master ab648c000 -> 425c7c2db [SPARK-17008][SPARK-17009][SQL] Normalization and isolation in SQLQueryTestSuite. ## What changes were proposed in this pull request? This patch enhances SQLQueryTestSuite in two ways: 1. SPARK-17009: Use a new

[2/2] spark git commit: [SPARK-14743][YARN] Add a configurable credential manager for Spark running on YARN

2016-08-10 Thread vanzin
[SPARK-14743][YARN] Add a configurable credential manager for Spark running on YARN ## What changes were proposed in this pull request? Add a configurable token manager for Spark running on YARN. ### Current Problems ### 1. Supported token provider is hard-coded, currently only hdfs, hbase

[1/2] spark git commit: [SPARK-14743][YARN] Add a configurable credential manager for Spark running on YARN

2016-08-10 Thread vanzin
Repository: spark Updated Branches: refs/heads/master bd2c12fb4 -> ab648c000 http://git-wip-us.apache.org/repos/asf/spark/blob/ab648c00/yarn/src/main/scala/org/apache/spark/deploy/yarn/security/HiveCredentialProvider.scala --

spark git commit: [SPARK-12920][CORE] Honor "spark.ui.retainedStages" to reduce mem-pressure

2016-08-10 Thread vanzin
Repository: spark Updated Branches: refs/heads/master bf5cb8af4 -> bd2c12fb4 [SPARK-12920][CORE] Honor "spark.ui.retainedStages" to reduce mem-pressure When a large number of jobs are run concurrently with the Spark thrift server, the thrift server starts running at high CPU due to GC pressure. Job UI

spark git commit: [SPARK-16928] [SQL] Recursive call of ColumnVector::getInt() breaks JIT inlining

2016-08-10 Thread davies
Repository: spark Updated Branches: refs/heads/master 214ba66a0 -> bf5cb8af4 [SPARK-16928] [SQL] Recursive call of ColumnVector::getInt() breaks JIT inlining ## What changes were proposed in this pull request? In both `OnHeapColumnVector` and `OffHeapColumnVector`, we implemented `getInt()`

spark git commit: [SPARK-16579][SPARKR] add install.spark function

2016-08-10 Thread shivaram
Repository: spark Updated Branches: refs/heads/branch-2.0 977fbbfca -> d3a30d2f0 [SPARK-16579][SPARKR] add install.spark function Add an install_spark function to the SparkR package. Users can run `install_spark()` to install Spark to a local directory within R. Updates: Several changes

spark git commit: [SPARK-16579][SPARKR] add install.spark function

2016-08-10 Thread shivaram
Repository: spark Updated Branches: refs/heads/master d4a912243 -> 214ba66a0 [SPARK-16579][SPARKR] add install.spark function ## What changes were proposed in this pull request? Add an install_spark function to the SparkR package. Users can run `install_spark()` to install Spark to a local

spark git commit: [SPARK-16710][SPARKR][ML] spark.glm should support weightCol

2016-08-10 Thread shivaram
Repository: spark Updated Branches: refs/heads/master 19af298bb -> d4a912243 [SPARK-16710][SPARKR][ML] spark.glm should support weightCol ## What changes were proposed in this pull request? Training GLMs on weighted datasets is a very important use case, but it is not supported by SparkR

spark git commit: [SPARK-15639] [SPARK-16321] [SQL] Push down filter at RowGroups level for parquet reader

2016-08-10 Thread davies
Repository: spark Updated Branches: refs/heads/branch-2.0 15637f735 -> 977fbbfca [SPARK-15639] [SPARK-16321] [SQL] Push down filter at RowGroups level for parquet reader The base class `SpecificParquetRecordReaderBase` used for vectorized parquet reader will try to get pushed-down filters

spark git commit: [SPARK-15639] [SPARK-16321] [SQL] Push down filter at RowGroups level for parquet reader

2016-08-10 Thread davies
Repository: spark Updated Branches: refs/heads/master 11a6844be -> 19af298bb [SPARK-15639] [SPARK-16321] [SQL] Push down filter at RowGroups level for parquet reader ## What changes were proposed in this pull request? The base class `SpecificParquetRecordReaderBase` used for vectorized

spark git commit: Revert "[SPARK-15899][SQL] Fix the construction of the file path with hadoop Path"

2016-08-10 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 719ac5f37 -> 15637f735 Revert "[SPARK-15899][SQL] Fix the construction of the file path with hadoop Path" This reverts commit 719ac5f37ccf32c34c70524b8cf9a2699c71a353. Project: http://git-wip-us.apache.org/repos/asf/spark/repo

spark git commit: [SPARK-16866][SQL] Infrastructure for file-based SQL end-to-end tests

2016-08-10 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 0578ff968 -> b9f8a1170 [SPARK-16866][SQL] Infrastructure for file-based SQL end-to-end tests ## What changes were proposed in this pull request? This patch introduces SQLQueryTestSuite, a basic framework for end-to-end SQL test cases

spark git commit: [SPARK-16324][SQL] regexp_extract should doc that it returns empty string when match fails

2016-08-10 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 2285de734 -> 20efb7969 [SPARK-16324][SQL] regexp_extract should doc that it returns empty string when match fails ## What changes were proposed in this pull request? Doc that regexp_extract returns empty string when regex or group

spark git commit: [SPARK-16324][SQL] regexp_extract should doc that it returns empty string when match fails

2016-08-10 Thread srowen
Repository: spark Updated Branches: refs/heads/master eca58755f -> 0578ff968 [SPARK-16324][SQL] regexp_extract should doc that it returns empty string when match fails ## What changes were proposed in this pull request? Doc that regexp_extract returns empty string when regex or group does
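
The behaviour named in the SPARK-16324 subject line (an empty string, rather than null, when the match fails) can be illustrated with a minimal Python sketch. This is not Spark code: `regexp_extract_sketch` is a hypothetical helper, Python's `re` syntax differs from the Java regex syntax Spark uses, and the handling of a group that does not participate in the match is an assumption.

```python
import re


def regexp_extract_sketch(text, pattern, idx):
    """Hypothetical stand-in for the documented behaviour: return capture
    group `idx` of the first match, or "" when there is nothing to return."""
    match = re.search(pattern, text)
    if match is None:
        # The regex did not match anywhere in the input.
        return ""
    group = match.group(idx)
    # Assumption: a group that took no part in the match also yields "".
    return group if group is not None else ""


print(repr(regexp_extract_sketch("100-200", r"(\d+)-(\d+)", 1)))
print(repr(regexp_extract_sketch("foo", r"(\d+)-(\d+)", 1)))
```

The practical consequence is that a caller checking for "no match" compares the result against an empty string instead of testing for null.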

spark git commit: [SPARK-16927][SPARK-16923] Override task properties at dispatcher.

2016-08-10 Thread srowen
Repository: spark Updated Branches: refs/heads/master bfda53f63 -> eca58755f [SPARK-16927][SPARK-16923] Override task properties at dispatcher. ## What changes were proposed in this pull request? - enable setting default properties for all jobs submitted through the dispatcher [SPARK-16927]

spark git commit: Typo: Fow -> For

2016-08-10 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2b10ebe6a -> bfda53f63 Typo: Fow -> For Author: Andrew Ash Closes #14563 from ash211/patch-8. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: [SPARK-16185][SQL] Better Error Messages When Creating Table As Select Without Enabling Hive Support

2016-08-10 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 41a7dbdd3 -> 2b10ebe6a [SPARK-16185][SQL] Better Error Messages When Creating Table As Select Without Enabling Hive Support What changes were proposed in this pull request? When we do not turn on the Hive Support, the following query

spark git commit: [SPARK-16522][MESOS] Spark application throws exception on exit.

2016-08-10 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 475ee3815 -> 2285de734 [SPARK-16522][MESOS] Spark application throws exception on exit. This is a backport of https://github.com/apache/spark/pull/14175 to branch 2.0 Author: Sun Rui Closes #14575 from

spark git commit: [SPARK-10601][SQL] Support `MINUS` set operator

2016-08-10 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master bdd537164 -> 41a7dbdd3 [SPARK-10601][SQL] Support `MINUS` set operator ## What changes were proposed in this pull request? This PR adds the `MINUS` set operator, which is equivalent to `EXCEPT DISTINCT`. This will slightly improve the
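
For readers unfamiliar with `EXCEPT DISTINCT`, the semantics that `MINUS` is described as sharing can be sketched in Python. The sketch runs against SQLite from the standard library as a stand-in, not against Spark: SQLite's plain `EXCEPT` already removes duplicates and it has no `MINUS` keyword, and the helper name `except_distinct` and the table names `l` and `r` are hypothetical.

```python
import sqlite3


def except_distinct(left, right):
    """Return the distinct values of `left` that do not appear in `right`,
    computed with SQLite's EXCEPT (which removes duplicates)."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE l (x INTEGER)")
        conn.execute("CREATE TABLE r (x INTEGER)")
        conn.executemany("INSERT INTO l VALUES (?)", [(v,) for v in left])
        conn.executemany("INSERT INTO r VALUES (?)", [(v,) for v in right])
        rows = conn.execute(
            "SELECT x FROM l EXCEPT SELECT x FROM r ORDER BY x"
        ).fetchall()
        return [row[0] for row in rows]
    finally:
        conn.close()


# Duplicates in the left input collapse, and 2 is removed because it is in the right input.
print(except_distinct([1, 1, 2, 3, 3], [2]))  # prints [1, 3]
```

In Spark, per the commit message, the same query written with `MINUS` in place of `EXCEPT DISTINCT` would be expected to return the same rows.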

spark git commit: [SPARK-16959][SQL] Rebuild Table Comment when Retrieving Metadata from Hive Metastore

2016-08-10 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 1203c8415 -> bdd537164 [SPARK-16959][SQL] Rebuild Table Comment when Retrieving Metadata from Hive Metastore ### What changes were proposed in this pull request? The `comment` in `CatalogTable` returned from Hive is always empty. We store

spark git commit: [MINOR][SPARKR] R API documentation for "coltypes" is confusing

2016-08-10 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 9dc3e602d -> 1203c8415 [MINOR][SPARKR] R API documentation for "coltypes" is confusing ## What changes were proposed in this pull request? R API documentation for "coltypes" is confusing, found when working on another ticket. Current

spark git commit: Fixed typo

2016-08-10 Thread rxin
Repository: spark Updated Branches: refs/heads/master 121643bc7 -> 9dc3e602d Fixed typo ## What changes were proposed in this pull request? Fixed small typo - "value ... ~~in~~ is null" ## How was this patch tested? Still compiles! Author: Michał Kiełbowicz

spark git commit: Fixed typo

2016-08-10 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 2d136dba4 -> 475ee3815 Fixed typo ## What changes were proposed in this pull request? Fixed small typo - "value ... ~~in~~ is null" ## How was this patch tested? Still compiles! Author: Michał Kiełbowicz