spark git commit: [SPARK-18200][GRAPHX][FOLLOW-UP] Support zero as an initial capacity in OpenHashSet

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 dae1581d9 -> c864e8a80 [SPARK-18200][GRAPHX][FOLLOW-UP] Support zero as an initial capacity in OpenHashSet ## What changes were proposed in this pull request? This is a follow-up PR of #15741 in order to keep `nextPowerOf2` consistent

spark git commit: [SPARK-18200][GRAPHX][FOLLOW-UP] Support zero as an initial capacity in OpenHashSet

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 8e145a94b -> cfe76028b [SPARK-18200][GRAPHX][FOLLOW-UP] Support zero as an initial capacity in OpenHashSet ## What changes were proposed in this pull request? This is a follow-up PR of #15741 in order to keep `nextPowerOf2` consistent

spark git commit: [SPARK-18200][GRAPHX][FOLLOW-UP] Support zero as an initial capacity in OpenHashSet

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master a08463b1d -> 27602c337 [SPARK-18200][GRAPHX][FOLLOW-UP] Support zero as an initial capacity in OpenHashSet ## What changes were proposed in this pull request? This is a follow-up PR of #15741 in order to keep `nextPowerOf2` consistent. *

spark git commit: [SPARK-14393][SQL][DOC] update doc for python and R

2016-11-03 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 91d567150 -> 8e145a94b [SPARK-14393][SQL][DOC] update doc for python and R ## What changes were proposed in this pull request? minor doc update that should go to master & branch-2.1 ## How was this patch tested? manual Author: Felix

spark git commit: [SPARK-14393][SQL][DOC] update doc for python and R

2016-11-03 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master aa412c55e -> a08463b1d [SPARK-14393][SQL][DOC] update doc for python and R ## What changes were proposed in this pull request? minor doc update that should go to master & branch-2.1 ## How was this patch tested? manual Author: Felix Che

spark git commit: [SPARK-18259][SQL] Do not capture Throwable in QueryExecution

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 37550c492 -> 91d567150 [SPARK-18259][SQL] Do not capture Throwable in QueryExecution ## What changes were proposed in this pull request? `QueryExecution.toString` currently captures `java.lang.Throwable`s; this is far from a best pract

spark git commit: [SPARK-18259][SQL] Do not capture Throwable in QueryExecution

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master dc4c60098 -> aa412c55e [SPARK-18259][SQL] Do not capture Throwable in QueryExecution ## What changes were proposed in this pull request? `QueryExecution.toString` currently captures `java.lang.Throwable`s; this is far from a best practice

spark git commit: [SPARK-18138][DOCS] Document that Java 7, Python 2.6, Scala 2.10, Hadoop < 2.6 are deprecated in Spark 2.1.0

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 af60b1ebb -> 37550c492 [SPARK-18138][DOCS] Document that Java 7, Python 2.6, Scala 2.10, Hadoop < 2.6 are deprecated in Spark 2.1.0 ## What changes were proposed in this pull request? Document that Java 7, Python 2.6, Scala 2.10, Hado

spark git commit: [SPARK-18138][DOCS] Document that Java 7, Python 2.6, Scala 2.10, Hadoop < 2.6 are deprecated in Spark 2.1.0

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master f22954ad4 -> dc4c60098 [SPARK-18138][DOCS] Document that Java 7, Python 2.6, Scala 2.10, Hadoop < 2.6 are deprecated in Spark 2.1.0 ## What changes were proposed in this pull request? Document that Java 7, Python 2.6, Scala 2.10, Hadoop <

spark git commit: [SPARK-18257][SS] Improve error reporting for FileStressSuite

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 2daca62cd -> af60b1ebb [SPARK-18257][SS] Improve error reporting for FileStressSuite ## What changes were proposed in this pull request? This patch improves error reporting for FileStressSuite, when there is an error in Spark itself (n

spark git commit: [SPARK-18257][SS] Improve error reporting for FileStressSuite

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master e89202523 -> f22954ad4 [SPARK-18257][SS] Improve error reporting for FileStressSuite ## What changes were proposed in this pull request? This patch improves error reporting for FileStressSuite, when there is an error in Spark itself (not u

spark git commit: [SPARKR][TEST] remove unnecessary suppressWarnings

2016-11-03 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 67659c9af -> e89202523 [SPARKR][TEST] remove unnecessary suppressWarnings ## What changes were proposed in this pull request? In test_mllib.R, there are two unnecessary suppressWarnings. This PR just removes them. ## How was this patch t

spark git commit: [SPARK-18212][SS][KAFKA] increase executor poll timeout

2016-11-03 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 098e4ca9c -> 67659c9af [SPARK-18212][SS][KAFKA] increase executor poll timeout ## What changes were proposed in this pull request? Increase poll timeout to try and address flaky test ## How was this patch tested? Ran existing unit tests

spark git commit: [SPARK-18212][SS][KAFKA] increase executor poll timeout

2016-11-03 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-2.1 569f77a11 -> 2daca62cd [SPARK-18212][SS][KAFKA] increase executor poll timeout ## What changes were proposed in this pull request? Increase poll timeout to try and address flaky test ## How was this patch tested? Ran existing unit te

spark git commit: [SPARK-18099][YARN] Fail if same files added to distributed cache for --files and --archives

2016-11-03 Thread tgraves
Repository: spark Updated Branches: refs/heads/branch-2.1 3e139e239 -> 569f77a11 [SPARK-18099][YARN] Fail if same files added to distributed cache for --files and --archives ## What changes were proposed in this pull request? During spark-submit, if yarn dist cache is instructed to add same

spark git commit: [SPARK-18099][YARN] Fail if same files added to distributed cache for --files and --archives

2016-11-03 Thread tgraves
Repository: spark Updated Branches: refs/heads/master 16293311c -> 098e4ca9c [SPARK-18099][YARN] Fail if same files added to distributed cache for --files and --archives ## What changes were proposed in this pull request? During spark-submit, if yarn dist cache is instructed to add same file

spark git commit: [SPARK-18237][HIVE] hive.exec.stagingdir have no effect

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master b17057c0a -> 16293311c [SPARK-18237][HIVE] hive.exec.stagingdir have no effect hive.exec.stagingdir have no effect in spark2.0.1, Hive confs in hive-site.xml will be loaded in `hadoopConf`, so we should use `hadoopConf` in `InsertIntoHiv

spark git commit: [SPARK-18237][HIVE] hive.exec.stagingdir have no effect

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 4f91630c8 -> 3e139e239 [SPARK-18237][HIVE] hive.exec.stagingdir have no effect hive.exec.stagingdir have no effect in spark2.0.1, Hive confs in hive-site.xml will be loaded in `hadoopConf`, so we should use `hadoopConf` in `InsertInt

spark git commit: [SPARK-18244][SQL] Rename partitionProviderIsHive -> tracksPartitionsInCatalog

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 c2876bfbf -> 4f91630c8 [SPARK-18244][SQL] Rename partitionProviderIsHive -> tracksPartitionsInCatalog ## What changes were proposed in this pull request? This patch renames partitionProviderIsHive to tracksPartitionsInCatalog, as the o

spark git commit: [SPARK-18244][SQL] Rename partitionProviderIsHive -> tracksPartitionsInCatalog

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master 27daf6bcd -> b17057c0a [SPARK-18244][SQL] Rename partitionProviderIsHive -> tracksPartitionsInCatalog ## What changes were proposed in this pull request? This patch renames partitionProviderIsHive to tracksPartitionsInCatalog, as the old n

spark git commit: [SPARK-17949][SQL] A JVM object based aggregate operator

2016-11-03 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 66a99f4a4 -> 27daf6bcd [SPARK-17949][SQL] A JVM object based aggregate operator ## What changes were proposed in this pull request? This PR adds a new hash-based aggregate operator named `ObjectHashAggregateExec` that supports `TypedImper

spark git commit: [SPARK-17981][SPARK-17957][SQL] Fix Incorrect Nullability Setting to False in FilterExec

2016-11-03 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master 9dc9f9a5d -> 66a99f4a4 [SPARK-17981][SPARK-17957][SQL] Fix Incorrect Nullability Setting to False in FilterExec ### What changes were proposed in this pull request? When `FilterExec` contains `isNotNull`, which could be inferred and pushe

spark git commit: [SPARK-17981][SPARK-17957][SQL] Fix Incorrect Nullability Setting to False in FilterExec

2016-11-03 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/branch-2.1 99891e56e -> c2876bfbf [SPARK-17981][SPARK-17957][SQL] Fix Incorrect Nullability Setting to False in FilterExec ### What changes were proposed in this pull request? When `FilterExec` contains `isNotNull`, which could be inferred and p

spark git commit: [SPARK-18177][ML][PYSPARK] Add missing 'subsamplingRate' of pyspark GBTClassifier

2016-11-03 Thread yliang
Repository: spark Updated Branches: refs/heads/branch-2.1 71104c9c9 -> 99891e56e [SPARK-18177][ML][PYSPARK] Add missing 'subsamplingRate' of pyspark GBTClassifier ## What changes were proposed in this pull request? Add missing 'subsamplingRate' of pyspark GBTClassifier ## How was this patch

spark git commit: [SPARK-18177][ML][PYSPARK] Add missing 'subsamplingRate' of pyspark GBTClassifier

2016-11-03 Thread yliang
Repository: spark Updated Branches: refs/heads/master 0ea5d5b24 -> 9dc9f9a5d [SPARK-18177][ML][PYSPARK] Add missing 'subsamplingRate' of pyspark GBTClassifier ## What changes were proposed in this pull request? Add missing 'subsamplingRate' of pyspark GBTClassifier ## How was this patch test

spark git commit: [SQL] minor - internal doc improvement for InsertIntoTable.

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 bc7f05f5f -> 71104c9c9 [SQL] minor - internal doc improvement for InsertIntoTable. ## What changes were proposed in this pull request? I was reading this part of the code and was really confused by the "partition" parameter. This patch

spark git commit: [SQL] minor - internal doc improvement for InsertIntoTable.

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master 937af592e -> 0ea5d5b24 [SQL] minor - internal doc improvement for InsertIntoTable. ## What changes were proposed in this pull request? I was reading this part of the code and was really confused by the "partition" parameter. This patch add

spark git commit: [SPARK-18219] Move commit protocol API (internal) from sql/core to core module

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.1 c4c5328f2 -> bc7f05f5f [SPARK-18219] Move commit protocol API (internal) from sql/core to core module ## What changes were proposed in this pull request? This patch moves the new commit protocol API from sql/core to core module, so we

spark git commit: [SPARK-18219] Move commit protocol API (internal) from sql/core to core module

2016-11-03 Thread rxin
Repository: spark Updated Branches: refs/heads/master 96cc1b567 -> 937af592e [SPARK-18219] Move commit protocol API (internal) from sql/core to core module ## What changes were proposed in this pull request? This patch moves the new commit protocol API from sql/core to core module, so we can

spark git commit: [SPARK-17122][SQL] support drop current database

2016-11-03 Thread lixiao
Repository: spark Updated Branches: refs/heads/master d24e73647 -> 96cc1b567 [SPARK-17122][SQL] support drop current database ## What changes were proposed in this pull request? In Spark 1.6 and earlier, we can drop the database we are using. In Spark 2.0, native implementation prevent us fr

spark git commit: [SPARK-17122][SQL] support drop current database

2016-11-03 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.1 965c964c2 -> c4c5328f2 [SPARK-17122][SQL] support drop current database ## What changes were proposed in this pull request? In Spark 1.6 and earlier, we can drop the database we are using. In Spark 2.0, native implementation prevent u