spark git commit: [SPARK-22601][SQL] Data load is getting displayed successful on providing non existing nonlocal file path

2017-11-30 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.2 af8a692d6 -> ba00bd961 [SPARK-22601][SQL] Data load is getting displayed successful on providing non existing nonlocal file path ## What changes were proposed in this pull request? When user tries to load data with a non existing hdfs

spark git commit: [SPARK-22601][SQL] Data load is getting displayed successful on providing non existing nonlocal file path

2017-11-30 Thread lixiao
Repository: spark Updated Branches: refs/heads/master dc365422b -> 16adaf634 [SPARK-22601][SQL] Data load is getting displayed successful on providing non existing nonlocal file path ## What changes were proposed in this pull request? When user tries to load data with a non existing hdfs

spark git commit: [SPARK-22653] executorAddress registered in CoarseGrainedSchedulerBac…

2017-11-30 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.2 0121ebc83 -> af8a692d6 [SPARK-22653] executorAddress registered in CoarseGrainedSchedulerBac… https://issues.apache.org/jira/browse/SPARK-22653 executorRef.address can be null, pass the executorAddress which accounts for it being

spark git commit: [SPARK-22653] executorAddress registered in CoarseGrainedSchedulerBac…

2017-11-30 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 7da1f5708 -> dc365422b [SPARK-22653] executorAddress registered in CoarseGrainedSchedulerBac… https://issues.apache.org/jira/browse/SPARK-22653 executorRef.address can be null, pass the executorAddress which accounts for it being null a

spark git commit: [SPARK-22373] Bump Janino dependency version to fix thread safety issue…

2017-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master 7e5f669eb -> 7da1f5708 [SPARK-22373] Bump Janino dependency version to fix thread safety issue… … with Janino when compiling generated code. ## What changes were proposed in this pull request? Bump up Janino dependency version to fix

spark git commit: [SPARK-22373] Bump Janino dependency version to fix thread safety issue…

2017-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.1 a02a8bd23 -> 4e3680f22 [SPARK-22373] Bump Janino dependency version to fix thread safety issue… … with Janino when compiling generated code. ## What changes were proposed in this pull request? Bump up Janino dependency version to

spark git commit: [SPARK-22373] Bump Janino dependency version to fix thread safety issue…

2017-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.2 d7b14746d -> 0121ebc83 [SPARK-22373] Bump Janino dependency version to fix thread safety issue… … with Janino when compiling generated code. ## What changes were proposed in this pull request? Bump up Janino dependency version to

spark git commit: [SPARK-22428][DOC] Add spark application garbage collector configurat…

2017-11-30 Thread srowen
Repository: spark Updated Branches: refs/heads/master f5f8e84d9 -> 7e5f669eb [SPARK-22428][DOC] Add spark application garbage collector configurat… ## What changes were proposed in this pull request? The spark properties for configuring the ContextCleaner are not documented in the

spark git commit: [SPARK-22614] Dataset API: repartitionByRange(...)

2017-11-30 Thread lixiao
Repository: spark Updated Branches: refs/heads/master bcceab649 -> f5f8e84d9 [SPARK-22614] Dataset API: repartitionByRange(...) ## What changes were proposed in this pull request? This PR introduces a way to explicitly range-partition a Dataset. So far, only round-robin and hash

spark git commit: [SPARK-22489][SQL] Shouldn't change broadcast join buildSide if user clearly specified

2017-11-30 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 6ac57fd0d -> bcceab649 [SPARK-22489][SQL] Shouldn't change broadcast join buildSide if user clearly specified ## What changes were proposed in this pull request? How to reproduce: ```scala import

spark git commit: [SPARK-21417][SQL] Infer join conditions using propagated constraints

2017-11-30 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 999ec137a -> 6ac57fd0d [SPARK-21417][SQL] Infer join conditions using propagated constraints ## What changes were proposed in this pull request? This PR adds an optimization rule that infers join conditions using propagated constraints.

spark git commit: [SPARK-22570][SQL] Avoid to create a lot of global variables by using a local variable with allocation of an object in generated code

2017-11-30 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 932bd09c8 -> 999ec137a [SPARK-22570][SQL] Avoid to create a lot of global variables by using a local variable with allocation of an object in generated code ## What changes were proposed in this pull request? This PR reduces # of global

spark git commit: [SPARK-22635][SQL][ORC] FileNotFoundException while reading ORC files containing special characters

2017-11-30 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 6eb203fae -> 932bd09c8 [SPARK-22635][SQL][ORC] FileNotFoundException while reading ORC files containing special characters ## What changes were proposed in this pull request? SPARK-22146 fix the FileNotFoundException issue only for the

spark git commit: [SPARK-22654][TESTS] Retry Spark tarball download if failed in HiveExternalCatalogVersionsSuite

2017-11-30 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.2 38a0532cf -> d7b14746d [SPARK-22654][TESTS] Retry Spark tarball download if failed in HiveExternalCatalogVersionsSuite ## What changes were proposed in this pull request? Adds a simple loop to retry download of Spark tarballs from

spark git commit: [SPARK-22654][TESTS] Retry Spark tarball download if failed in HiveExternalCatalogVersionsSuite

2017-11-30 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 9c29c5576 -> 6eb203fae [SPARK-22654][TESTS] Retry Spark tarball download if failed in HiveExternalCatalogVersionsSuite ## What changes were proposed in this pull request? Adds a simple loop to retry download of Spark tarballs from

spark git commit: [SPARK-22643][SQL] ColumnarArray should be an immutable view

2017-11-30 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 444a2bbb6 -> 9c29c5576 [SPARK-22643][SQL] ColumnarArray should be an immutable view ## What changes were proposed in this pull request? To make `ColumnVector` public, `ColumnarArray` need to be public too, and we should not have mutable

spark git commit: [SPARK-22652][SQL] remove set methods in ColumnarRow

2017-11-30 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 92cfbeeb5 -> 444a2bbb6 [SPARK-22652][SQL] remove set methods in ColumnarRow ## What changes were proposed in this pull request? As a step to make `ColumnVector` public, the `ColumnarRow` returned by `ColumnVector#getStruct` should be