spark git commit: [SPARK-22489][DOC][FOLLOWUP] Update broadcast behavior changes in migration section

2017-12-03 Thread lixiao
Repository: spark Updated Branches: refs/heads/master dff440f1e -> 4131ad03f [SPARK-22489][DOC][FOLLOWUP] Update broadcast behavior changes in migration section ## What changes were proposed in this pull request? Update broadcast behavior changes in migration section. ## How was this patch

spark git commit: [SPARK-22626][SQL] deals with wrong Hive's statistics (zero rowCount)

2017-12-03 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 2c16267f7 -> dff440f1e [SPARK-22626][SQL] deals with wrong Hive's statistics (zero rowCount) This pr to ensure that the Hive's statistics `totalSize` (or `rawDataSize`) > 0, `rowCount` also must be > 0. Otherwise may cause OOM when CBO is

spark git commit: [SPARK-22669][SQL] Avoid unnecessary function calls in code generation

2017-12-03 Thread wenchen
Repository: spark Updated Branches: refs/heads/master f23dddf10 -> 2c16267f7 [SPARK-22669][SQL] Avoid unnecessary function calls in code generation ## What changes were proposed in this pull request? In many parts of the codebase for code generation, we are splitting the code to avoid

spark git commit: [SPARK-20682][SPARK-15474][SPARK-21791] Add new ORCFileFormat based on ORC 1.4.1

2017-12-03 Thread wenchen
Repository: spark Updated Branches: refs/heads/master d2cf95aa6 -> f23dddf10 [SPARK-20682][SPARK-15474][SPARK-21791] Add new ORCFileFormat based on ORC 1.4.1 ## What changes were proposed in this pull request? Since [SPARK-2883](https://issues.apache.org/jira/browse/SPARK-2883), Apache