spark git commit: [SPARK-21566][SQL][PYTHON] Python method for summary

2017-08-18 Thread holden
Repository: spark Updated Branches: refs/heads/master a2db5c576 -> 10be01848 [SPARK-21566][SQL][PYTHON] Python method for summary ## What changes were proposed in this pull request? Adds the recently added `summary` method to the python dataframe interface. ## How was this patch tested?

spark git commit: [MINOR][TYPO] Fix typos: runnning and Excecutors

2017-08-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master 7880909c4 -> a2db5c576 [MINOR][TYPO] Fix typos: runnning and Excecutors ## What changes were proposed in this pull request? Fix typos ## How was this patch tested? Existing tests Author: Andrew Ash Closes #18996

spark git commit: [SPARK-21743][SQL][FOLLOW-UP] top-most limit should not cause memory leak

2017-08-18 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 23ea89808 -> 7880909c4 [SPARK-21743][SQL][FOLLOW-UP] top-most limit should not cause memory leak ## What changes were proposed in this pull request? This is a follow-up of https://github.com/apache/spark/pull/18955 , to fix a bug that we

spark git commit: [SPARK-21213][SQL] Support collecting partition-level statistics: rowCount and sizeInBytes

2017-08-18 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 07a2b8738 -> 23ea89808 [SPARK-21213][SQL] Support collecting partition-level statistics: rowCount and sizeInBytes ## What changes were proposed in this pull request? Added support for ANALYZE TABLE [db_name].tablename PARTITION

spark git commit: [SPARK-21778][SQL] Simpler Dataset.sample API in Scala / Java

2017-08-18 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 310454be3 -> 07a2b8738 [SPARK-21778][SQL] Simpler Dataset.sample API in Scala / Java ## What changes were proposed in this pull request? Dataset.sample requires a boolean flag withReplacement as the first argument. However, most of the