spark git commit: [SPARK-20262][SQL] AssertNotNull should throw NullPointerException

2017-04-07 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.1 fc242ccf4 -> 658b35885 [SPARK-20262][SQL] AssertNotNull should throw NullPointerException AssertNotNull currently throws RuntimeException. It should throw NullPointerException, which is more specific. N/A Author: Reynold Xin

spark git commit: [SPARK-20262][SQL] AssertNotNull should throw NullPointerException

2017-04-07 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 7577e9c35 -> e1afc4dcc [SPARK-20262][SQL] AssertNotNull should throw NullPointerException ## What changes were proposed in this pull request? AssertNotNull currently throws RuntimeException. It should throw NullPointerException, which is

spark git commit: [SPARK-20246][SQL] should not push predicate down through aggregate with non-deterministic expressions

2017-04-07 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.0 9016e17af -> a0b499f60 [SPARK-20246][SQL] should not push predicate down through aggregate with non-deterministic expressions ## What changes were proposed in this pull request? Similar to `Project`, when `Aggregate` has

spark git commit: [SPARK-20246][SQL] should not push predicate down through aggregate with non-deterministic expressions

2017-04-07 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.1 77911201c -> fc242ccf4 [SPARK-20246][SQL] should not push predicate down through aggregate with non-deterministic expressions ## What changes were proposed in this pull request? Similar to `Project`, when `Aggregate` has

spark git commit: [SPARK-20246][SQL] should not push predicate down through aggregate with non-deterministic expressions

2017-04-07 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 589f3edb8 -> 7577e9c35 [SPARK-20246][SQL] should not push predicate down through aggregate with non-deterministic expressions ## What changes were proposed in this pull request? Similar to `Project`, when `Aggregate` has

spark git commit: [SPARK-20255] Move listLeafFiles() to InMemoryFileIndex

2017-04-07 Thread rxin
Repository: spark Updated Branches: refs/heads/master 1ad73f0a2 -> 589f3edb8 [SPARK-20255] Move listLeafFiles() to InMemoryFileIndex ## What changes were proposed in this pull request? Trying to get a grip on the `FileIndex` hierarchy, I was confused by the following inconsistency: On the

spark git commit: [SPARK-20258][DOC][SPARKR] Fix SparkR logistic regression example in programming guide (did not converge)

2017-04-07 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 8feb799af -> 1ad73f0a2 [SPARK-20258][DOC][SPARKR] Fix SparkR logistic regression example in programming guide (did not converge) ## What changes were proposed in this pull request? SparkR logistic regression example did not converge in

spark git commit: [SPARK-20197][SPARKR] CRAN check fail with package installation

2017-04-07 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 870b9d9aa -> 8feb799af [SPARK-20197][SPARKR] CRAN check fail with package installation ## What changes were proposed in this pull request? Test failed because SPARK_HOME is not set before Spark is installed. Author: Felix Cheung

spark git commit: [SPARK-20026][DOC][SPARKR] Add Tweedie example for SparkR in programming guide

2017-04-07 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 9e0893b53 -> 870b9d9aa [SPARK-20026][DOC][SPARKR] Add Tweedie example for SparkR in programming guide ## What changes were proposed in this pull request? Add Tweedie example for SparkR in programming guide. The doc was already updated in

spark git commit: [SPARK-20218][DOC][APP-ID] applications//stages' in REST API, add description.

2017-04-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.1 fb81a412e -> 77911201c [SPARK-20218][DOC][APP-ID] applications//stages' in REST API, add description. ## What changes were proposed in this pull request? 1. '/applications/[app-id]/stages' in rest api.status should add description

spark git commit: [SPARK-20218][DOC][APP-ID] applications//stages' in REST API, add description.

2017-04-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1a52a6237 -> 9e0893b53 [SPARK-20218][DOC][APP-ID] applications//stages' in REST API, add description. ## What changes were proposed in this pull request? 1. '/applications/[app-id]/stages' in rest api.status should add description

spark git commit: [SPARK-20076][ML][PYSPARK] Add Python interface for ml.stats.Correlation

2017-04-07 Thread mlnick
Repository: spark Updated Branches: refs/heads/master ad3cc1312 -> 1a52a6237 [SPARK-20076][ML][PYSPARK] Add Python interface for ml.stats.Correlation ## What changes were proposed in this pull request? The Dataframes-based support for the correlation statistics is added in #17108. This

spark git commit: [SPARK-20245][SQL][MINOR] pass output to LogicalRelation directly

2017-04-07 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 626b4cafc -> ad3cc1312 [SPARK-20245][SQL][MINOR] pass output to LogicalRelation directly ## What changes were proposed in this pull request? Currently `LogicalRelation` has an `expectedOutputAttributes` parameter, which makes it hard to