spark git commit: [SPARK-17492][SQL] Fix Reading Cataloged Data Sources without Extending SchemaRelationProvider

2016-09-21 Thread wenchen
Repository: spark Updated Branches: refs/heads/master cb324f611 -> 3a80f92f8 [SPARK-17492][SQL] Fix Reading Cataloged Data Sources without Extending SchemaRelationProvider ### What changes were proposed in this pull request? For data sources without extending `SchemaRelationProvider`, we

spark git commit: [SPARK-17425][SQL] Override sameResult in HiveTableScanExec to make ReuseExchange work in text format table

2016-09-21 Thread wenchen
Repository: spark Updated Branches: refs/heads/master b50b34f56 -> cb324f611 [SPARK-17425][SQL] Override sameResult in HiveTableScanExec to make ReuseExchange work in text format table ## What changes were proposed in this pull request? The PR will override the `sameResult` in

spark git commit: [SPARK-17609][SQL] SessionCatalog.tableExists should not check temp view

2016-09-21 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 8bde03bf9 -> b50b34f56 [SPARK-17609][SQL] SessionCatalog.tableExists should not check temp view ## What changes were proposed in this pull request? After #15054 , there is no place in Spark SQL that need `SessionCatalog.tableExists` to

[spark] Git Push Summary

2016-09-21 Thread pwendell
Repository: spark Updated Tags: refs/tags/v2.0.1-rc1 [created] 00f2e28ed - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[2/2] spark git commit: Preparing development version 2.0.2-SNAPSHOT

2016-09-21 Thread pwendell
Preparing development version 2.0.2-SNAPSHOT Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/e8b26be9 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/e8b26be9 Diff:

[1/2] spark git commit: Preparing Spark release v2.0.1-rc1

2016-09-21 Thread pwendell
Repository: spark Updated Branches: refs/heads/branch-2.0 053b20a79 -> e8b26be9b Preparing Spark release v2.0.1-rc1 Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/00f2e28e Tree:

spark git commit: Bump doc version for release 2.0.1.

2016-09-21 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 ec377e773 -> 053b20a79 Bump doc version for release 2.0.1. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/053b20a7 Tree:

spark git commit: [SPARK-17494][SQL] changePrecision() on compact decimal should respect rounding mode

2016-09-21 Thread rxin
Repository: spark Updated Branches: refs/heads/master 3497ebe51 -> 8bde03bf9 [SPARK-17494][SQL] changePrecision() on compact decimal should respect rounding mode ## What changes were proposed in this pull request? Floor()/Ceil() of decimal is implemented using changePrecision() by passing a

spark git commit: [SPARK-17494][SQL] changePrecision() on compact decimal should respect rounding mode

2016-09-21 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 966abd6af -> ec377e773 [SPARK-17494][SQL] changePrecision() on compact decimal should respect rounding mode ## What changes were proposed in this pull request? Floor()/Ceil() of decimal is implemented using changePrecision() by

spark git commit: [SPARK-17627] Mark Streaming Providers Experimental

2016-09-21 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 59e6ab11a -> 966abd6af [SPARK-17627] Mark Streaming Providers Experimental All of structured streaming is experimental in its first release. We missed the annotation on two of the APIs. Author: Michael Armbrust

spark git commit: [SPARK-17627] Mark Streaming Providers Experimental

2016-09-21 Thread rxin
Repository: spark Updated Branches: refs/heads/master 6902edab7 -> 3497ebe51 [SPARK-17627] Mark Streaming Providers Experimental All of structured streaming is experimental in its first release. We missed the annotation on two of the APIs. Author: Michael Armbrust

spark git commit: [SPARK-17315][FOLLOW-UP][SPARKR][ML] Fix print of Kolmogorov-Smirnov test summary

2016-09-21 Thread yliang
Repository: spark Updated Branches: refs/heads/master c133907c5 -> 6902edab7 [SPARK-17315][FOLLOW-UP][SPARKR][ML] Fix print of Kolmogorov-Smirnov test summary ## What changes were proposed in this pull request? #14881 added Kolmogorov-Smirnov Test wrapper to SparkR. I found that

spark git commit: [SPARK-17569] Make StructuredStreaming FileStreamSource batch generation faster

2016-09-21 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 8c3ee2bc4 -> 7cbe21644 [SPARK-17569] Make StructuredStreaming FileStreamSource batch generation faster ## What changes were proposed in this pull request? While getting the batch for a `FileStreamSource` in StructuredStreaming, we know

spark git commit: [SPARK-17512][CORE] Avoid formatting to python path for yarn and mesos cluster mode

2016-09-21 Thread andrewor14
Repository: spark Updated Branches: refs/heads/branch-2.0 cd0bd89d7 -> 59e6ab11a [SPARK-17512][CORE] Avoid formatting to python path for yarn and mesos cluster mode ## What changes were proposed in this pull request? Yarn and mesos cluster mode support remote python path (HDFS/S3 scheme) by

spark git commit: [SPARK-17512][CORE] Avoid formatting to python path for yarn and mesos cluster mode

2016-09-21 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 9fcf1c51d -> 8c3ee2bc4 [SPARK-17512][CORE] Avoid formatting to python path for yarn and mesos cluster mode ## What changes were proposed in this pull request? Yarn and mesos cluster mode support remote python path (HDFS/S3 scheme) by

spark git commit: [SPARK-17623][CORE] Clarify type of TaskEndReason with a failed task.

2016-09-21 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 2cd1bfa4f -> 9fcf1c51d [SPARK-17623][CORE] Clarify type of TaskEndReason with a failed task. ## What changes were proposed in this pull request? In TaskResultGetter, enqueueFailedTask currently deserializes the result as a TaskEndReason.

spark git commit: [SPARK-4563][CORE] Allow driver to advertise a different network address.

2016-09-21 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master b4a4421b6 -> 2cd1bfa4f [SPARK-4563][CORE] Allow driver to advertise a different network address. The goal of this feature is to allow the Spark driver to run in an isolated environment, such as a docker container, and be able to use the

spark git commit: [SPARK-17418] Prevent kinesis-asl-assembly artifacts from being published

2016-09-21 Thread joshrosen
Repository: spark Updated Branches: refs/heads/branch-1.6 8f88412c3 -> ce0a222f5 [SPARK-17418] Prevent kinesis-asl-assembly artifacts from being published This patch updates the `kinesis-asl-assembly` build to prevent that module from being published as part of Maven releases and snapshot

spark git commit: [SPARK-11918][ML] Better error from WLS for cases like singular input

2016-09-21 Thread dbtsai
Repository: spark Updated Branches: refs/heads/master d7ee12211 -> b4a4421b6 [SPARK-11918][ML] Better error from WLS for cases like singular input ## What changes were proposed in this pull request? Update error handling for Cholesky decomposition to provide a little more info when input is

spark git commit: [SPARK-17418] Prevent kinesis-asl-assembly artifacts from being published

2016-09-21 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master 248922fd4 -> d7ee12211 [SPARK-17418] Prevent kinesis-asl-assembly artifacts from being published This patch updates the `kinesis-asl-assembly` build to prevent that module from being published as part of Maven releases and snapshot

spark git commit: [BACKPORT 2.0][MINOR][BUILD] Fix CheckStyle Error

2016-09-21 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 65295bad9 -> 45bccdd9c [BACKPORT 2.0][MINOR][BUILD] Fix CheckStyle Error ## What changes were proposed in this pull request? This PR is to fix the code style errors. ## How was this patch tested? Manual. Before: ``` ./dev/lint-java

spark git commit: [SPARK-17590][SQL] Analyze CTE definitions at once and allow CTE subquery to define CTE

2016-09-21 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master dd7561d33 -> 248922fd4 [SPARK-17590][SQL] Analyze CTE definitions at once and allow CTE subquery to define CTE ## What changes were proposed in this pull request? We substitute logical plan with CTE definitions in the analyzer rule

spark git commit: [CORE][MINOR] Add minor code change to TaskState and Task

2016-09-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master 25a020be9 -> dd7561d33 [CORE][MINOR] Add minor code change to TaskState and Task ## What changes were proposed in this pull request? - TaskState and ExecutorState expose isFailed and isFinished functions. It can be useful to add test

spark git commit: [SPARK-17583][SQL] Remove uesless rowSeparator variable and set auto-expanding buffer as default for maxCharsPerColumn option in CSV

2016-09-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master 57dc326bd -> 25a020be9 [SPARK-17583][SQL] Remove uesless rowSeparator variable and set auto-expanding buffer as default for maxCharsPerColumn option in CSV ## What changes were proposed in this pull request? This PR includes the changes

spark git commit: [SPARK-17219][ML] Add NaN value handling in Bucketizer

2016-09-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master b366f1849 -> 57dc326bd [SPARK-17219][ML] Add NaN value handling in Bucketizer ## What changes were proposed in this pull request? This PR fixes an issue when Bucketizer is called to handle a dataset containing NaN value. Sometimes, null

spark git commit: [SPARK-17017][MLLIB][ML] add a chiSquare Selector based on False Positive Rate (FPR) test

2016-09-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master 28fafa3ee -> b366f1849 [SPARK-17017][MLLIB][ML] add a chiSquare Selector based on False Positive Rate (FPR) test ## What changes were proposed in this pull request? Univariate feature selection works by selecting the best features based

spark git commit: [SPARK-17599] Prevent ListingFileCatalog from failing if path doesn't exist

2016-09-21 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 3977223a3 -> 28fafa3ee [SPARK-17599] Prevent ListingFileCatalog from failing if path doesn't exist ## What changes were proposed in this pull request? The `ListingFileCatalog` lists files given a set of resolved paths. If a folder is

spark git commit: [SPARK-17617][SQL] Remainder(%) expression.eval returns incorrect result on double value

2016-09-21 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-1.6 8646b84fb -> 8f88412c3 [SPARK-17617][SQL] Remainder(%) expression.eval returns incorrect result on double value ## What changes were proposed in this pull request? Remainder(%) expression's `eval()` returns incorrect result when the

spark git commit: [SPARK-17617][SQL] Remainder(%) expression.eval returns incorrect result on double value

2016-09-21 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.0 726f05716 -> 65295bad9 [SPARK-17617][SQL] Remainder(%) expression.eval returns incorrect result on double value ## What changes were proposed in this pull request? Remainder(%) expression's `eval()` returns incorrect result when the

spark git commit: [SPARK-17617][SQL] Remainder(%) expression.eval returns incorrect result on double value

2016-09-21 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 7654385f2 -> 3977223a3 [SPARK-17617][SQL] Remainder(%) expression.eval returns incorrect result on double value ## What changes were proposed in this pull request? Remainder(%) expression's `eval()` returns incorrect result when the

spark git commit: [SPARK-17595][MLLIB] Use a bounded priority queue to find synonyms in Word2VecModel

2016-09-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master d3b886976 -> 7654385f2 [SPARK-17595][MLLIB] Use a bounded priority queue to find synonyms in Word2VecModel ## What changes were proposed in this pull request? The code in `Word2VecModel.findSynonyms` to choose the vocabulary elements

spark git commit: [SPARK-17585][PYSPARK][CORE] PySpark SparkContext.addFile supports adding files recursively

2016-09-21 Thread yliang
Repository: spark Updated Branches: refs/heads/master 61876a427 -> d3b886976 [SPARK-17585][PYSPARK][CORE] PySpark SparkContext.addFile supports adding files recursively ## What changes were proposed in this pull request? Users would like to add a directory as dependency in some cases, they

spark git commit: [CORE][DOC] Fix errors in comments

2016-09-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master e48ebc4e4 -> 61876a427 [CORE][DOC] Fix errors in comments ## What changes were proposed in this pull request? While reading source code of CORE and SQL core, I found some minor errors in comments such as extra space, missing blank line

spark-website git commit: Add Israel Spark meetup to community page per request. Use https for meetup while we're here. Pick up a recent change to paper hyperlink reflected only in markdown, not HTML

2016-09-21 Thread srowen
Repository: spark-website Updated Branches: refs/heads/asf-site eee58685c -> 7c96b646e Add Israel Spark meetup to community page per request. Use https for meetup while we're here. Pick up a recent change to paper hyperlink reflected only in markdown, not HTML Project: