spark git commit: [SPARK-10264][DOCUMENTATION] Added @Since to ml.recomendation

2016-01-18 Thread srowen
Repository: spark Updated Branches: refs/heads/master bc36b0f1a -> 233d6cee9 [SPARK-10264][DOCUMENTATION] Added @Since to ml.recomendation I create new pr since original pr long time no update. Please help to review. srowen Author: Tommy YU Closes #10756 from

spark git commit: [SPARK-12558][FOLLOW-UP] AnalysisException when multiple functions applied in GROUP BY clause

2016-01-18 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 233d6cee9 -> db9a86058 [SPARK-12558][FOLLOW-UP] AnalysisException when multiple functions applied in GROUP BY clause Addresses the comments from Yin. https://github.com/apache/spark/pull/10520 Author: Dilip Biswal

spark git commit: [SPARK-12558][FOLLOW-UP] AnalysisException when multiple functions applied in GROUP BY clause

2016-01-18 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.6 5803fce90 -> 53184ce77 [SPARK-12558][FOLLOW-UP] AnalysisException when multiple functions applied in GROUP BY clause Addresses the comments from Yin. https://github.com/apache/spark/pull/10520 Author: Dilip Biswal

spark git commit: [SPARK-12873][SQL] Add more comment in HiveTypeCoercion for type widening

2016-01-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master db9a86058 -> 44fcf992a [SPARK-12873][SQL] Add more comment in HiveTypeCoercion for type widening I was reading this part of the analyzer code again and got confused by the difference between findWiderTypeForTwo and

spark git commit: [SPARK-12889][SQL] Rename ParserDialect -> ParserInterface.

2016-01-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master 721845c1b -> 39ac56fc6 [SPARK-12889][SQL] Rename ParserDialect -> ParserInterface. Based on discussions in #10801, I'm submitting a pull request to rename ParserDialect to ParserInterface. Author: Reynold Xin

spark git commit: [SPARK-12814][DOCUMENT] Add deploy instructions for Python in flume integration doc

2016-01-18 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-1.6 8c2b67f55 -> 7482c7b5a [SPARK-12814][DOCUMENT] Add deploy instructions for Python in flume integration doc This PR added instructions to get flume assembly jar for Python users in the flume integration page like Kafka doc. Author:

spark git commit: [SPARK-12814][DOCUMENT] Add deploy instructions for Python in flume integration doc

2016-01-18 Thread tdas
Repository: spark Updated Branches: refs/heads/master 404190221 -> a973f483f [SPARK-12814][DOCUMENT] Add deploy instructions for Python in flume integration doc This PR added instructions to get flume assembly jar for Python users in the flume integration page like Kafka doc. Author:

spark git commit: [SPARK-12894][DOCUMENT] Add deploy instructions for Python in Kinesis integration doc

2016-01-18 Thread tdas
Repository: spark Updated Branches: refs/heads/branch-1.6 7482c7b5a -> d43704d7f [SPARK-12894][DOCUMENT] Add deploy instructions for Python in Kinesis integration doc This PR added instructions to get Kinesis assembly jar for Python users in the Kinesis integration page like Kafka doc.

spark git commit: [SPARK-12894][DOCUMENT] Add deploy instructions for Python in Kinesis integration doc

2016-01-18 Thread tdas
Repository: spark Updated Branches: refs/heads/master 4bcea1b85 -> 721845c1b [SPARK-12894][DOCUMENT] Add deploy instructions for Python in Kinesis integration doc This PR added instructions to get Kinesis assembly jar for Python users in the Kinesis integration page like Kafka doc. Author:

spark git commit: Revert "[SPARK-12829] Turn Java style checker on"

2016-01-18 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master a973f483f -> 4bcea1b85 Revert "[SPARK-12829] Turn Java style checker on" This reverts commit 591c88c9e2a6c2e2ca84f1b66c635f198a16d112. `lint-java` doesn't work on a machine with a clean Maven cache. Project:

spark git commit: [SPARK-12700] [SQL] embed condition into SMJ and BroadcastHashJoin

2016-01-18 Thread davies
Repository: spark Updated Branches: refs/heads/master 39ac56fc6 -> 323d51f1d [SPARK-12700] [SQL] embed condition into SMJ and BroadcastHashJoin Currently SortMergeJoin and BroadcastHashJoin do not support condition, the need a followed Filter for that, the result projection to generate

spark git commit: [SPARK-12885][MINOR] Rename 3 fields in ShuffleWriteMetrics

2016-01-18 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master 323d51f1d -> 2b5d11f34 [SPARK-12885][MINOR] Rename 3 fields in ShuffleWriteMetrics This is a small step in implementing SPARK-10620, which migrates TaskMetrics to accumulators. This patch is strictly a cleanup patch and introduces no

spark git commit: [SPARK-12841][SQL][BRANCH-1.6] fix cast in filter

2016-01-18 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.6 d43704d7f -> 68265ac23 [SPARK-12841][SQL][BRANCH-1.6] fix cast in filter In SPARK-10743 we wrap cast with `UnresolvedAlias` to give `Cast` a better alias if possible. However, for cases like filter, the `UnresolvedAlias` can't be

spark git commit: [HOT][BUILD] Changed the import order

2016-01-18 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master 2b5d11f34 -> 74ba84b64 [HOT][BUILD] Changed the import order This PR is to fix the master's build break. The following tests failed due to the import order issues in the master.

spark git commit: [SPARK-12668][SQL] Providing aliases for CSV options to be similar to Pandas and R

2016-01-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master 74ba84b64 -> 453dae567 [SPARK-12668][SQL] Providing aliases for CSV options to be similar to Pandas and R https://issues.apache.org/jira/browse/SPARK-12668 Spark CSV datasource has been being merged (filed in

spark git commit: [SPARK-12346][ML] Missing attribute names in GLM for vector-type features

2016-01-18 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.6 53184ce77 -> 8c2b67f55 [SPARK-12346][ML] Missing attribute names in GLM for vector-type features Currently `summary()` fails on a GLM model fitted over a vector feature missing ML attrs, since the output feature attrs will also have

spark git commit: [SPARK-12346][ML] Missing attribute names in GLM for vector-type features

2016-01-18 Thread meng
Repository: spark Updated Branches: refs/heads/master 44fcf992a -> 5e492e9d5 [SPARK-12346][ML] Missing attribute names in GLM for vector-type features Currently `summary()` fails on a GLM model fitted over a vector feature missing ML attrs, since the output feature attrs will also have no

spark git commit: [SPARK-12855][SQL] Remove parser dialect developer API

2016-01-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master b8cb548a4 -> 38c3c0e31 [SPARK-12855][SQL] Remove parser dialect developer API This pull request removes the public developer parser API for external parsers. Given everything a parser depends on (e.g. logical plans and expressions) are

spark git commit: [SPARK-10985][CORE] Avoid passing evicted blocks throughout BlockManager

2016-01-18 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 302bb569f -> b8cb548a4 [SPARK-10985][CORE] Avoid passing evicted blocks throughout BlockManager This patch refactors portions of the BlockManager and CacheManager in order to avoid having to pass `evictedBlocks` lists throughout the code.

spark git commit: [SPARK-12884] Move classes to their own files for readability

2016-01-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master 5e492e9d5 -> 302bb569f [SPARK-12884] Move classes to their own files for readability This is a small step in implementing SPARK-10620, which migrates `TaskMetrics` to accumulators. This patch is strictly a cleanup patch and introduces no

spark git commit: [SPARK-12841][SQL] fix cast in filter

2016-01-18 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 38c3c0e31 -> 4f11e3f2a [SPARK-12841][SQL] fix cast in filter In SPARK-10743 we wrap cast with `UnresolvedAlias` to give `Cast` a better alias if possible. However, for cases like `filter`, the `UnresolvedAlias` can't be resolved and

spark git commit: [SPARK-12882][SQL] simplify bucket tests and add more comments

2016-01-18 Thread rxin
Repository: spark Updated Branches: refs/heads/master 4f11e3f2a -> 404190221 [SPARK-12882][SQL] simplify bucket tests and add more comments Right now, the bucket tests are kind of hard to understand, this PR simplifies them and add more commetns. Author: Wenchen Fan