spark git commit: [SPARK-14134][CORE] Change the package name used for shading classes.

2016-04-06 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master d76592276 -> 21d5ca128 [SPARK-14134][CORE] Change the package name used for shading classes. The current package name uses a dash, which is a little weird but seemed to work. That is, until a new test tried to mock a class that references

spark git commit: [SPARK-12610][SQL] Left Anti Join

2016-04-06 Thread rxin
Repository: spark Updated Branches: refs/heads/master 4901086fe -> d76592276 [SPARK-12610][SQL] Left Anti Join ### What changes were proposed in this pull request? This PR adds support for `LEFT ANTI JOIN` to Spark SQL. A `LEFT ANTI JOIN` is the exact opposite of a `LEFT SEMI JOIN` and can
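For readers unfamiliar with the join type named in this commit, here is a minimal sketch of the semantics on plain Python lists. It is not Spark code: the helper names `left_semi_join` and `left_anti_join` and the sample rows are hypothetical, and SQL NULL handling is ignored. A left semi join keeps the left rows that have a match on the right; a left anti join keeps the left rows that have none.

```python
def left_semi_join(left, right, key):
    """Keep left rows whose key appears in at least one right row."""
    right_keys = {key(r) for r in right}
    return [l for l in left if key(l) in right_keys]


def left_anti_join(left, right, key):
    """Keep left rows whose key appears in no right row."""
    right_keys = {key(r) for r in right}
    return [l for l in left if key(l) not in right_keys]


# Hypothetical sample data: (order_id, item) and (order_id, ship_date).
orders = [(1, "apple"), (2, "pear"), (3, "plum")]
shipped = [(1, "2016-04-01"), (3, "2016-04-02")]


def first(row):
    return row[0]


semi = left_semi_join(orders, shipped, first)  # orders that were shipped
anti = left_anti_join(orders, shipped, first)  # orders never shipped
```

In both cases only columns from the left side are returned, and the two results together partition the left input.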

spark git commit: [SPARK-14446][TESTS] Fix ReplSuite for Scala 2.10.

2016-04-06 Thread vanzin
Repository: spark Updated Branches: refs/heads/master 611dbce4b -> 4901086fe [SPARK-14446][TESTS] Fix ReplSuite for Scala 2.10. Just use the same test code as the 2.11 version, which seems to pass. Author: Marcelo Vanzin Closes #12223 from vanzin/SPARK-14446.

spark git commit: [SPARK-12382][ML] Remove mllib GBT implementation and wrap ml

2016-04-06 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 864d1b4d6 -> bb873754b [SPARK-12382][ML] Remove mllib GBT implementation and wrap ml ## What changes were proposed in this pull request? This patch removes the implementation of gradient boosted trees in

spark git commit: [SPARK-14436][SQL] Make JavaDatasetAggregatorSuiteBase public.

2016-04-06 Thread vanzin
Repository: spark Updated Branches: refs/heads/master f1def573f -> 864d1b4d6 [SPARK-14436][SQL] Make JavaDatasetAggregatorSuiteBase public. Without this, unit tests that extend that class fail for me locally on maven, because JUnit tries to run methods in that class and gets an

spark git commit: [SPARK-13112][CORE] Make sure RegisterExecutorResponse arrive before LaunchTask

2016-04-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master c4bb02abf -> f1def573f [SPARK-13112][CORE] Make sure RegisterExecutorResponse arrive before LaunchTask ## What changes were proposed in this pull request? Send `RegisterExecutorResponse` using `executorRef` in order to make sure

spark git commit: [SPARK-14290][CORE][NETWORK] avoid significant memory copy in netty's transferTo

2016-04-06 Thread vanzin
Repository: spark Updated Branches: refs/heads/master d717ae1fd -> c4bb02abf [SPARK-14290][CORE][NETWORK] avoid significant memory copy in netty's transferTo ## What changes were proposed in this pull request? When netty transfers data that is not a `FileRegion`, the data will be in the format of

spark git commit: [SPARK-14444][BUILD] Add a new scalastyle `NoScalaDoc` to prevent ScalaDoc-style multiline comments

2016-04-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 457e58bef -> d717ae1fd [SPARK-14444][BUILD] Add a new scalastyle `NoScalaDoc` to prevent ScalaDoc-style multiline comments ## What changes were proposed in this pull request? According to the [Spark Code Style

spark git commit: [SPARK-14424][BUILD][DOCS] Update the build docs to switch from assembly to package and add a no…

2016-04-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 9af5423ec -> 457e58bef [SPARK-14424][BUILD][DOCS] Update the build docs to switch from assembly to package and add a no… ## What changes were proposed in this pull request? Change our build docs & shell scripts so that developers are

spark git commit: [SPARK-12133][STREAMING] Streaming dynamic allocation

2016-04-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master de4792605 -> 9af5423ec [SPARK-12133][STREAMING] Streaming dynamic allocation ## What changes were proposed in this pull request? Added a new Executor Allocation Manager for the Streaming scheduler for doing Streaming Dynamic Allocation.

spark git commit: [SPARK-14391][LAUNCHER] Increase test timeouts.

2016-04-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 5a4b11a90 -> de4792605 [SPARK-14391][LAUNCHER] Increase test timeouts. Most of the time tests should still pass really quickly; it's just when machines are overloaded that the tests may take a little time, but that's still preferable over

spark git commit: [SPARK-14224] [SPARK-14223] [SPARK-14310] [SQL] fix RowEncoder and parquet reader for wide table

2016-04-06 Thread davies
Repository: spark Updated Branches: refs/heads/master a4ead6d38 -> 5a4b11a90 [SPARK-14224] [SPARK-14223] [SPARK-14310] [SQL] fix RowEncoder and parquet reader for wide table ## What changes were proposed in this pull request? 1) fix the RowEncoder for wide table (many columns) by splitting

spark git commit: [SPARK-14382][SQL] QueryProgress should be post after committedOffsets is updated

2016-04-06 Thread tdas
Repository: spark Updated Branches: refs/heads/master 9c6556c5f -> a4ead6d38 [SPARK-14382][SQL] QueryProgress should be post after committedOffsets is updated ## What changes were proposed in this pull request? Make sure QueryProgress is posted after committedOffsets is updated. If

spark git commit: [SPARK-13430][PYSPARK][ML] Python API for training summaries of linear and logistic regression

2016-04-06 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master bb1fa5b21 -> 9c6556c5f [SPARK-13430][PYSPARK][ML] Python API for training summaries of linear and logistic regression ## What changes were proposed in this pull request? Adding Python API for training summaries of LogisticRegression and

spark git commit: [SPARK-14320][SQL] Make ColumnarBatch.Row mutable

2016-04-06 Thread yhuai
Repository: spark Updated Branches: refs/heads/master af73d9737 -> bb1fa5b21 [SPARK-14320][SQL] Make ColumnarBatch.Row mutable ## What changes were proposed in this pull request? In order to leverage a data structure like `AggregateHashMap` (https://github.com/apache/spark/pull/12055) to

spark git commit: [SPARK-13538][ML] Add GaussianMixture to ML

2016-04-06 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 8cffcb60d -> af73d9737 [SPARK-13538][ML] Add GaussianMixture to ML JIRA: https://issues.apache.org/jira/browse/SPARK-13538 ## What changes were proposed in this pull request? Add GaussianMixture and GaussianMixtureModel to ML package ##

spark git commit: [SPARK-14322][MLLIB] Use treeAggregate instead of reduce in OnlineLDAOptimizer

2016-04-06 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-1.6 cfe9f02ca -> dca0d9a48 [SPARK-14322][MLLIB] Use treeAggregate instead of reduce in OnlineLDAOptimizer ## What changes were proposed in this pull request? jira: https://issues.apache.org/jira/browse/SPARK-14322 OnlineLDAOptimizer uses
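As background for the change named in this commit, the following is a minimal sketch of tree-style aggregation in plain Python. It is not Spark's implementation: the helper `tree_aggregate`, its `fan_in` parameter, and the sample vectors are hypothetical. The idea it illustrates is that partial results are combined in several levels, so no single step has to merge every partition's result at once.

```python
from functools import reduce


def tree_aggregate(partials, combine, fan_in=2):
    """Combine partial results level by level, at most fan_in per group."""
    if not partials:
        raise ValueError("need at least one partial result")
    if fan_in < 2:
        raise ValueError("fan_in must be at least 2")
    level = list(partials)
    while len(level) > 1:
        # Merge neighbouring groups; each pass shrinks the level by ~fan_in.
        level = [
            reduce(combine, level[i:i + fan_in])
            for i in range(0, len(level), fan_in)
        ]
    return level[0]


def add_vectors(a, b):
    return [x + y for x, y in zip(a, b)]


# Hypothetical per-partition statistics (one small vector per partition).
partials = [[1, 2], [3, 4], [5, 6], [7, 8], [9, 10]]

total = tree_aggregate(partials, add_vectors)
flat = reduce(add_vectors, partials)  # single-level reduction for comparison
```

For an associative `combine`, both approaches give the same result; the difference is only in how the merging work is distributed.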

spark git commit: [SPARK-13786][ML][PYSPARK] Add save/load for pyspark.ml.tuning

2016-04-06 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 3c8d88216 -> db0b06c6e [SPARK-13786][ML][PYSPARK] Add save/load for pyspark.ml.tuning ## What changes were proposed in this pull request? https://issues.apache.org/jira/browse/SPARK-13786 Add save/load for Python CrossValidator/Model and

spark git commit: [SPARK-14383][SQL] missing "|" in the g4 file

2016-04-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 5abd02c02 -> 3c8d88216 [SPARK-14383][SQL] missing "|" in the g4 file ## What changes were proposed in this pull request? A very trivial one. It missed "|" between DISTRIBUTE and UNSET. ## How was this patch tested? I do not think it is

spark git commit: [SPARK-14429][SQL] Improve LIKE pattern in "SHOW TABLES / FUNCTIONS LIKE " DDL

2016-04-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 10494feae -> 5abd02c02 [SPARK-14429][SQL] Improve LIKE pattern in "SHOW TABLES / FUNCTIONS LIKE " DDL LIKE is commonly used in SHOW TABLES / FUNCTIONS and similar DDL. In the pattern, users can use `|` or `*` as wildcards. 1. Currently, we used

spark git commit: [SPARK-14426][SQL] Merge ParserUtils and ParseUtils

2016-04-06 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 90ca18448 -> 10494feae [SPARK-14426][SQL] Merge ParserUtils and ParseUtils ## What changes were proposed in this pull request? We have ParserUtils and ParseUtils, which are both utility collections for use during the parsing process.

spark git commit: [SPARK-14418][PYSPARK] fix unpersist of Broadcast in Python

2016-04-06 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 59236e5c5 -> 90ca18448 [SPARK-14418][PYSPARK] fix unpersist of Broadcast in Python ## What changes were proposed in this pull request? Currently, Broadcast.unpersist() will remove the file of broadcast, which should be the behavior of

spark git commit: [SPARK-14288][SQL] Memory Sink for streaming

2016-04-06 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 5e64dab86 -> 59236e5c5 [SPARK-14288][SQL] Memory Sink for streaming This PR exposes the internal testing `MemorySink` through the data source API. This will allow users to easily test streaming applications in the Spark shell or other

spark git commit: [SPARK-14430][BUILD] use https while downloading binaries from build/mvn

2016-04-06 Thread srowen
Repository: spark Updated Branches: refs/heads/master 24015199f -> 5e64dab86 [SPARK-14430][BUILD] use https while downloading binaries from build/mvn ## What changes were proposed in this pull request? The `./build/mvn` script was downloading binaries in non-HTTPS mode. This PR intends to fix it.

spark git commit: Added omitted word in error message

2016-04-06 Thread srowen
Repository: spark Updated Branches: refs/heads/master 25a4c8e0c -> 24015199f Added omitted word in error message ## What changes were proposed in this pull request? Added an omitted word in the error message displayed by the GraphX Pregel API when `maxIterations <= 0` ## How was this patch

spark git commit: [SPARK-14396][BUILD][HOT] Fix compilation against Scala 2.10

2016-04-06 Thread hvanhovell
Repository: spark Updated Branches: refs/heads/master 78c1076d0 -> 25a4c8e0c [SPARK-14396][BUILD][HOT] Fix compilation against Scala 2.10 ## What changes were proposed in this pull request? This PR fixes the compilation errors in the Scala 2.10 build, as shown in the link: