Repository: spark
Updated Branches:
refs/heads/master d76592276 -> 21d5ca128
[SPARK-14134][CORE] Change the package name used for shading classes.
The current package name uses a dash, which is a little weird but seemed
to work. That is, until a new test tried to mock a class that references
o
Repository: spark
Updated Branches:
refs/heads/master 4901086fe -> d76592276
[SPARK-12610][SQL] Left Anti Join
### What changes were proposed in this pull request?
This PR adds support for `LEFT ANTI JOIN` to Spark SQL. A `LEFT ANTI JOIN` is
the exact opposite of a `LEFT SEMI JOIN` and can b
Repository: spark
Updated Branches:
refs/heads/master 611dbce4b -> 4901086fe
[SPARK-14446][TESTS] Fix ReplSuite for Scala 2.10.
Just use the same test code as the 2.11 version, which seems to pass.
Author: Marcelo Vanzin
Closes #12223 from vanzin/SPARK-14446.
Project: http://git-wip-us.ap
Repository: spark
Updated Branches:
refs/heads/master bb873754b -> 611dbce4b
[SPARK-12555][SQL] Result should not be corrupted after input columns are
reordered
This PR add test case described in SPARK-12555 to validate that correct data is
returned when input data is reordered and to avoid
Repository: spark
Updated Branches:
refs/heads/master 864d1b4d6 -> bb873754b
[SPARK-12382][ML] Remove mllib GBT implementation and wrap ml
## What changes were proposed in this pull request?
This patch removes the implementation of gradient boosted trees in
mllib/tree/GradientBoostedTrees.sc
Repository: spark
Updated Branches:
refs/heads/master f1def573f -> 864d1b4d6
[SPARK-14436][SQL] Make JavaDatasetAggregatorSuiteBase public.
Without this, unit tests that extend that class fail for me locally
on maven, because JUnit tries to run methods in that class and gets
an IllegalAccessEr
Repository: spark
Updated Branches:
refs/heads/master c4bb02abf -> f1def573f
[SPARK-13112][CORE] Make sure RegisterExecutorResponse arrive before LaunchTask
## What changes were proposed in this pull request?
Send `RegisterExecutorResponse` using `executorRef` in order to make sure
RegisterE
Repository: spark
Updated Branches:
refs/heads/master d717ae1fd -> c4bb02abf
[SPARK-14290][CORE][NETWORK] avoid significant memory copy in netty's transferTo
## What changes were proposed in this pull request?
When netty transfer data that is not `FileRegion`, data will be in format of
`ByteB
Repository: spark
Updated Branches:
refs/heads/master 457e58bef -> d717ae1fd
[SPARK-1][BUILD] Add a new scalastyle `NoScalaDoc` to prevent
ScalaDoc-style multiline comments
## What changes were proposed in this pull request?
According to the [Spark Code Style
Guide](https://cwiki.apache
Repository: spark
Updated Branches:
refs/heads/master 9af5423ec -> 457e58bef
[SPARK-14424][BUILD][DOCS] Update the build docs to switch from assembly to
package and add a noâ¦
## What changes were proposed in this pull request?
Change our build docs & shell scripts to that developers are aw
Repository: spark
Updated Branches:
refs/heads/master de4792605 -> 9af5423ec
[SPARK-12133][STREAMING] Streaming dynamic allocation
## What changes were proposed in this pull request?
Added a new Executor Allocation Manager for the Streaming scheduler for doing
Streaming Dynamic Allocation.
Repository: spark
Updated Branches:
refs/heads/master 5a4b11a90 -> de4792605
[SPARK-14391][LAUNCHER] Increase test timeouts.
Most of the time tests should still pass really quickly; it's just
when machines are overloaded that the tests may take a little time,
but that's still preferable over j
Repository: spark
Updated Branches:
refs/heads/master a4ead6d38 -> 5a4b11a90
[SPARK-14224] [SPARK-14223] [SPARK-14310] [SQL] fix RowEncoder and parquet
reader for wide table
## What changes were proposed in this pull request?
1) fix the RowEncoder for wide table (many columns) by splitting t
Repository: spark
Updated Branches:
refs/heads/master 9c6556c5f -> a4ead6d38
[SPARK-14382][SQL] QueryProgress should be post after committedOffsets is
updated
## What changes were proposed in this pull request?
Make sure QueryProgress is post after committedOffsets is updated. If
QueryProgr
Repository: spark
Updated Branches:
refs/heads/master bb1fa5b21 -> 9c6556c5f
[SPARK-13430][PYSPARK][ML] Python API for training summaries of linear and
logistic regression
## What changes were proposed in this pull request?
Adding Python API for training summaries of LogisticRegression and
Repository: spark
Updated Branches:
refs/heads/master af73d9737 -> bb1fa5b21
[SPARK-14320][SQL] Make ColumnarBatch.Row mutable
## What changes were proposed in this pull request?
In order to leverage a data structure like `AggregateHashMap`
(https://github.com/apache/spark/pull/12055) to spe
Repository: spark
Updated Branches:
refs/heads/master 8cffcb60d -> af73d9737
[SPARK-13538][ML] Add GaussianMixture to ML
JIRA: https://issues.apache.org/jira/browse/SPARK-13538
## What changes were proposed in this pull request?
Add GaussianMixture and GaussianMixtureModel to ML package
##
Repository: spark
Updated Branches:
refs/heads/master db0b06c6e -> 8cffcb60d
[SPARK-14322][MLLIB] Use treeAggregate instead of reduce in OnlineLDAOptimizer
## What changes were proposed in this pull request?
jira: https://issues.apache.org/jira/browse/SPARK-14322
OnlineLDAOptimizer uses RDD.r
Repository: spark
Updated Branches:
refs/heads/branch-1.6 cfe9f02ca -> dca0d9a48
[SPARK-14322][MLLIB] Use treeAggregate instead of reduce in OnlineLDAOptimizer
## What changes were proposed in this pull request?
jira: https://issues.apache.org/jira/browse/SPARK-14322
OnlineLDAOptimizer uses R
Repository: spark
Updated Branches:
refs/heads/branch-1.5 13f0f4892 -> 1d507a997
[SPARK-14322][MLLIB] Use treeAggregate instead of reduce in OnlineLDAOptimizer
## What changes were proposed in this pull request?
jira: https://issues.apache.org/jira/browse/SPARK-14322
OnlineLDAOptimizer uses R
Repository: spark
Updated Branches:
refs/heads/master 3c8d88216 -> db0b06c6e
[SPARK-13786][ML][PYSPARK] Add save/load for pyspark.ml.tuning
## What changes were proposed in this pull request?
https://issues.apache.org/jira/browse/SPARK-13786
Add save/load for Python CrossValidator/Model and
Repository: spark
Updated Branches:
refs/heads/master 5abd02c02 -> 3c8d88216
[SPARK-14383][SQL] missing "|" in the g4 file
## What changes were proposed in this pull request?
A very trivial one. It missed "|" between DISTRIBUTE and UNSET.
## How was this patch tested?
I do not think it is r
Repository: spark
Updated Branches:
refs/heads/master 10494feae -> 5abd02c02
[SPARK-14429][SQL] Improve LIKE pattern in "SHOW TABLES / FUNCTIONS LIKE
" DDL
LIKE is commonly used in SHOW TABLES / FUNCTIONS etc DDL. In the
pattern, user can use `|` or `*` as wildcards.
1. Currently, we used
Repository: spark
Updated Branches:
refs/heads/master 90ca18448 -> 10494feae
[SPARK-14426][SQL] Merge PerserUtils and ParseUtils
## What changes were proposed in this pull request?
We have ParserUtils and ParseUtils which are both utility collections for use
during the parsing process.
Those
Repository: spark
Updated Branches:
refs/heads/master 59236e5c5 -> 90ca18448
[SPARK-14418][PYSPARK] fix unpersist of Broadcast in Python
## What changes were proposed in this pull request?
Currently, Broaccast.unpersist() will remove the file of broadcast, which
should be the behavior of des
Repository: spark
Updated Branches:
refs/heads/master 5e64dab86 -> 59236e5c5
[SPARK-14288][SQL] Memory Sink for streaming
This PR exposes the internal testing `MemorySink` though the data source API.
This will allow users to easily test streaming applications in the Spark shell
or other loc
Repository: spark
Updated Branches:
refs/heads/master 24015199f -> 5e64dab86
[SPARK-14430][BUILD] use https while downloading binaries from build/mvn
## What changes were proposed in this pull request?
`./build/mvn` file was downloading binaries in non HTTPS mode. This PR tends to
fix it.
#
Repository: spark
Updated Branches:
refs/heads/master 25a4c8e0c -> 24015199f
Added omitted word in error message
## What changes were proposed in this pull request?
Added an omitted word in the error message displayed by the Graphx Pregel API
when `maxIterations <= 0`
## How was this patch
Repository: spark
Updated Branches:
refs/heads/master 78c1076d0 -> 25a4c8e0c
[SPARK-14396][BUILD][HOT] Fix compilation against Scala 2.10
What changes were proposed in this pull request?
This PR is to fix the compilation errors in Scala 2.10 build, as shown in the
link:
https://amplab.cs
29 matches
Mail list logo