Repository: spark
Updated Branches:
refs/heads/branch-1.1 cf15b22d4 -> 1687d6ba9
[SPARK-2062][GraphX] VertexRDD.apply does not use the mergeFunc
VertexRDD.apply had a bug where it ignored the merge function for
duplicate vertices and instead used whichever vertex attribute occurred
first. This
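
To illustrate the behaviour described above, here is a minimal plain-Python sketch (not GraphX code) contrasting "first attribute wins" with applying a merge function to duplicate vertex IDs. The names `first_wins` and `merge_vertices`, and the sample data, are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, Iterable, Tuple, TypeVar

V = TypeVar("V")


def first_wins(vertices: Iterable[Tuple[int, V]]) -> Dict[int, V]:
    """Keep whichever attribute occurs first for each vertex ID (the buggy behaviour)."""
    merged: Dict[int, V] = {}
    for vid, attr in vertices:
        merged.setdefault(vid, attr)
    return merged


def merge_vertices(
    vertices: Iterable[Tuple[int, V]], merge_func: Callable[[V, V], V]
) -> Dict[int, V]:
    """Combine attributes of duplicate vertex IDs with merge_func (the intended behaviour)."""
    merged: Dict[int, V] = {}
    for vid, attr in vertices:
        if vid in merged:
            merged[vid] = merge_func(merged[vid], attr)
        else:
            merged[vid] = attr
    return merged


# Hypothetical input: vertex 1 appears twice.
vertices = [(1, 10), (2, 5), (1, 7)]

print(first_wins(vertices))                          # {1: 10, 2: 5}
print(merge_vertices(vertices, lambda a, b: a + b))  # {1: 17, 2: 5}
```

With a summing merge function, the duplicate entries for vertex 1 are combined instead of the second one being silently dropped.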
Repository: spark
Updated Branches:
refs/heads/master 3bbbdd818 -> a48956f58
MAINTENANCE: Automated closing of pull requests.
This commit exists to close the following pull requests on GitHub:
Closes #726 (close requested by 'pwendell')
Closes #151 (close requested by 'pwendell')
Repository: spark
Updated Branches:
refs/heads/master a48956f58 -> be0c7563e
[SPARK-1701] Clarify slice vs partition in the programming guide
This is a partial solution to SPARK-1701, only addressing the
documentation confusion.
Additional work can be to actually change the numSlices
Repository: spark
Updated Branches:
refs/heads/master be0c7563e -> a03e5b81e
[SPARK-1701] [PySpark] remove slice terminology from python examples
Author: Matthew Farrellee m...@redhat.com
Closes #2304 from mattf/SPARK-1701-partition-over-slice-for-python-examples and
squashes the following
[SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib
Currently, we serialize the data between the JVM and Python manually, case by case,
which cannot scale to support so many APIs in MLlib.
This patch tries to address this problem by serializing the data using the pickle
protocol, using
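
As a rough illustration of the general idea (not the patch itself), the sketch below round-trips a record through Python's standard `pickle` module. The record layout and the choice of protocol 2 are assumptions made for this example; the JVM side of the exchange is not shown.

```python
import pickle

# Hypothetical record; the real MLlib data types are not reproduced here.
record = {"label": 1.0, "features": [0.5, 0.0, 2.5]}

# Serialize to bytes with an explicit protocol version (assumed: 2).
payload = pickle.dumps(record, protocol=2)

# Deserialize back into an equal Python object.
restored = pickle.loads(payload)

print(type(payload).__name__)  # bytes
print(restored == record)      # True
```

The appeal of a generic format like this is that one serializer covers many data types, rather than one hand-written converter per API.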
Repository: spark
Updated Branches:
refs/heads/master a03e5b81e -> fce5e251d
http://git-wip-us.apache.org/repos/asf/spark/blob/fce5e251/python/pyspark/mllib/random.py
--
diff --git a/python/pyspark/mllib/random.py
Repository: spark
Updated Branches:
refs/heads/master 2c3cc7641 -> 5522151eb
[SPARK-2594][SQL] Support CACHE TABLE name AS SELECT ...
This feature allows users to cache a table populated from a SELECT query.
Example: ```CACHE TABLE testCacheTable AS SELECT * FROM TEST_TABLE```
Spark takes this type
Repository: spark
Updated Branches:
refs/heads/master 5522151eb -> a95ad99e3
[SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row
Fix the issue when applying applySchema() to an RDD of Row.
Also add type mapping for BinaryType.
Author: Davies Liu davies@gmail.com
Closes #2448 from
Repository: spark
Updated Branches:
refs/heads/master ba68a51c4 -> 99b06b6fd
[Build] Fix passing of args to sbt
Simple mistake, simple fix:
```shell
args="arg1 arg2 arg3"
sbt $args    # sbt sees 3 arguments
sbt "$args"  # sbt sees 1 argument
```
```
Should fix the problems we are seeing
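
The difference between passing a string of arguments split into words and passing it as a single quoted value can also be checked from Python. This is a minimal sketch: the helper `count_received` is hypothetical, a child Python process stands in for sbt, and `str.split()` only approximates shell word splitting (it does no globbing or quote handling).

```python
import subprocess
import sys

# Child program that reports how many arguments it received.
COUNT_ARGS = "import sys; print(len(sys.argv) - 1)"


def count_received(args):
    """Run a child process with the given argument list and return its argument count."""
    result = subprocess.run(
        [sys.executable, "-c", COUNT_ARGS, *args],
        capture_output=True,
        text=True,
        check=True,
    )
    return int(result.stdout)


args = "arg1 arg2 arg3"

split_count = count_received(args.split())  # analogous to an unquoted expansion
single_count = count_received([args])       # analogous to a quoted expansion

print(split_count)   # 3
print(single_count)  # 1
```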
Repository: spark
Updated Branches:
refs/heads/master 99b06b6fd -> 8af237061
[Docs] Fix outdated docs for standalone cluster
This is now supported!
Author: andrewor14 andrewo...@gmail.com
Author: Andrew Or andrewo...@gmail.com
Closes #2461 from andrewor14/document-standalone-cluster and
Repository: spark
Updated Branches:
refs/heads/branch-1.1 1687d6ba9 -> fd8835323
[Docs] Fix outdated docs for standalone cluster
This is now supported!
Author: andrewor14 andrewo...@gmail.com
Author: Andrew Or andrewo...@gmail.com
Closes #2461 from andrewor14/document-standalone-cluster and