date:20140919

git commit: [SPARK-2062][GraphX] VertexRDD.apply does not use the mergeFunc

2014-09-19 Thread ankurdave

Repository: spark Updated Branches: refs/heads/branch-1.1 cf15b22d4 - 1687d6ba9 [SPARK-2062][GraphX] VertexRDD.apply does not use the mergeFunc VertexRDD.apply had a bug where it ignored the merge function for duplicate vertices and instead used whichever vertex attribute occurred first. This

git commit: MAINTENANCE: Automated closing of pull requests.

2014-09-19 Thread pwendell

Repository: spark Updated Branches: refs/heads/master 3bbbdd818 - a48956f58 MAINTENANCE: Automated closing of pull requests. This commit exists to close the following pull requests on Github: Closes #726 (close requested by 'pwendell') Closes #151 (close requested by 'pwendell') Project:

git commit: [SPARK-1701] Clarify slice vs partition in the programming guide

2014-09-19 Thread joshrosen

Repository: spark Updated Branches: refs/heads/master a48956f58 - be0c7563e [SPARK-1701] Clarify slice vs partition in the programming guide This is a partial solution to SPARK-1701, only addressing the documentation confusion. Additional work can be to actually change the numSlices

git commit: [SPARK-1701] [PySpark] remove slice terminology from python examples

2014-09-19 Thread joshrosen

Repository: spark Updated Branches: refs/heads/master be0c7563e - a03e5b81e [SPARK-1701] [PySpark] remove slice terminology from python examples Author: Matthew Farrellee m...@redhat.com Closes #2304 from mattf/SPARK-1701-partition-over-slice-for-python-examples and squashes the following

[2/2] git commit: [SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib

2014-09-19 Thread meng

[SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib Currently, we serialize the data between JVM and Python case by case manually, this cannot scale to support so many APIs in MLlib. This patch will try to address this problem by serialize the data using pickle protocol, using

[1/2] [SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib

2014-09-19 Thread meng

Repository: spark Updated Branches: refs/heads/master a03e5b81e - fce5e251d http://git-wip-us.apache.org/repos/asf/spark/blob/fce5e251/python/pyspark/mllib/random.py -- diff --git a/python/pyspark/mllib/random.py

git commit: [SPARK-2594][SQL] Support CACHE TABLE name AS SELECT ...

2014-09-19 Thread marmbrus

Repository: spark Updated Branches: refs/heads/master 2c3cc7641 - 5522151eb [SPARK-2594][SQL] Support CACHE TABLE name AS SELECT ... This feature allows user to add cache table from the select query. Example : ```CACHE TABLE testCacheTable AS SELECT * FROM TEST_TABLE``` Spark takes this type

git commit: [SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row

2014-09-19 Thread marmbrus

Repository: spark Updated Branches: refs/heads/master 5522151eb - a95ad99e3 [SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row Fix the issue when applySchema() to an RDD of Row. Also add type mapping for BinaryType. Author: Davies Liu davies@gmail.com Closes #2448 from

git commit: [Build] Fix passing of args to sbt

2014-09-19 Thread marmbrus

Repository: spark Updated Branches: refs/heads/master ba68a51c4 - 99b06b6fd [Build] Fix passing of args to sbt Simple mistake, simple fix: ```shell args=arg1 arg2 arg3 sbt $args# sbt sees 3 arguments sbt $args # sbt sees 1 argument ``` Should fix the problems we are seeing

git commit: [Docs] Fix outdated docs for standalone cluster

2014-09-19 Thread andrewor14

Repository: spark Updated Branches: refs/heads/master 99b06b6fd - 8af237061 [Docs] Fix outdated docs for standalone cluster This is now supported! Author: andrewor14 andrewo...@gmail.com Author: Andrew Or andrewo...@gmail.com Closes #2461 from andrewor14/document-standalone-cluster and

git commit: [Docs] Fix outdated docs for standalone cluster

2014-09-19 Thread andrewor14

Repository: spark Updated Branches: refs/heads/branch-1.1 1687d6ba9 - fd8835323 [Docs] Fix outdated docs for standalone cluster This is now supported! Author: andrewor14 andrewo...@gmail.com Author: Andrew Or andrewo...@gmail.com Closes #2461 from andrewor14/document-standalone-cluster and

git commit: [SPARK-2062][GraphX] VertexRDD.apply does not use the mergeFunc

git commit: MAINTENANCE: Automated closing of pull requests.

git commit: [SPARK-1701] Clarify slice vs partition in the programming guide

git commit: [SPARK-1701] [PySpark] remove slice terminology from python examples

[2/2] git commit: [SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib

[1/2] [SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlib

git commit: [SPARK-2594][SQL] Support CACHE TABLE name AS SELECT ...

git commit: [SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row

git commit: [Build] Fix passing of args to sbt

git commit: [Docs] Fix outdated docs for standalone cluster

git commit: [Docs] Fix outdated docs for standalone cluster

11 matches

Site Navigation

Mail list logo

Footer information