A very nice addition for us PySpark users in 0.9.1 is RDD.repartition(),
which is not mentioned in the release notes
<http://spark.apache.org/releases/spark-release-0-9-1.html>!

This is super helpful when you create an RDD from a gzipped file (which
is not splittable, so the whole file lands in a single partition) and then
need to explicitly shuffle the data around to parallelize operations on it
appropriately.
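
For anyone who hasn't run into this, here is a minimal sketch of the pattern
I mean. It assumes you already have a SparkContext (passed in as `sc`); the
helper names `load_gzipped` and `partition_sizes` are made up for
illustration and are not part of PySpark.

```python
def load_gzipped(sc, path, num_partitions):
    """Read a gzipped text file, then spread its lines over num_partitions."""
    # gzip is not a splittable format, so this RDD starts out in one partition.
    lines = sc.textFile(path)
    # repartition() shuffles the records into the requested number of partitions.
    return lines.repartition(num_partitions)


def partition_sizes(rdd):
    """Return the number of records in each partition, as a quick sanity check."""
    # glom() turns every partition into a single list; len() then counts its records.
    return rdd.glom().map(len).collect()
```

With a live SparkContext, usage would look something like
`rdd = load_gzipped(sc, "events.log.gz", 8)` followed by
`partition_sizes(rdd)`. The file name and the partition count of 8 are
placeholders; a sensible count depends on your cluster and data size.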

Thanks people!

FYI, docs/latest <http://spark.apache.org/docs/latest/api/pyspark/index.html>
hasn't been updated yet to reflect the new additions to PySpark.

Nick



On Wed, Apr 9, 2014 at 6:07 PM, Matei Zaharia <matei.zaha...@gmail.com> wrote:

> Thanks TD for managing this release, and thanks to everyone who
> contributed!
>
> Matei
>
> On Apr 9, 2014, at 2:59 PM, Tathagata Das <tathagata.das1...@gmail.com>
> wrote:
>
> A small additional note: Please use the direct download links in the Spark
> Downloads <http://spark.apache.org/downloads.html> page. The Apache
> mirrors take a day or so to sync from the main repo, so they may not work
> immediately.
>
> TD
>
>
> On Wed, Apr 9, 2014 at 2:54 PM, Tathagata Das <tathagata.das1...@gmail.com>
> wrote:
>
>> Hi everyone,
>>
>> We have just posted Spark 0.9.1, which is a maintenance release with
>> bug fixes, performance improvements, better stability with YARN and
>> improved parity of the Scala and Python API. We recommend that all 0.9.0
>> users upgrade to this stable release.
>>
>> This is the first release since Spark graduated as a top level Apache
>> project. Contributions to this release came from 37 developers.
>>
>> The full release notes are at:
>> http://spark.apache.org/releases/spark-release-0-9-1.html
>>
>> You can download the release at:
>> http://spark.apache.org/downloads.html
>>
>> Thanks to all the developers who contributed to this release:
>> Aaron Davidson, Aaron Kimball, Andrew Ash, Andrew Or, Andrew Tulloch,
>> Bijay Bisht, Bouke van der Bijl, Bryn Keller, Chen Chao,
>> Christian Lundgren, Diana Carroll, Emtiaz Ahmed, Frank Dai,
>> Henry Saputra, jianghan, Josh Rosen, Jyotiska NK, Kay Ousterhout,
>> Kousuke Saruta, Mark Grover, Matei Zaharia, Nan Zhu, Nick Lanham,
>> Patrick Wendell, Prabin Banka, Prashant Sharma, Qiuzhuang,
>> Raymond Liu, Reynold Xin, Sandy Ryza, Sean Owen, Shixiong Zhu,
>> shiyun.wxm, Stevo Slavić, Tathagata Das, Tom Graves, Xiangrui Meng
>>
>> TD
>>
>
>
>
