Re: Spark 0.9.1 released

2014-04-09 Thread Nicholas Chammas
Ah, looks good now. It took me a minute to realize that doing a hard refresh on the docs page was missing the RDD class doc page... And thanks for updating the release notes. On Wed, Apr 9, 2014 at 7:21 PM, Tathagata Das wrote: > Thanks Nick for pointing that out! I have updated the release >

Re: Spark 0.9.1 released

2014-04-09 Thread Tathagata Das
Thanks Nick for pointing that out! I have updated the release notes. But I see the new operations like repartition in the latest PySpark RDD docs. Maybe refresh the page couple of

Re: Spark 0.9.1 released

2014-04-09 Thread Nicholas Chammas
A very nice addition for us PySpark users in 0.9.1 is the addition of RDD.repartition(), which is not mentioned in the release notes ! This is super helpful for when you create an RDD from a gzipped file and then need to explicitly shuffle

Re: Spark 0.9.1 released

2014-04-09 Thread Matei Zaharia
Thanks TD for managing this release, and thanks to everyone who contributed! Matei On Apr 9, 2014, at 2:59 PM, Tathagata Das wrote: > A small additional note: Please use the direct download links in the Spark > Downloads page. The Apache mirrors take a day or so to sync from the main > repo,

Re: Spark 0.9.1 released

2014-04-09 Thread Tathagata Das
A small additional note: Please use the direct download links in the Spark Downloads page. The Apache mirrors take a day or so to sync from the main repo, so may not work immediately. TD On Wed, Apr 9, 2014 at 2:54 PM, Tathagata Das wrote: > Hi everyone,

Spark 0.9.1 released

2014-04-09 Thread Tathagata Das
Hi everyone, We have just posted Spark 0.9.1, which is a maintenance release with bug fixes, performance improvements, better stability with YARN and improved parity of the Scala and Python API. We recommend all 0.9.0 users to upgrade to this stable release. This is the first release since Spark