Re: Python implementation of RDD interface

Davies Liu Fri, 29 May 2015 11:47:33 -0700

There is another Python implementation of the RDD interface, called
DPark [1]. Could you say a few words comparing the two?


[1] https://github.com/douban/dpark/

On Fri, May 29, 2015 at 8:29 AM, Sven Kreiss <s...@svenkreiss.com> wrote:
> I wanted to share a Python implementation of RDDs: pysparkling.
>
> http://trivial.io/post/120179819751/pysparkling-is-a-native-implementation-of-the
>
> The benefit is that you can apply the same code that you use in PySpark on
> large datasets in pysparkling on small datasets or single documents. When
> running with pysparkling, there is no dependency on the Java Virtual Machine
> or Hadoop.
>
> Sven

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org
