Re: RDDs

Tobias Pfeiffer Wed, 03 Sep 2014 17:56:34 -0700

Hello,

On Wed, Sep 3, 2014 at 6:02 PM, rapelly kartheek <kartheek.m...@gmail.com>
wrote:
>
> Can someone tell me what kind of operations can be performed on a
> replicated rdd?? What are the use-cases of a replicated rdd.
>

I suggest you read

https://spark.apache.org/docs/latest/programming-guide.html#resilient-distributed-datasets-rdds
as an introduction, it lists a lot of the transformations and output
operations you can use.
Personally, I also found it quite helpful to read the paper about RDDs:
  http://www.cs.berkeley.edu/~matei/papers/2012/nsdi_spark.pdf

> One basic doubt that is bothering me from long time: what is the
> difference between an application and job in the Spark parlance. I am
> confused b'cas of Hadoop jargon.
>

OK, someone else might answer that. I am myself confused with application,
job, task, stage etc. ;-)

Tobias

Re: RDDs

Reply via email to