Thank you, I like and agree with your point. RDD evolved to Datasets by means 
of an optimizer.
I just wonder what are the use cases for RDDs (other than current version of 
GraphX leveraging RDDs)?

Best,
Ovidiu

> On 01 Sep 2016, at 16:26, Sean Owen <so...@cloudera.com> wrote:
> 
> Here's my paraphrase:
> 
> Datasets are really the new RDDs. They have a similar nature
> (container of strongly-typed objects) but bring some optimizations via
> Encoders for common types.
> 
> DataFrames are different from RDDs and Datasets and do not replace and
> are not replaced by them. They're fundamentally for tabular data, not
> arbitrary objects, and thus supports SQL-like operations that only
> make sense on tabular  data.
> 
> On Thu, Sep 1, 2016 at 3:17 PM, Ashok Kumar
> <ashok34...@yahoo.com.invalid> wrote:
>> Hi,
>> 
>> What are practical differences between the new Data set in Spark 2 and the
>> existing DataFrame.
>> 
>> Has Dataset replaced Data Frame and what advantages it has if I use Data
>> Frame instead of Data Frame.
>> 
>> Thanks
>> 
>> 
> 
> ---------------------------------------------------------------------
> To unsubscribe e-mail: user-unsubscr...@spark.apache.org
> 


---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org

Reply via email to