Thank you, I agree with your point. RDDs evolved into Datasets by means of an optimizer. I just wonder what the remaining use cases for RDDs are (other than the current version of GraphX, which is built on RDDs)?
Best,
Ovidiu

> On 01 Sep 2016, at 16:26, Sean Owen <so...@cloudera.com> wrote:
>
> Here's my paraphrase:
>
> Datasets are really the new RDDs. They have a similar nature
> (container of strongly-typed objects) but bring some optimizations via
> Encoders for common types.
>
> DataFrames are different from RDDs and Datasets and do not replace and
> are not replaced by them. They're fundamentally for tabular data, not
> arbitrary objects, and thus supports SQL-like operations that only
> make sense on tabular data.
>
> On Thu, Sep 1, 2016 at 3:17 PM, Ashok Kumar
> <ashok34...@yahoo.com.invalid> wrote:
>> Hi,
>>
>> What are practical differences between the new Data set in Spark 2 and the
>> existing DataFrame.
>>
>> Has Dataset replaced Data Frame and what advantages it has if I use Data
>> Frame instead of Data Frame.
>>
>> Thanks
>>
>>
>
> ---------------------------------------------------------------------
> To unsubscribe e-mail: user-unsubscr...@spark.apache.org