DataFrame is a type alias of Dataset[Row], so externally it seems like Dataset is the main type and DataFrame is a derivative type. However, internally, since everything is processed as Rows, everything uses DataFrames, Type classes used in a Dataset is internally converted to rows for processing. . Therefore internally DataFrame is like "main" type that is used.
On Thu, Jun 16, 2016 at 11:18 AM, Cody Koeninger <c...@koeninger.org> wrote: > Sorry, meant DataFrame vs Dataset > > On Thu, Jun 16, 2016 at 12:53 PM, Cody Koeninger <c...@koeninger.org> > wrote: > > Is there a principled reason why sql.streaming.* and > > sql.execution.streaming.* are making extensive use of DataFrame > > instead of Datasource? > > > > Or is that just a holdover from code written before the move / type > alias? > > --------------------------------------------------------------------- > To unsubscribe, e-mail: dev-unsubscr...@spark.apache.org > For additional commands, e-mail: dev-h...@spark.apache.org > >