Is this really an internal / external distinction? For a concrete example, Source.getBatch seems to be a public interface, but returns DataFrame.
On Thu, Jun 16, 2016 at 1:42 PM, Tathagata Das <[email protected]> wrote: > DataFrame is a type alias of Dataset[Row], so externally it seems like > Dataset is the main type and DataFrame is a derivative type. > However, internally, since everything is processed as Rows, everything uses > DataFrames, Type classes used in a Dataset is internally converted to rows > for processing. . Therefore internally DataFrame is like "main" type that is > used. > > On Thu, Jun 16, 2016 at 11:18 AM, Cody Koeninger <[email protected]> wrote: >> >> Sorry, meant DataFrame vs Dataset >> >> On Thu, Jun 16, 2016 at 12:53 PM, Cody Koeninger <[email protected]> >> wrote: >> > Is there a principled reason why sql.streaming.* and >> > sql.execution.streaming.* are making extensive use of DataFrame >> > instead of Datasource? >> > >> > Or is that just a holdover from code written before the move / type >> > alias? >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: [email protected] >> For additional commands, e-mail: [email protected] >> > --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
