Thanks!
From: Herman van Hövell tot Westerflier [mailto:hvanhov...@questtec.nl]
Sent: Fri, Jun 03, 2016 10:05
To: Gerhard Fiedler <gfied...@algebraixdata.com>
Cc: dev@spark.apache.org
Subject: Re: Where is DataFrame.scala in 2.0?
Hi Gerhard,
DataFrame and DataSet have been merged in Spark 2.0. A DataFrame is now a
DataSet that contains Row objects. We still maintain a type alias for
DataFrame:
https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/package.scala#L45
HTH
Kind regards,
Herman van Hövell tot Westerflier
2016-06-03 17:01 GMT+02:00 Gerhard Fiedler
<gfied...@algebraixdata.com<mailto:gfied...@algebraixdata.com>>:
When I look at the sources in Github, I see DataFrame.scala at
https://github.com/apache/spark/blob/branch-1.6/sql/core/src/main/scala/org/apache/spark/sql/DataFrame.scala
in the 1.6 branch. But when I change the branch to branch-2.0 or master, I get
a 404 error. I also can’t find the file in the directory listings, for example
https://github.com/apache/spark/tree/branch-2.0/sql/core/src/main/scala/org/apache/spark/sql
(for branch-2.0).
It seems that quite a few APIs use the DataFrame class, even in 2.0. Can
someone please point me to its location, or otherwise explain why it is not
there?
Thanks,
Gerhard