I am no expert myself, but from what I understand DataFrame is grandfathering SchemaRDD. This was done for API stability as spark sql matured out of alpha as part of 1.3.0 release.
It is forward looking and brings (dataframe like) syntax that was not available with the older schema RDD. On Apr 18, 2015, at 4:43 PM, Arun Patel <arunp.bigd...@gmail.com> wrote: > Experts, > > I have few basic questions on DataFrames vs Spark SQL. My confusion is more > with DataFrames. > > 1) What is the difference between Spark SQL and DataFrames? Are they same? > 2) Documentation says SchemaRDD is renamed as DataFrame. This means > SchemaRDD is not existing in 1.3? > 3) As per documentation, it looks like creating dataframe is no different > than SchemaRDD - df = > sqlContext.jsonFile("examples/src/main/resources/people.json"). > So, my question is what is the difference? > > Thanks for your help. > > Arun --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org