I am no expert myself, but from what I understand DataFrame is grandfathering 
SchemaRDD. This was done for API stability as spark sql matured out of alpha as 
part of 1.3.0 release. 

It is forward looking and brings (dataframe like) syntax that was not available 
with the older schema RDD.

On Apr 18, 2015, at 4:43 PM, Arun Patel <arunp.bigd...@gmail.com> wrote:

> Experts,
> 
> I have few basic questions on DataFrames vs Spark SQL.  My confusion is more 
> with DataFrames. 
> 
> 1)  What is the difference between Spark SQL and DataFrames?  Are they same?
> 2)  Documentation says SchemaRDD is renamed as DataFrame. This means 
> SchemaRDD is not existing in 1.3?  
> 3)  As per documentation, it looks like creating dataframe is no different 
> than SchemaRDD -  df = 
> sqlContext.jsonFile("examples/src/main/resources/people.json").  
>     So, my question is what is the difference?
> 
> Thanks for your help.  
> 
> Arun


---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to