Here's a longer version of that talk that I gave, which goes into more
detail on the internals:
http://www.slideshare.net/databricks/spark-sql-deep-dive-melbroune
On Fri, Aug 21, 2015 at 8:28 AM, Sameer Farooqui same...@databricks.com
wrote:
Have you seen the Spark SQL paper?:
https://people.csail.mit.edu/matei/papers/2015/sigmod_spark_sql.pdf
On Thu, Aug 20, 2015 at 11:35 PM, Dawid Wysakowicz
wysakowicz.da...@gmail.com wrote:
Hi,
thanks for the answers. I have read what you provided, but I am looking for
materials on the internals instead, e.g. how the optimizer works, how a query
is translated into RDD operations, etc. I am already quite familiar with the API.
A good starting point for me was: Spark DataFrames: Simple and Fast
Hi,
I would like to dive into Spark SQL and get to know its architecture, good
practices, and some internals better. Could you recommend some materials on
this matter?
Regards
Dawid
Hi Dawid
The best place to get started is the Spark SQL Guide from Apache:
http://spark.apache.org/docs/latest/sql-programming-guide.html
Regards
Muhammad
On Thu, Aug 20, 2015 at 5:46 AM, Dawid Wysakowicz
wysakowicz.da...@gmail.com wrote:
Hi,
I would like to dip into SparkSQL. Get to know
See also
http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.package
Cheers
On Thu, Aug 20, 2015 at 7:50 AM, Muhammad Atif muhammadatif...@gmail.com
wrote:
Hi Dawid
The best place to get started is the Spark SQL Guide from Apache
Or, if you're a Python lover, this is a good place:
https://spark.apache.org/docs/1.4.1/api/python/pyspark.sql.html#
On Thu, Aug 20, 2015 at 10:58 AM, Ted Yu yuzhih...@gmail.com wrote:
See also
http://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.package
Cheers