Spark SQL performance and data size constraints

2014-11-26 Thread SK
this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Spark-SQL-performance-and-data-size-constraints-tp19843.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e

RE: Spark SQL performance and data size constraints

2014-11-26 Thread Cheng, Hao
. -Original Message- From: SK [mailto:skrishna...@gmail.com] Sent: Wednesday, November 26, 2014 4:17 PM To: u...@spark.incubator.apache.org Subject: Spark SQL performance and data size constraints Hi, I use the following code to read in data and extract the unique users using Spark SQL. The data