Re: How to patch sparkSQL on EC2?

2014-10-14 Thread Christos Kozanitis Christos Kozanitis
ahhh never mind… I didn’t notice that a spark-assembly jar file gets produced after compiling the whole spark suite… So no more manual editing of the jar file of the AMI for now! Christos On Oct 10, 2014, at 12:15 AM, Christos Kozanitis wrote: Hi I have written a few extensions

How to patch sparkSQL on EC2?

2014-10-10 Thread Christos Kozanitis Christos Kozanitis
Hi I have written a few extensions for sparkSQL (for version 1.1.0) and I am trying to deploy my new jar files (one for catalyst and one for sql/core) on ec2. My approach was to create a new spark/lib/spark-assembly-1.1.0-hadoop1.0.4.jar that merged the contents of the old one with the

Re: SparkSQL operator priority

2014-07-19 Thread Christos Kozanitis
. https://github.com/apache/spark/pull/511/ On Jul 18, 2014, at 8:04 PM, Christos Kozanitis kozani...@berkeley.edu wrote: Hello What is the order with which SparkSQL deserializes parquet fields? Is it possible to modify it? I am using SparkSQL to query a parquet file that consists of a lot

SparkSQL operator priority

2014-07-18 Thread Christos Kozanitis
Hello What is the order with which SparkSQL deserializes parquet fields? Is it possible to modify it? I am using SparkSQL to query a parquet file that consists of a lot of fields (around 30 or so). Let me call an example table MyTable and let's suppose the name of one of its fields is position.