Re: Spark SQL with a sorted file

2014-12-22 Thread Jerry Raj
or use the ParquetRelation2 for workaround. (https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/parquet/newParquet.scala) Cheng Hao -----Original Message----- From: Jerry Raj [mailto:jerry@gmail.com

Spark SQL DSL for joins?

2014-12-16 Thread Jerry Raj
Hi, I'm using the Scala DSL for Spark SQL, but I'm not able to do joins. I have two tables (backed by Parquet files) and I need to do a join across them using a common field (user_id). This works fine using standard SQL but not using the language-integrated DSL; neither t1.join(t2, on =

Re: Spark SQL DSL for joins?

2014-12-16 Thread Jerry Raj
Another problem with the DSL: t1.where('term == "dmin").count() returns zero. But sqlCtx.sql("select * from t1 where term = 'dmin'").count() returns 700, which I know is correct from the data. Is there something wrong with how I'm using the DSL? Thanks On 17/12/14 11:13 am, Jerry Raj wrote

Spark SQL UDF returning a list?

2014-12-03 Thread Jerry Raj
Hi, Can a UDF return a list of values that can be used in a WHERE clause? Something like: sqlCtx.registerFunction("myudf", { Array(1, 2, 3) }) val sql = "select doc_id, doc_value from doc_table where doc_id in myudf()" This does not work: Exception in thread "main"

Spark SQL with a sorted file

2014-12-03 Thread Jerry Raj
Hi, If I create a SchemaRDD from a file that I know is sorted on a certain field, is it possible to somehow pass that information on to Spark SQL so that SQL queries referencing that field are optimized? Thanks -Jerry

Re: [silk] (no subject)

2011-06-21 Thread Jerry Raj
On 21/06/11 11:43 AM, Venkat Mangudi wrote: On Tuesday 21 June 2011 11:24 AM, Biju Chacko wrote: Sorry to go off on a tangent, but: So not agreeing with you. Incidentally, what do you think of this: I'm unlurking because I had to say

[ug-bosug] Inconsistent File System structure- so consistently

2008-08-23 Thread Jerry Raj
Possibly Linux is using the Solaris partition as swap. The partition type ID 0x82 is used to denote a Solaris partition, but Linux uses the same ID to denote a swap partition. You could check /etc/fstab on Linux to see what it's using for swap. -Jerry Amit k. Saha wrote: Hi! Okay, so

[ug-bosug] porting libntp .. AC_FUNC_MALLOC error

2007-08-10 Thread Jerry Raj
On 09/08/07 11:34, Anil Gulecha wrote: Hi all, I was trying to port the libntp library that allows talking to Creative (and other) mp3 players. The source consists of the libraries and a sample application. I had the initial -Wall errors that I corrected and the library has built fine.