Re: HIVE NOT EXISTS

2012-10-01 Thread Mohit Anchlia
b.pagename IS NULL Hope that helps, Matt On Mon, Oct 1, 2012 at 4:42 PM, Mohit Anchlia mohitanch...@gmail.com wrote: Could someone help me understand what alternative do I have for this query? I am trying to check if a given row exists in the table. select
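A minimal sketch of the pattern the reply above appears to describe: a LEFT OUTER JOIN filtered on `b.pagename IS NULL`, which returns the rows of one table that have no match in another. Only the `b.pagename IS NULL` predicate comes from the thread. The table names `visits` and `known_pages`, the sample rows, and the use of SQLite through Python (so the example runs without a Hive cluster) are all assumptions made for illustration.

```python
import sqlite3

# In-memory SQLite database standing in for Hive (assumption: illustration only).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE visits (pagename TEXT);       -- hypothetical table "a"
CREATE TABLE known_pages (pagename TEXT);  -- hypothetical table "b"
INSERT INTO visits VALUES ('home'), ('cart'), ('search');
INSERT INTO known_pages VALUES ('home'), ('search');
""")

# Anti-join: keep rows from "a" for which the outer join found no row in "b".
missing = conn.execute("""
SELECT a.pagename
FROM visits a
LEFT OUTER JOIN known_pages b ON a.pagename = b.pagename
WHERE b.pagename IS NULL
""").fetchall()

print(missing)
```

The join condition uses only equality, so the same SELECT shape should carry over to HiveQL, though that has not been checked against a specific Hive release here.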

SQL help

2012-05-24 Thread Mohit Anchlia
I am new to Hive. I have several SQL from RDBMS database that I need to convert to hive. What's the best reference for HIVEQL? For now I am trying to figure out how to do this in hive: Select distinct A_ID, First_Value(path IGNORE NULLS) over(PARTITION BY A_ID ORDER BY t_timestamp) From WEB_DATA

Re: SQL help

2012-05-24 Thread Mohit Anchlia
input near 'select' 'min' '(' in expression specification On Thu, May 24, 2012 at 1:02 PM, Mohit Anchlia mohitanch...@gmail.com wrote: I am new to Hive. I have several SQL from RDBMS database that I need to convert to hive. What's the best reference for HIVEQL? For now I am trying to figure out

Re: SQL help

2012-05-24 Thread Mohit Anchlia
on it? Edward On Thu, May 24, 2012 at 5:13 PM, Mohit Anchlia mohitanch...@gmail.com wrote: I am now trying to do it this way but doesn't work in hive. I think I am missing something here, can someone please help? select a_id from web_data t1 where a_id = (select min(a_id) from web_data t2

Re: SQL help

2012-05-24 Thread Mohit Anchlia
guess do some kind of group by and store it in intermediate file and run another select on it? Yes, that is my recommendation. On Thu, May 24, 2012 at 2:57 PM, Mohit Anchlia mohitanch...@gmail.com wrote: On Thu, May 24, 2012 at 2:19 PM, Edward Capriolo edlinuxg...@gmail.com wrote: Hive
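One way the recommended two-step approach might look, as a rough sketch: a GROUP BY builds an intermediate table holding the earliest timestamp per `a_id`, and a second SELECT joins back to pick up the matching `path`. The names `web_data`, `a_id`, `path` and `t_timestamp` come from the thread. The intermediate table name `first_ts`, the sample rows, and the use of SQLite through Python in place of Hive are assumptions. Filtering out NULL paths before grouping is one possible reading of `IGNORE NULLS` in the original query.

```python
import sqlite3

# In-memory SQLite database standing in for Hive (assumption: illustration only).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE web_data (a_id INTEGER, path TEXT, t_timestamp INTEGER);
INSERT INTO web_data VALUES
  (1, NULL, 10), (1, '/home', 20), (1, '/cart', 30),
  (2, '/search', 5), (2, '/home', 15);

-- Step 1: intermediate result with the earliest non-null-path timestamp per a_id.
CREATE TABLE first_ts AS
  SELECT a_id, MIN(t_timestamp) AS min_ts
  FROM web_data
  WHERE path IS NOT NULL
  GROUP BY a_id;
""")

# Step 2: a second select joins back to fetch the path at that timestamp.
first_paths = conn.execute("""
SELECT w.a_id, w.path
FROM web_data w
JOIN first_ts f ON w.a_id = f.a_id AND w.t_timestamp = f.min_ts
WHERE w.path IS NOT NULL
ORDER BY w.a_id
""").fetchall()

print(first_paths)
```

If two rows for the same `a_id` share the minimum timestamp, this sketch returns both, so a tie-breaking rule would be needed to match `First_Value` exactly.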

Local hadoop

2012-05-23 Thread Mohit Anchlia
When I launch simple SQL I see local hadoop. And when I do hadoop job fs -list in my hadoop cluster I don't see any jobs. Am I doing something wrong here? # hive Hive history file=/tmp/root/hive_job_log_root_201205231 Execution log at:

Re: Local hadoop

2012-05-23 Thread Mohit Anchlia
others. Be careful here. If you have the default settings your dfs.replication would be 1 or you might get some other nasty surprises. On Wed, May 23, 2012 at 7:59 PM, Mohit Anchlia mohitanch...@gmail.com wrote: Thanks. I used --hiveconf to set the jobtracker and it worked. On Wed, May 23

RCFile and UDF

2012-05-22 Thread Mohit Anchlia
I am new to Hive. Currently I am trying out one of the use cases where we write XML files into a sequence file. We then read the sequence file and convert it into a more structured row/column format using a Pig UDF. This is currently being stored with Snappy compression. Now what I want to do is use hive