RE: NEED HELP in Hive Query
Thanks John :-), I got it now in Pig also :-). A = load '/File/00_0' using PigStorage('\u0001') as as (name, date, url, hit:INT); B = group A by (id, name, date, url); C = foreach B generate flatten(A.id), flatten(A.name), flatten(A.url), SUM(A.hit) ; D = distinct C; Dump D; Thanks & Regards Yogesh Kumar Dhari From: j...@omernik.com Date: Sun, 14 Oct 2012 12:29:23 -0500 Subject: Re: NEED HELP in Hive Query To: user@hive.apache.org select NAME, DATE, URL, SUM(HITCOUNT) as HITCOUNT from yourtable group by NAME, DATE, URL That's the HIVE answer. Not sure the PIG answer. On Sun, Oct 14, 2012 at 9:54 AM, yogesh dhari wrote: Hi all, I have this file. I want this operation to perform in HIVE & PIG NAME DATE URL HITCOUNT timesascent.in2008-08-27 http://timesascent.in/index.aspx?page=tparchives15 timesascent.in2008-08-27 http://timesascent.in/index.aspx?page=article§id=1&contentid=200812182008121814134447219270b26 20 timesascent.in2008-08-27http://timesascent.in/37 timesascent.in2008-08-27http://timesascent.in/section/39/Job%20Wise 14 timesascent.in2008-08-27 http://timesascent.in/article/7/2011062120110621171709769aacc537/Work-environment--Employee-productivity.html 20 timesascent.in2008-08-27http://timesascent.in/17 timesascent.in2008-08-27http://timesascent.in/section/2/Interviews 15 timesascent.in2008-08-27http://timesascent.in/17 timesascent.in2008-08-27http://timesascent.in/27 timesascent.in2008-08-27http://timesascent.in/37 timesascent.in2008-08-27http://timesascent.in/27 timesascent.in2008-08-27http://www.timesascent.in/16 timesascent.in2008-08-27http://timesascent.in/section/2/Interviews 14 timesascent.in2008-08-27http://timesascent.in/14 timesascent.in2008-08-27http://timesascent.in/22 I want to add all HITCOUNT for the same NAME, DATE & URL like timesascent.in2008-08-27http://timesascent.in/(addition of all hitcount under same name, date, url (37+17+17+27+)) Please suggest me is there any method to perform this query. Thanks & Regards Yogesh Kumar
Re: NEED HELP in Hive Query
select NAME, DATE, URL, SUM(HITCOUNT) as HITCOUNT from yourtable group by NAME, DATE, URL That's the HIVE answer. Not sure the PIG answer. On Sun, Oct 14, 2012 at 9:54 AM, yogesh dhari wrote: > Hi all, > > I have this file. I want this operation to perform in *HIVE & PIG* > > NAME DATE > URL > HITCOUNT >timesascent.in2008-08-27 > http://timesascent.in/index.aspx?page=tparchives15 > timesascent.in2008-08-27 > http://timesascent.in/index.aspx?page=article§id=1&contentid=200812182008121814134447219270b26 > 20 > timesascent.in2008-08-27http://timesascent.in/37 > timesascent.in2008-08-27 > http://timesascent.in/section/39/Job%20Wise14 > timesascent.in2008-08-27 > http://timesascent.in/article/7/2011062120110621171709769aacc537/Work-environment--Employee-productivity.html > 20 > timesascent.in2008-08-27http://timesascent.in/17 > timesascent.in2008-08-27 > http://timesascent.in/section/2/Interviews15 > timesascent.in2008-08-27http://timesascent.in/17 >timesascent.in2008-08-27http://timesascent.in/27 > timesascent.in2008-08-27http://timesascent.in/37 > timesascent.in2008-08-27http://timesascent.in/27 > timesascent.in2008-08-27http://www.timesascent.in/16 > timesascent.in2008-08-27 > http://timesascent.in/section/2/Interviews14 > timesascent.in2008-08-27http://timesascent.in/14 > timesascent.in2008-08-27http://timesascent.in/22 > > > I want to *add all HITCOUNT for the same NAME, DATE & URL * > > like > > timesascent.in2008-08-27http://timesascent.in/(addition of > all hitcount under same name, date, url (37+17+17+27+)) > > Please suggest me is there any method to perform this query. > > > Thanks & Regards > Yogesh Kumar > > > >
NEED HELP in Hive Query
Hi all, I have this file. I want this operation to perform in HIVE & PIG NAME DATE URL HITCOUNT timesascent.in2008-08-27 http://timesascent.in/index.aspx?page=tparchives15 timesascent.in2008-08-27 http://timesascent.in/index.aspx?page=article§id=1&contentid=200812182008121814134447219270b26 20 timesascent.in2008-08-27http://timesascent.in/37 timesascent.in2008-08-27http://timesascent.in/section/39/Job%20Wise 14 timesascent.in2008-08-27 http://timesascent.in/article/7/2011062120110621171709769aacc537/Work-environment--Employee-productivity.html 20 timesascent.in2008-08-27http://timesascent.in/17 timesascent.in2008-08-27http://timesascent.in/section/2/Interviews 15 timesascent.in2008-08-27http://timesascent.in/17 timesascent.in2008-08-27http://timesascent.in/27 timesascent.in2008-08-27http://timesascent.in/37 timesascent.in2008-08-27http://timesascent.in/27 timesascent.in2008-08-27http://www.timesascent.in/16 timesascent.in2008-08-27http://timesascent.in/section/2/Interviews 14 timesascent.in2008-08-27http://timesascent.in/14 timesascent.in2008-08-27http://timesascent.in/22 I want to add all HITCOUNT for the same NAME, DATE & URL like timesascent.in2008-08-27http://timesascent.in/(addition of all hitcount under same name, date, url (37+17+17+27+)) Please suggest me is there any method to perform this query. Thanks & Regards Yogesh Kumar