something.
Regards,
--
Rohan Monga
sorry for the delayed response.
yes the python script follows that.
--
Rohan Monga
On Tue, Jan 14, 2014 at 4:31 PM, Stephen Sprague sprag...@gmail.com wrote:
@OP - first thing i'd ask is does your python script obey the ^A,^B,^C,^D
etc. nesting delimiter pattern. give that your create table
cluster by g1 ) m
insert overwrite table t1
reduce m.g1, m.g2, m.g3
using 'python customScript.py' as ( f1 , f2 );
however f2 is not being loaded properly into t1, it comes up broken or
null. What should I do so that f2 is loaded as an array of structs.
Thanks,
--
Rohan Monga
thanks Mark,
I ended up going the custom reducer way. I will try out the query you have
sent.
Regards,
--
Rohan Monga
On Wed, Feb 1, 2012 at 11:06 AM, Mark Grover mgro...@oanda.com wrote:
Rohan,
You could do it one of the following ways:
1) Write a UDAF that does the avg(f2 - avg_f2
)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:197)
/snip
Is there a workaround ? I have tried with hive 0.7.1 and 0.8
Thanks
--
Rohan Monga
this to work?
Thanks,
--
Rohan Monga
On Fri, Jan 20, 2012 at 12:51 PM, Edward Capriolo edlinuxg...@gmail.comwrote:
I think if you are grouping by b, b has to be in your select list. Try
this.
FROM (
select b,count(a) as theCount from table one group by b
) a select mean(theCount);
I think
and the information being lost ] :)
But why the nullpointer exception?
Regards,
--
Rohan Monga
On Fri, Jan 20, 2012 at 2:32 PM, Edward Capriolo edlinuxg...@gmail.comwrote:
IMHO You can not possibly nest the percentile calculation because the
results would be meaningless. percentile has
i would also recommend cloudera tutorials.
--
Rohan Monga
On Tue, Dec 13, 2011 at 3:50 AM, Mohammad Tariq donta...@gmail.com wrote:
Hi Praveenesh,
If you want something other than the wiki, you can use books
like Hadoop definitive guide or Hadoop in action...These books
have
Yup, sub-queries don't work in the where clause.
Is there another document and/or JIRA that i can look at for the
implementation of IN clause?
Regards,
--
Rohan Monga
On Mon, Dec 12, 2011 at 10:19 PM, Igor Tatarinov i...@decide.com wrote:
I think the doc refers to an IN subquery
WHERE x
where row in (1,2,3);
/snip
work. and I found this JIRA
https://issues.apache.org/jira/browse/HIVE-801which kinda shows that
it should work.
Was this feature deprecated in the later version ( 0.7.1) or is the
documentation out of sync?
Thanks,
--
Rohan Monga
,
--
Rohan Monga
On Thu, Nov 17, 2011 at 2:02 PM, Mark Grover mgro...@oanda.com wrote:
Rohan,
The short answer is: I don't know:-) If you could paste the log, I or someone
else of the mailing list could be able to help.
BTW, What version of Hive were you using? Did you set the threshold before
Hi Prajakta,
I get it now. But are there plans to support groupbys of that format,
or would I be able to write a UDF of sorts to do that?
Also, could you please give me an example of accelerating equi-joins
with indices, I haven't been able to get that to work either :(
Regards,
--
Rohan Monga
joins?
And for the semantic analysis thing, I sent the error in my first
email, here are the lines that I modified to get it to work. Basically
changed _count_Of to _count_of
Regards,
--
Rohan Monga
On Wed, Sep 21, 2011 at 4:22 AM, Prajakta Kalmegh pkalm...@gmail.com wrote:
Hi Rohan
I run
mo.c1, mo.c2, mo.c3
using 'python convert.py'
as x1, x2;
Say, the missing record was for c1=1, then with this query, the record
shows up in mytable2. Is there something I am missing ?
Regards,
--
Rohan Monga
';' before removing
comments? or am I missing something?
Thanks,
--
Rohan Monga
15 matches
Mail list logo