Re: percentile_approx slowness

2014-10-02 Thread Kevin Weiler
be put into a new table? -- Kevin Weiler IT IMC Financial Markets | 233 S. Wacker Drive, Suite 4300 | Chicago, IL 60606 | http://imc-chicago.com/ Phone: +1 312-204-7439 | Fax: +1 312-244-3301 | E-Mail: kevin.wei...@imc-chicago.com<mailto:kevin.wei...@imc-chicago.com> On Sep 25, 2014, at 3

percentile_approx slowness

2014-09-25 Thread Kevin Weiler
13 hours to complete (in the case of 100,000 as my sampling rate). Are there some hive settings I should be investigating to see if I can have this query complete in a reasonable time? -- Kevin Weiler IT IMC Financial Markets | 233 S. Wacker Drive, Suite 4300 | Chicago, IL 60606 | http://imc

Re: Remove duplicate records in Hive

2014-09-10 Thread Kevin Weiler
Whoops, thought this was someone in my office, so obviously you can’t come see me :) -- Kevin Weiler IT IMC Financial Markets | 233 S. Wacker Drive, Suite 4300 | Chicago, IL 60606 | http://imc-chicago.com/ Phone: +1 312-204-7439 | Fax: +1 312-244-3301 | E-Mail: kevin.wei...@imc-chicago.com

Re: Remove duplicate records in Hive

2014-09-10 Thread Kevin Weiler
If you can just query the table for your results, you can do a SELECT DISTINCT instead of just a SELECT. If you give me a bit more information about where the duplicate data is coming from, I can provide a bit more detail. You can come see me on the end of desk. -- Kevin Weiler IT IMC

Re: hive query with in statement

2014-08-13 Thread Kevin Weiler
This is a relatively old stack overflow post. I’m not sure what version you guys are using, but IN seems to work just fine for me. -- Kevin Weiler IT IMC Financial Markets | 233 S. Wacker Drive, Suite 4300 | Chicago, IL 60606 | http://imc-chicago.com/ Phone: +1 312-204-7439 | Fax: +1 312-244

case statement in SELECT TRANSFORM

2014-07-30 Thread Kevin Weiler
Is it possible to have CASE or SUM statements inside of a TRANSFORM selection? When I do it now, I get the following error: FAILED: ParseException line 41:10 mismatched input 'AS' expecting ) near 'END' in transform clause -- Kevin Weiler IT IMC Financial Markets | 233 S.

Re: python UDF and Avro tables

2014-07-29 Thread Kevin Weiler
are passed through STDIN and delimited by “\t”. I simply needed to split on this field in my python UDF and it worked. Thanks! -- Kevin Weiler IT IMC Financial Markets | 233 S. Wacker Drive, Suite 4300 | Chicago, IL 60606 | http://imc-chicago.com/ Phone: +1 312-204-7439 | Fax: +1 312-244-3301

python UDF and Avro tables

2014-07-24 Thread Kevin Weiler
troller.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548) at org.apache.hadoop.mapred.Child.main(Child.java:262) -- Kevin Weiler IT IMC Financial Markets |