回复: BUG IN HIVE-4650 seems not fixed

2013-07-29 Thread wzc1989
Hi: I attach the output of EXPLAIN, and the hive I use is compiled from trunk and my hadoop version is 1.0.1. I use default hive configuration. -- wzc1...@gmail.com 已使用 Sparrow (http://www.sparrowmailapp.com/?sig) 已使用 Sparrow (http://www.sparrowmailapp.com/?sig) 在

Templeton create table with custom inputformat

2013-07-29 Thread Peter Marron
Hi, (I'm a little bit behind in reading the lists, so apologies if this is a duplicate question.) I am running Templeton v1 (?) and HCatalog 0.5.0 with hive 0.11.0 over Hadoop 1.0.4. I can use something like this: curl -s -X PUT -HContent-type:application/json -d @createtable.json

Union support in HCatalog

2013-07-29 Thread Neha Tomar
Hi All, We have union data type support with Hive (tried Hive 0.9.0). However, HCatalog 0.5.0 does not support it. I looked into the HCatalog code org.apache.hcatalog.data.schema.HCatFieldSchema.Type to confirm this. It does not list union data type. Please let me know if my understanding is not

Re: PL/SQL to HiveQL translation

2013-07-29 Thread Jérôme Verdier
Hi, Thanks everyone for your help. Has anyone have a good tutorial to run Hive queries and scripts with Java (over Eclipse). I have some Java Development basis but i'm pretty new using Hive with Java/Eclipse. Thanks. 2013/7/25 j.barrett Strausser j.barrett.straus...@gmail.com The advice I

Composite blob key mapping in hive

2013-07-29 Thread G.S.Vijay Raajaa
Hi, I would like to know if it is possible to map a composite key stored as blob in HBase to Hive?? Regards, Vijay Raajaa G S

Re: Composite blob key mapping in hive

2013-07-29 Thread kulkarni . swarnim
Hi, Please refer to the workaround posted on HIVE-2599 and let me know if that works for you. On Jul 29, 2013, at 6:22 AM, G.S.Vijay Raajaa gsvijayraa...@gmail.com wrote: Hi, I would like to know if it is possible to map a composite key stored as blob in HBase to Hive?? Regards,

BytesWritable

2013-07-29 Thread Clum, Josh
Hi, I wrote a custom serde that stores takes data and stores it in objects called DocumentWritable that inherits from Writable. When I try to run code to insert into the table that uses my serde, I get an error saying that DocumentWritable cannot be cast to BytesWritable. Does my

Re: PL/SQL to HiveQL translation

2013-07-29 Thread Brendan Heussler
Jerome, There is a really good page on the wiki: https://cwiki.apache.org/Hive/hiveserver2-clients.html I use the HiveServer2 JDBC driver. Maybe there are other ways? Brendan On Mon, Jul 29, 2013 at 5:47 AM, Jérôme Verdier verdier.jerom...@gmail.comwrote: Hi, Thanks everyone for your

Re: BytesWritable

2013-07-29 Thread Roberto Congiu
Hi Josh, it doesn't have to. However, your table does not specify any input format, which makes Hive use the default HiveIgnoreKeyTextOutputFormat (see

Re: BytesWritable

2013-07-29 Thread Clum, Josh
Thank you for the reply. I fixed it by using the stored as clause in my hive query. Everything is good. From: Roberto Congiu roberto.con...@openx.commailto:roberto.con...@openx.com Reply-To: user@hive.apache.orgmailto:user@hive.apache.org user@hive.apache.orgmailto:user@hive.apache.org Date:

Best Performance on Large Scale Join

2013-07-29 Thread Brad Ruderman
Hi All- I have 2 tables: CREATE TABLE users ( a bigint, b int ) CREATE TABLE products ( a bigint, c int ) Each table has about 8 billion records (roughly 2k files total mappers). I want to know the most performant way to do the following query: SELECT u.b, p.c,

Hive Metastore Server 0.9 Connection Reset and Connection Timeout errors

2013-07-29 Thread agateaaa
Hi All: We are running into frequent problem using HCatalog 0.4.1 (HIve Metastore Server 0.9) where we get connection reset or connection timeout errors. The hive metastore server has been allocated enough (12G) memory. This is a critical problem for us and would appreciate if anyone has any

Re: Best Performance on Large Scale Join

2013-07-29 Thread Nitin Pawar
Brad, whats the cluster capacity you have got? how many uniq values of a,b and c you have got individually in any of the one table? Is there any chance you can partition data? are there any columns you have on which you can create buckets? I have done joins having 10 billion records in one

Re: Hive Metastore Server 0.9 Connection Reset and Connection Timeout errors

2013-07-29 Thread Nitin Pawar
Is there any chance you can do a update on test environment with hcat-0.5 and hive-0(11 or 10) and see if you can reproduce the issue? We used to see this error when there was load on hcat server or some network issue connecting to the server(second one was rare occurrence) On Mon, Jul 29, 2013

Re: Best Performance on Large Scale Join

2013-07-29 Thread Michael Malak
Perhaps you can first create a temp table that contains only the records that will match?  See the UNION ALL trick at http://www.mail-archive.com/hive-user@hadoop.apache.org/msg01906.html From: Brad Ruderman bruder...@radiumone.com To: user@hive.apache.org

Re: Hive Metastore Server 0.9 Connection Reset and Connection Timeout errors

2013-07-29 Thread agateaaa
Thanks Nitin! We have simiar setup (identical hcatalog and hive server versions) on a another production environment and dont see any errors (its been running ok for a few months) Unfortunately we wont be able to move to hcat 0.5 and hive 0.11 or hive 0.10 soon. I did see that the last time we

Re: Best Performance on Large Scale Join

2013-07-29 Thread Brad Ruderman
Hi Michael and Nitin- Thanks for your response. Some things to note: Michael- I will definitely try this method, it looks interesting. Nitin - -Users Table and Product Tables are already unique. -I cannot partition the data, since the data is coming from already partitioned tables and I am doing

UDAF terminatePartial structure

2013-07-29 Thread Ritesh Agrawal
Hi all, I am writing my first UDAF. In my terminatePartial() function, I need to store different data having different data types. Below is a list of items that I need to store 1. C1 : list of doubles 2. C2: list of doubles 3. C3: double 4. Show: list of strings I am wondering can I use

Re: UDAF terminatePartial structure

2013-07-29 Thread Robin Morris
I believe a map will be passed correctly from the terminatePartial to the merge functions. But it seems a bit of overkill. Why not define a class within your UDAF which has 4 public data members, and return instances of that class from terminatePartial()? Robin On 7/29/13 3:19 PM, Ritesh

Re: UDAF terminatePartial structure

2013-07-29 Thread Ritesh Agrawal
Hi Robin, Thanks for the suggestion. I did find such an example in Hadoop The definitive guide book. However I am not total confused. The book extends UDAF instead of AbstractGenericUDAFResolver. Which one is recommended ? Also the example in the book uses DoubleWritable as a return type

Re: UDAF terminatePartial structure

2013-07-29 Thread Igor Tatarinov
I found this Cloudera example helpful: http://grepcode.com/file/repository.cloudera.com/content/repositories/releases/org.apache.hadoop.hive/hive-contrib/0.7.0-cdh3u0/org/apache/hadoop/hive/contrib/udaf/example/UDAFExampleMaxMinNUtil.java#UDAFExampleMaxMinNUtil.Evaluator igor decide.com On

Possible release date for Hive 0.12.0 ?

2013-07-29 Thread Sanjay Subramanian
Hi guys When is stable Hive 0.12.0 expected I have a use case that needs this fixed and looks like its fixed in 0.12.0 https://issues.apache.org/jira/browse/HIVE-3603 Sanjay CONFIDENTIALITY NOTICE == This email message and any attachments are for the exclusive use of

Re: Hive Metastore Server 0.9 Connection Reset and Connection Timeout errors

2013-07-29 Thread agateaaa
Looking at the hive metastore server logs see errors like these: 2013-07-26 06:34:52,853 ERROR server.TThreadPoolServer (TThreadPoolServer.java:run(182)) - Error occurred during processing of message. java.lang.NullPointerException at

Re: Hive Metastore Server 0.9 Connection Reset and Connection Timeout errors

2013-07-29 Thread agateaaa
Looking at the hive metastore server logs see errors like these: 2013-07-26 06:34:52,853 ERROR server.TThreadPoolServer (TThreadPoolServer.java:run(182)) - Error occurred during processing of message. java.lang.NullPointerException at

Tablesample doubling

2013-07-29 Thread j.barrett Strausser
Hello All, Why does TABLESAMPLE(N rows) produce ouptut with 2*N rows? I have the following script: DROP TABLE IF EXISTS sparse_features_small; CREATE TABLE sparse_features_small ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' LINES TERMINATED BY '\n' as SELECT * FROM

Re: Tablesample doubling

2013-07-29 Thread j.barrett Strausser
SELECT COUNT(*) FROM sparse_features_small; And I receive back : Total MapReduce CPU Time Spent: 3 seconds 330 msec OK 10 Rather than the expected 5 I am running hive 11.2 On Mon, Jul 29, 2013 at 9:51 PM, j.barrett Strausser j.barrett.straus...@gmail.com wrote: Hello All, Why

Re: Tablesample doubling

2013-07-29 Thread j.barrett Strausser
Nevermind I see in the docs, it is rows PER SPLIT. -b On Mon, Jul 29, 2013 at 9:52 PM, j.barrett Strausser j.barrett.straus...@gmail.com wrote: SELECT COUNT(*) FROM sparse_features_small; And I receive back : Total MapReduce CPU Time Spent: 3 seconds 330 msec OK 10 Rather than

Re: Tablesample doubling

2013-07-29 Thread Stephen Sprague
+1 for documentation. sometimes it surprises you. :) On Mon, Jul 29, 2013 at 7:11 PM, j.barrett Strausser j.barrett.straus...@gmail.com wrote: Nevermind I see in the docs, it is rows PER SPLIT. -b On Mon, Jul 29, 2013 at 9:52 PM, j.barrett Strausser j.barrett.straus...@gmail.com

Re: UDAF terminatePartial structure

2013-07-29 Thread Ritesh Agrawal
Hi Robin,igor Thanks for the suggestion and links. Based on examples I found, below is my UDF. However, I am getting following error when trying to run it. Not sure what the error means = ERROR FAILED: Hive Internal Error:

Re: Composite blob key mapping in hive

2013-07-29 Thread G.S.Vijay Raajaa
Hi, Thanks for the reply. The workaround can help me if it is a composite string literal with separators in the same. I would like to know if it works with the following constraints: 1) Is it possible to map a composite key based on the length instead of separators. *eg: *Map first 10 bytes

Re: Composite blob key mapping in hive

2013-07-29 Thread kulkarni.swar...@gmail.com
Yes. It is possible to do that. The attached patch on the bug adds in a new HBaseCompositeKey class that consumers can extend to provide their own implementations. This will help hive understand their custom arrangement of the composite keys. If you can try the patch and let me know if it worked