JOIN + LATERAL VIEW works, but + MAPJOIN and no longer get any results

2012-05-22 Thread Ruben de Vries
Okay first off; I know JOIN + LATERAL VIEW together isn't working so I moved my JOIN into a subquery and that makes the query work properly However when I added a MAPJOIN hint for the JOIN in the subquery it will also stop doing the reducer for the main query! This only happens when there's a

start hive cli error

2012-05-22 Thread Dimboo Zhu
hi there, I got the following trace stack when startuping hive cli. It worked=20 well last week when i just installed it. Anybody can help? thanks, Dianbau [dzhu@bbdw-194 bin]$ ./hive Logging initialized using configuration in j=

Re: start hive cli error

2012-05-22 Thread Nitin Pawar
error is due to default encoding. hive supports UTF-8 based encoding but somehow your hive setup is picking up GB2312. can you provide the output of locale command? Thanks, Nitin On Tue, May 22, 2012 at 3:17 PM, Dimboo Zhu dianbo@gmail.com wrote: hi there, I got the following trace

from-insert-select trouble

2012-05-22 Thread Avdeev V . M .
Hello! I'm very new to the world Hadoop and Hive so I cannot solve a problem that I encountered. Hadoop has been deployed on a single-node in a pseudo-distributed mode. I'm trying to copy data from one table to another. Source table created by Sqoop, destination table created by query

Re: Re: start hive cli error

2012-05-22 Thread dianbo . zhu
Hi Nitin, i reinstalled and did not modify anything, but it also can not work. It worked well when i first ran it months ago. the output of locale command is below: LANG=zh_CN LC_CTYPE=zh_CN LC_NUMERIC=zh_CN LC_TIME=zh_CN LC_COLLATE=zh_CN LC_MONETARY=zh_CN LC_MESSAGES=zh_CN LC_PAPER=zh_CN

AbstractMethodError while using serde

2012-05-22 Thread Sumit Kumar
Hi all, I'm using csv-serde code ( https://github.com/ogrodnek/csv-serde ) with hadoop 0.20.205 and hive 0.7.1 and running into the following issue 2012-05-22 15:51:44,354 WARN org.apache.hadoop.mapred.Child: Error running child java.lang.RuntimeException: java.lang.AbstractMethodError:

Re[2]: from-insert-select trouble

2012-05-22 Thread Avdeev V . M .
Found. 2012-05-22 17:52:47,117 FATAL org.apache.hadoop.mapred.Child: Error running child : java.lang.OutOfMemoryError: Java heap space at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$Packet.init(DFSClient.java:2790) at

Re: Re[2]: from-insert-select trouble

2012-05-22 Thread Bejoy KS
Great, good catch.. Not enough child heap size available to process your data volume. If you have free memory available just increase child.opts memory and it may pass through as well. Regards Bejoy KS Sent from handheld, please excuse typos. -Original Message- From: Avdeev V. M.

Re: Hive + Cassandra

2012-05-22 Thread Edward Capriolo
There are actually several storage handlers floating out in the wild for Mongo, Cassandra, HyperTable, etc. Your best bet is referencing the Cassandra ticket and building/using the brisk/dse code. Eventually the code will make it's way into Cassandra. It is a shame the code has been in production

Condition for doing a sort merge bucket map join

2012-05-22 Thread Bruce Bian
Hi , I've got 7 large tables to join(each ~10G in size) into one table, all with the same* 2 *join keys, I've read some documents on sort merge bucket map join, but failed to fire that. I've bucketed all the 7 tables into 20 buckets and sorted by one of the join key, set

Re: Condition for doing a sort merge bucket map join

2012-05-22 Thread Mark Grover
Hi Bruce, Instead of joining 7 tables in the query, can you please start off with 2 tables and see if that works? If it doesn't, feel free to paste your table definitions and join query along with any properties you are setting and folks on the mailing list can take a jab at it. Mark -

RCFile and UDF

2012-05-22 Thread Mohit Anchlia
I am new to Hive. Currently I am trying out one of the use cases where we write xml files into a sequence file. We then read the sequence file and convert it into more structured row, col format using pig udf. This is currently being stored as snapp compression. Now what I want to do is use hive

protobuf 2.4.1 and ObjectInspector

2012-05-22 Thread kulkarni.swar...@gmail.com
I am trying to use the ReflectionStructObjectInspector to extract fields from a protobuf generated from 2.4.1 compiler. I am seeing that reflections fails to extract fields out of the generated protobuf class. Specifically, this code snippet: public static Field[]

Map side aggregations

2012-05-22 Thread Raghunath, Ranjith
I have the parameter hive.map.aggr set to true. However, when I look at the counters associated with the map tasks I notice the following Combine input records 0. I am interpreting this as a failure to perform the map side aggregation. Is that accurate? Is this option not working in hive 0.7.1?

RE: Re: start hive cli error

2012-05-22 Thread Hezhiqiang (Ransom)
Is it your linux cosole problem? You changed SecureCRT or putty charset “GB2312”? Best regards Ransom. From: dianbo.zhu [mailto:dianbo@gmail.com] Sent: Tuesday, May 22, 2012 6:48 PM To: user Subject: Re: Re: start hive cli error Hi Nitin, i reinstalled and did not modify anything, but it

Re: Map side aggregations

2012-05-22 Thread Tucker, Matt
Try setting hive.auto.convert.join to true. The CLI will have a local task before it starts a map-reduce job on the cluster. Matt On May 22, 2012, at 8:43 PM, Raghunath, Ranjith ranjith.raghuna...@usaa.commailto:ranjith.raghuna...@usaa.com wrote: I have the parameter hive.map.aggr set to

Want to give a short talk at the next Hive User Group meetup?

2012-05-22 Thread Carl Steinbach
Hi, I just wanted to remind everyone that the next Hive User Group meetup is happening on June 12th (the day before the Hadoop Summit) in San Jose. More details about the meetup can be found on the Hive User Group page located here: http://www.meetup.com/Hive-User-Group-Meeting/events/62458462/

Re: Map side aggregations

2012-05-22 Thread Ranjith
Thanks Matt. I am not performing a join so does that matter? What does this local task do? Thanks, Ranjith On May 22, 2012, at 8:17 PM, Tucker, Matt matt.tuc...@disney.com wrote: Try setting hive.auto.convert.join to true. The CLI will have a local task before it starts a map-reduce job on