Re: Inconsistent results from INSERT OVERWRITE TABLE

2011-05-11 Thread Tim Spence
Thank you. Are there tools for parsing the Hive logs for errors? If not, can you talk about the strategy used at Facebook to deal with detection and resolution of MR errors? Perhaps I can write a script to identify errors. First I have to solve the mystery of why there are no logs on my hadoop

Re: Inconsistent results from INSERT OVERWRITE TABLE

2011-05-11 Thread Ning Zhang
Hive queries are compiled to different types tasks (MapReduce, MoveTask, etc), so a successful MR task as indicated in the JT doesn't mean the whole query succeeded. So you need to examine the status of the hive query to see if it succeeded or not. You can also check the hive's log file under /

Inconsistent results from INSERT OVERWRITE TABLE

2011-05-11 Thread Tim Spence
I've been using Hive in production for two months now. We're mainly using it for processing server logs, about 1-2GB per day (2-2.5 million requests). Typically we import a day's worth of logs at once. That said, sometimes we decide to tweak a calculated column. When that happens, we modify our