Re: Why two map stages for a simple select query?

2010-08-13 Thread Leo Alekseyev
Ning, thanks -- I can indeed force a map-only task with hive.merge.mapfiles=false. However, I'm still curious what determines whether the merge MR job is run. In my original message I gave two sample queries; I believe hive.merge.mapfiles was set to true for both of them. But for the firs

Re: Why two map stages for a simple select query?

2010-08-13 Thread Ning Zhang
The second map-reduce job is probably the merge job, which takes the output of the first map-only job (the real query) and merges the resulting files. The merge job is not always triggered. If you look at the plan you may find it is a child of a conditional task, which means it is conditionally tr
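
A minimal sketch of how one might check this, reusing the query and table names from the original message in this thread. It assumes those tables exist in the current database. The plan output is not reproduced here, since it depends on the Hive version and session settings.

```sql
-- Inspect the plan and look for a conditional task whose child is the merge job.
EXPLAIN
INSERT OVERWRITE TABLE alogs_test_extracted1
SELECT raw.client_ip, raw.cookie, raw.referrer_flag
FROM alogs_test_rc6 raw;

-- Turn off merging of the files produced by map-only jobs for this session,
-- then re-run the EXPLAIN above to compare the two plans.
SET hive.merge.mapfiles=false;
```

Comparing the two plans should show whether the second stage is the conditional merge task described above.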

Why two map stages for a simple select query?

2010-08-13 Thread Leo Alekseyev
Hi all, I'm mystified by Hive's behavior for two types of queries. 1: consider the following simple select query: insert overwrite table alogs_test_extracted1 select raw.client_ip, raw.cookie, raw.referrer_flag from alogs_test_rc6 raw; Both tables are stored as rcfiles, and LZO compression is turn

RE: what is difference hive local model and standalone model.

2010-08-13 Thread Joydeep Sen Sarma
Lei - not sure I understand the question. I recently tried to document the relationship between Hive, MR and local mode at http://wiki.apache.org/hadoop/Hive/GettingStarted#Hive.2C_Map-Reduce_and_Local-Mode. Perhaps you have already read it. Regarding whether local mode can be run on windows or

Re: Filter Operator appear twice?

2010-08-13 Thread John Sichi
Coincidentally, Amareshwari recently noticed the same thing on hive-dev and logged it here: https://issues.apache.org/jira/browse/HIVE-1538 JVS On Aug 13, 2010, at 8:21 AM, Namit Jain wrote: > Currently, the hive optimizer tries to push the filter up, but probably does > not remove the origi

Re: Filter Operator appear twice?

2010-08-13 Thread Yingyi Bu
Ok, thanks a lot, Namit! Yingyi 2010/8/13 Namit Jain : > Currently, the hive optimizer tries to push the filter up, but probably does > not remove the original filter. > You can file a jira for that. > > > Thanks, > -namit > > > From: Yingyi Bu [buyin...

what is difference hive local model and standalone model.

2010-08-13 Thread lei liu
What is the difference between Hive local mode and standalone mode? Can Hive local mode be run on Windows?

RE: Filter Operator appear twice?

2010-08-13 Thread Namit Jain
Currently, the hive optimizer tries to push the filter up, but probably does not remove the original filter. You can file a jira for that. Thanks, -namit From: Yingyi Bu [buyin...@gmail.com] Sent: Friday, August 13, 2010 7:52 AM To: hive-user@hadoop.apa

Filter Operator appear twice?

2010-08-13 Thread Yingyi Bu
Hi, Consider the following query: INSERT OVERWRITE TABLE q1_pricing_summary_report SELECT L_RETURNFLAG, L_LINESTATUS, SUM(L_QUANTITY), SUM(L_EXTENDEDPRICE), SUM(L_EXTENDEDPRICE*(1-L_DISCOUNT)), SUM(L_EXTENDEDPRICE*(1-L_DISCOUNT)*(1+L_TAX)), AVG(L_QUANTITY), AVG(L_EXTENDEDPRICE), AVG(L_DISCOUNT),

how to support chinese in hive

2010-08-13 Thread shangan
Hi all, could anyone tell me how to configure Hive to support Chinese characters? Also, when using HWI, how do I configure the directory for the result file? By default it is currently the 'conf' directory under my installation path. 2010-08-13 shangan