Closing the loop: we identified the issue with help from the Tez team. A misconfigured mapreduce.reduce.cpu.vcores setting was causing the problem.
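For anyone who wants to inspect the same setting on their own cluster, here is a minimal sanity-check sketch. It is not the diagnosis from this thread, which does not record what the offending value was. It only compares mapreduce.reduce.cpu.vcores against two standard YARN vcore limits, on the assumption that a reducer request larger than those limits is worth a second look. The function names are hypothetical, and whether vcores affect scheduling at all depends on how the scheduler is configured.

```python
import xml.etree.ElementTree as ET


def read_properties(xml_text):
    """Parse the text of a Hadoop *-site.xml file into a {name: value} dict."""
    props = {}
    for prop in ET.fromstring(xml_text).findall("property"):
        name = prop.findtext("name")
        value = prop.findtext("value")
        if name and value is not None:
            props[name.strip()] = value.strip()
    return props


def check_reduce_vcores(mapred_props, yarn_props):
    """Return warnings if the reducer vcore request exceeds a YARN vcore limit.

    Assumption: a request above either limit is suspicious. This is a
    generic check, not the specific misconfiguration found in this thread.
    """
    key = "mapreduce.reduce.cpu.vcores"
    if key not in mapred_props:
        return []  # not set explicitly, so there is nothing to compare
    requested = int(mapred_props[key])
    warnings = []
    for limit_key in ("yarn.scheduler.maximum-allocation-vcores",
                      "yarn.nodemanager.resource.cpu-vcores"):
        if limit_key in yarn_props and requested > int(yarn_props[limit_key]):
            warnings.append(
                f"{key}={requested} exceeds {limit_key}={yarn_props[limit_key]}")
    return warnings
```

To use it, read the contents of the cluster's mapred-site.xml and yarn-site.xml (their location varies by distribution), pass each through read_properties, and print whatever check_reduce_vcores returns.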
If anyone tries Hive on Tez with EMR Hadoop and sees that reducers are stuck, this <http://mail-archives.apache.org/mod_mbox/incubator-tez-user/201312.mbox/%3Cc3f953c465a3b2f4b02d62348ce67161%40mail.gmail.com%3E> might be helpful.

Thanks,
Cheolsoo

On Fri, Dec 13, 2013 at 4:51 PM, Gunther Hagleitner <ghagleit...@hortonworks.com> wrote:

> dev on bcc
>
> Zhenxiao,
>
> Cool you got it set up.
>
> The query runs a full order by before the limit - are you sure it's not
> just still running? Hive on Tez prints "completed tasks/total tasks", so no
> update may mean none of the reduce tasks have finished yet.
>
> If not, it'd be great to see the yarn logs (yarn logs -applicationId
> <application id>) and get more info about the table you're using (size,
> file format, etc). If the logs are really big you might want to consider
> opening a jira (issues.apache.org) and attaching them there (or send them
> directly to me).
>
> There are a bunch of settings that might be of interest to you (in general,
> not just for this query) - I've attached a text doc with some details.
>
> Thanks,
> Gunther.
>
> On Fri, Dec 13, 2013 at 1:12 PM, Zhenxiao Luo <z...@netflix.com> wrote:
>
>> Hi,
>>
>> Excuse me. May I ask a question about running Hive on Tez?
>>
>> I've installed Hive on Tez and am running a simple query from hiveCli:
>>
>> hive> set hive.optimize.tez=true;
>> hive> select * from table order by title_id limit 5;
>>
>> Each time, I can see from the TezJobMonitor that all the map tasks are
>> done, but the reducer never gets started, and the job keeps running
>> forever.
>>
>> I tried a number of times, and each time the same failure (the job
>> hangs) happens again.
>> Has anyone successfully run queries using Hive on Tez? Are there
>> any tips or suggestions?
>>
>> Here is my job log:
>>
>> 13/12/13 20:57:31 INFO client.TezSession: Submitting dag to
>> TezSession, sessionName=HIVE-365b35bc-2461-4e2f-83f9-8da1fa356a86,
>> applicationId=application_1386896881353_0027
>> 13/12/13 20:57:33 INFO client.TezSession: Submitted dag to TezSession,
>> sessionName=HIVE-365b35bc-2461-4e2f-83f9-8da1fa356a86,
>> applicationId=application_1386896881353_0027,
>> dagId=dag_1386896881353_0027_1
>> 13/12/13 20:57:33 INFO client.RMProxy: Connecting to ResourceManager
>> at /10.183.195.180:9022
>> 13/12/13 20:57:33 INFO log.PerfLogger: </PERFLOG method=TezSubmitDag
>> start=1386968251250 end=1386968253338 duration=2088
>> from=org.apache.hadoop.hive.ql.exec.tez.TezTask>
>>
>>
>> 13/12/13 20:57:33 INFO tez.TezJobMonitor:
>>
>> 13/12/13 20:57:33 INFO log.PerfLogger: <PERFLOG method=TezRunDag
>> from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor>
>> 13/12/13 20:57:33 INFO log.PerfLogger: <PERFLOG
>> method=TezSubmitToRunningDag
>> from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor>
>> 13/12/13 20:57:33 INFO log.PerfLogger: </PERFLOG
>> method=TezSubmitToRunningDag start=1386968253341 end=1386968253402
>> duration=61 from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor>
>> Status: Running (application id: application_1386896881353_0027)
>>
>> 13/12/13 20:57:33 INFO tez.TezJobMonitor: Status: Running (application
>> id: application_1386896881353_0027)
>>
>> 13/12/13 20:57:33 INFO log.PerfLogger: <PERFLOG
>> method=TezRunVertex.Reducer 2
>> from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor>
>> 13/12/13 20:57:33 INFO log.PerfLogger: <PERFLOG
>> method=TezRunVertex.Map 1
>> from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor>
>> Map 1: -/- Reducer 2: -/-
>> 13/12/13 20:57:33 INFO tez.TezJobMonitor: Map 1: -/- Reducer 2: -/-
>> Map 1: -/- Reducer 2: 0/1
>> 13/12/13 20:57:33 INFO tez.TezJobMonitor: Map 1: -/- Reducer 2: 0/1
>> Map 1: 0/16 Reducer 2: 0/1
>> 13/12/13 20:57:34 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1
>> Map 1: 0/16 Reducer 2: 0/1
>> 13/12/13 20:57:37 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1
>> Map 1: 0/16 Reducer 2: 0/1
>> 13/12/13 20:57:40 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1
>> Map 1: 0/16 Reducer 2: 0/1
>> 13/12/13 20:57:43 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1
>> Map 1: 0/16 Reducer 2: 0/1
>> 13/12/13 20:57:46 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1
>> Map 1: 0/16 Reducer 2: 0/1
>> 13/12/13 20:57:49 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1
>> Map 1: 0/16 Reducer 2: 0/1
>> 13/12/13 20:57:52 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1
>> Map 1: 0/16 Reducer 2: 0/1
>> 13/12/13 20:57:55 INFO tez.TezJobMonitor: Map 1: 0/16 Reducer 2: 0/1
>> Map 1: 1/16 Reducer 2: 0/1
>> 13/12/13 20:57:56 INFO tez.TezJobMonitor: Map 1: 1/16 Reducer 2: 0/1
>> Map 1: 2/16 Reducer 2: 0/1
>> 13/12/13 20:57:58 INFO tez.TezJobMonitor: Map 1: 2/16 Reducer 2: 0/1
>> Map 1: 3/16 Reducer 2: 0/1
>> 13/12/13 20:57:58 INFO tez.TezJobMonitor: Map 1: 3/16 Reducer 2: 0/1
>> Map 1: 5/16 Reducer 2: 0/1
>> 13/12/13 20:57:59 INFO tez.TezJobMonitor: Map 1: 5/16 Reducer 2: 0/1
>> Map 1: 8/16 Reducer 2: 0/1
>> 13/12/13 20:57:59 INFO tez.TezJobMonitor: Map 1: 8/16 Reducer 2: 0/1
>> Map 1: 12/16 Reducer 2: 0/1
>> 13/12/13 20:57:59 INFO tez.TezJobMonitor: Map 1: 12/16 Reducer 2: 0/1
>> Map 1: 15/16 Reducer 2: 0/1
>> 13/12/13 20:58:00 INFO tez.TezJobMonitor: Map 1: 15/16 Reducer 2: 0/1
>> 13/12/13 20:58:00 INFO log.PerfLogger: </PERFLOG
>> method=TezRunVertex.Map 1 start=1386968253402 end=1386968280223
>> duration=26821 from=org.apache.hadoop.hive.ql.exec.tez.TezJobMonitor>
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:00 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:03 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:06 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:09 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:12 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:15 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:18 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:21 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:24 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:27 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:30 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:33 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:36 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:39 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:42 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:45 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:48 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:51 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:54 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:58:57 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:00 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:03 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:06 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:09 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:12 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:15 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:18 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:21 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:24 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:27 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:30 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:33 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:36 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:39 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:42 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:45 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:48 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:52 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:55 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 20:59:58 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 21:00:01 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 21:00:04 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 21:00:07 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 21:00:10 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>> Map 1: 16/16 Reducer 2: 0/1
>> 13/12/13 21:00:13 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
>>
>> Thanks,
>> Zhenxiao
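
For readers who hit the same symptom, the pattern in the log above (every map task finished while the single reducer stays at 0/1) can be measured mechanically. The sketch below is only an illustration: the function name is hypothetical, the vertex names "Map 1" and "Reducer 2" are hard-coded to match this particular query, and the timestamp layout is assumed to be the yy/MM/dd HH:mm:ss form seen in the log. As Gunther notes, a long wait on its own does not prove a hang, since a full order by may simply still be running.

```python
import re
from datetime import datetime

# Matches TezJobMonitor progress lines such as:
# 13/12/13 20:58:03 INFO tez.TezJobMonitor: Map 1: 16/16 Reducer 2: 0/1
PROGRESS = re.compile(
    r"(\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}) INFO tez\.TezJobMonitor: "
    r"Map 1: (\d+)/(\d+)\s+Reducer 2: (\d+)/(\d+)"
)


def reducer_wait_seconds(log_text):
    """Seconds the log spends with all maps done and no reducer task completed.

    Returns 0.0 if the log never reaches that state, or if it ends in a
    different state (for example, the reducer has finished).
    """
    waiting_since = None
    last_seen = None
    for match in PROGRESS.finditer(log_text):
        stamp = datetime.strptime(match.group(1), "%y/%m/%d %H:%M:%S")
        maps_done, maps_total = int(match.group(2)), int(match.group(3))
        reducers_done = int(match.group(4))
        if maps_done == maps_total and reducers_done == 0:
            if waiting_since is None:
                waiting_since = stamp
            last_seen = stamp
        else:
            # Maps still running, or the reducer made progress: reset.
            waiting_since = last_seen = None
    if waiting_since is None:
        return 0.0
    return (last_seen - waiting_since).total_seconds()
```

Fed the log in this thread, the function would measure the span from the first 16/16 line at 20:58:00 to the last one at 21:00:13. Lines showing "-/-" are skipped because they carry no task counts.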