Hi Nitin,

I have a M/R job which gets executed as soon as a hive query runs. This hive 
query has many complex joins.

Both the Map/Reduce jobs complete with status at 100%, after which I get the 
below error


2014-06-27 23:42:06,396 INFO  org.apache.hadoop.hive.ql.exec.Task 
(SessionState.java:printInfo(377)) - 2014-06-27 23:42:06,395 Stage-2 map = 
100%,  reduce = 100%, Cumulative CPU 532.7 sec
2014-06-27 23:42:07,429 INFO  org.apache.hadoop.hive.ql.exec.Task 
(SessionState.java:printInfo(377)) - 2014-06-27 23:42:07,428 Stage-2 map = 
100%,  reduce = 100%, Cumulative CPU 532.7 sec
2014-06-27 23:42:07,430 INFO  org.apache.hadoop.hive.ql.exec.Task 
(SessionState.java:printInfo(377)) - MapReduce Total cumulative CPU time: 8 
minutes 52 seconds 700 msec
2014-06-27 23:42:07,445 ERROR org.apache.hadoop.hive.ql.exec.Task 
(SessionState.java:printError(386)) - Ended Job = job_1403907716709_0021 with 
errors
2014-06-27 23:42:07,447 ERROR org.apache.hadoop.hive.ql.exec.Task 
(SessionState.java:printError(386)) - Error during job, obtaining debugging 
information...
2014-06-27 23:42:07,447 INFO  org.apache.hadoop.conf.Configuration.deprecation 
(Configuration.java:warnOnceIfDeprecated(840)) - mapred.job.tracker is 
deprecated. Instead, use mapreduce.jobtracker.address
2014-06-27 23:42:07,452 ERROR org.apache.hadoop.hive.ql.exec.Task 
(SessionState.java:printError(386)) - Examining task ID: 
task_1403907716709_0021_m_000004 (and more) from job job_1403907716709_0021
2014-06-27 23:42:07,453 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,453 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,453 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,453 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,454 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,454 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,454 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,454 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,454 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,454 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,457 ERROR org.apache.hadoop.hive.ql.exec.Task 
(SessionState.java:printError(386)) - Examining task ID: 
task_1403907716709_0021_r_000000 (and more) from job job_1403907716709_0021
2014-06-27 23:42:07,457 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,457 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,458 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,458 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,458 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,479 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,480 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,482 WARN  org.apache.hadoop.hive.shims.HadoopShimsSecure 
(Hadoop23Shims.java:getTaskAttemptLogUrl(56)) - Can't fetch tasklog: 
TaskLogServlet is not supported in MR2 mode.
2014-06-27 23:42:07,491 ERROR org.apache.hadoop.hive.ql.exec.Task 
(SessionState.java:printError(386)) -
Task with the most failures(4):
-----
Task ID:
  task_1403907716709_0021_r_000004

URL:
  
http://ip-10-40-51-206.ec2.internal:9026/taskdetails.jsp?jobid=job_1403907716709_0021&tipid=task_1403907716709_0021_r_000004
-----
Diagnostic Messages for this Task:
Exception from container-launch:
org.apache.hadoop.util.Shell$ExitCodeException:
        at org.apache.hadoop.util.Shell.runCommand(Shell.java:464)
        at org.apache.hadoop.util.Shell.run(Shell.java:379)
        at 
org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:589)
        at 
org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:195)
        at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:283)
        at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:79)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:744)




2014-06-27 23:42:07,494 INFO  org.apache.hadoop.hive.ql.exec.Task 
(SessionState.java:printInfo(377)) - Counters:
2014-06-27 23:42:07,501 INFO  
org.apache.hadoop.yarn.client.api.impl.YarnClientImpl 
(YarnClientImpl.java:killApplication(182)) - Killing application 
application_1403907716709_0021
2014-06-27 23:42:07,605 ERROR org.apache.hadoop.hive.ql.Driver 
(SessionState.java:printError(386)) - FAILED: Execution Error, return code 2 
from org.apache.hadoop.hive.ql.exec.MapRedTask
2014-06-27 23:42:07,606 INFO  org.apache.hadoop.hive.ql.Driver 
(PerfLogger.java:PerfLogEnd(127)) - </PERFLOG method=Driver.execute 
start=1403911523544 end=1403912527606 duration=1004062>
2014-06-27 23:42:07,606 INFO  org.apache.hadoop.hive.ql.Driver 
(SessionState.java:printInfo(377)) - MapReduce Jobs Launched:
2014-06-27 23:42:07,607 INFO  org.apache.hadoop.hive.ql.Driver 
(SessionState.java:printInfo(377)) - Job 0: Map: 25  Reduce: 10   Cumulative 
CPU: 569.15 sec   HDFS Read: 8125 HDFS Write: 483117997 SUCCESS
2014-06-27 23:42:07,607 INFO  org.apache.hadoop.hive.ql.Driver 
(SessionState.java:printInfo(377)) - Job 1: Map: 5  Reduce: 10   Cumulative 
CPU: 532.7 sec   HDFS Read: 483383876 HDFS Write: 100388623 FAIL
2014-06-27 23:42:07,607 INFO  org.apache.hadoop.hive.ql.Driver 
(SessionState.java:printInfo(377)) - Total MapReduce CPU Time Spent: 18 minutes 
21 seconds 850 msec
2014-06-27 23:42:07,608 INFO  org.apache.hadoop.hive.ql.Driver 
(PerfLogger.java:PerfLogBegin(100)) - <PERFLOG method=releaseLocks>
2014-06-27 23:42:07,608 INFO  org.apache.hadoop.hive.ql.Driver 
(PerfLogger.java:PerfLogEnd(127)) - </PERFLOG method=releaseLocks 
start=1403912527608 end=1403912527608 duration=0>
2014-06-27 23:42:07,611 INFO  org.apache.hadoop.hive.ql.Driver 
(PerfLogger.java:PerfLogBegin(100)) - <PERFLOG method=releaseLocks>
2014-06-27 23:42:07,612 INFO  org.apache.hadoop.hive.ql.Driver 
(PerfLogger.java:PerfLogEnd(127)) - </PERFLOG method=releaseLocks 
start=1403912527611 end=1403912527612 duration=1>
2014-06-27 23:42:07,614 INFO  amazon.emr.metrics.MetricsSaver 
(MetricsSaver.java:performShutdown(423)) - Inside MetricsSaver Shutdown Hook


One INFO which might help in your understanding.

The two properties (mapred.max.split.size -> 25600000 & 
hive.merge.size.per.task -> 256000000) are different for 
"/home/hadoop/hive/conf/hive-default.xml" in PRD. Do they have to be same or 
they could be different?

We have the two values same (256000000) in STG environment, where we don't face 
this issue.



Can you please help?


Thanks,
Shouvanik

________________________________

This message is for the designated recipient only and may contain privileged, 
proprietary, or otherwise confidential information. If you have received it in 
error, please notify the sender immediately and delete the original. Any other 
use of the e-mail by you is prohibited. Where allowed by local law, electronic 
communications with Accenture and its affiliates, including e-mail and instant 
messaging (including content), may be scanned by our systems for the purposes 
of information security and assessment of internal compliance with Accenture 
policy.
______________________________________________________________________________________

www.accenture.com

Reply via email to