houman babai created OOZIE-3604:
-----------------------------------
Summary: Oozie Luncher & Map-Reduce Action Complete Successfully
However Oozie Still Fails the Action
Key: OOZIE-3604
URL: https://issues.apache.org/jira/browse/OOZIE-3604
Project: Oozie
Issue Type: Bug
Components: core
Affects Versions: 5.1.0
Environment: 5.1.0-cdh-6.3.1
5.1.0-cdh-6.3.2
Reporter: houman babai
I am using oozie 5.1.0-cdh6.3.1
In my workflow I have a mapreduce action, which generates over 300 counters.
The oozie launcher & the mapreduce job successfully complete, however, oozie
reports that:
{code:java}
Error Code: LimitExceededException
LimitExceededException: Too many counters: 121 max=120
{code}
I have updated mapred-site.xml.
The log for the *mapreduce job* reports success, in fact I can see all the
counters & the actual output of the mapreduce job on hdfs.
In the oozie *launcher log* I can see:
* mapreduce.job.counters.max : 8192
* mapreduce.job.counters.groups.max : 100
I set the oozie logs to TRACE, and I can see the following WARN:
{code:java}
WARN org.apache.oozie.command.wf.ActionEndXCommand: SERVER[SERVER-NAME]
USER[MY-NAME] GROUP[-] TOKEN[] APP[APP-NAME]
JOB[0000012-200714223028181-oozie-oozi-W]
ACTION[0000012-200714223028181-oozie-oozi-W@ACTION-NAME] Error ending action
[ACTION-NAME]. ErrorType [ERROR], ErrorCode [LimitExceededException], Message
[LimitExceededException: Too many counters: 121 max=120]
WARN org.apache.oozie.command.wf.ActionEndXCommand: SERVER[SERVER-NAME]
USER[MY-NAME] GROUP[-] TOKEN[] APP[APP-NAME]
JOB[0000012-200714223028181-oozie-oozi-W]
ACTION[0000012-200714223028181-oozie-oozi-W@ACTION-NAME] Setting Action Status
to [ERROR]
{code}
Looking through the oozie source code, I think the issue may be traced to [line
in
MapReduceActionExecutor|https://github.com/apache/oozie/blob/a40ab5361372aa73c9e4aa386a9c81bd21742aa4/core/src/main/java/org/apache/oozie/action/hadoop/MapReduceActionExecutor.java#L250]
I am guessing that for some reason, the context or the config passed into the
[createJobClient|https://github.com/apache/oozie/blob/a40ab5361372aa73c9e4aa386a9c81bd21742aa4/core/src/main/java/org/apache/oozie/action/hadoop/MapReduceActionExecutor.java#L242]
lack the mapreduce.job.counters.max configuration.
I tried several other things hoping that it would somehow work:
* I added the mapreduce.job.counters.max as configuration into the action
(inside the workflow)
* I added the mapreduce.job.counters.max to action config default.xml
Nothing has helped.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)