Restrict number of PIG generated Haddop counters
-------------------------------------------------
Key: PIG-2208
URL: https://issues.apache.org/jira/browse/PIG-2208
Project: Pig
Issue Type: Bug
Components: impl
Affects Versions: 0.9.0, 0.8.1
Reporter: Richard Ding
Assignee: Richard Ding
Fix For: 0.9.1
PIG 8.0 implemented Hadoop counters to track the number of records read for
each input and the number of records written for each output (PIG-1389 &
PIG-1299). On the other hand, Hadoop has imposed limit on per job counters
(MAPREDUCE-1943) and jobs will fail if the counters exceed the limit.
Therefore we need a way to cap the number of PIG generated counters.
Here are the two options:
1. Add a integer property (e.g., pig.counter.limit) to the pig property file
(e.g., 20). If the number of inputs of a job exceeds this number, the input
counters are disabled. Similarly, if the number of outputs of a job exceeds
this number, the output counters are disabled.
2. Add a boolean property (e.g., pig.disable.counters) to the pig property file
(default: false). If this property is set to true, then the PIG generated
counters are disabled.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira