[ 
https://issues.apache.org/jira/browse/PIG-2208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13089330#comment-13089330
 ] 

Richard Ding commented on PIG-2208:
-----------------------------------

It logs only once per job in the front end, so that the user is informed that 
the multi-input (or multi-output) counters are disabled. In the back end, the 
counters are simply disabled without logging. 
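
The "log once per job in the front end, stay silent in the back end" behavior described above could be sketched roughly as follows. This is only an illustration, not the code in PIG-2208.patch: the class name CounterWarning, the method counterDisabled, the frontEnd flag, and the log message are all hypothetical.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.logging.Logger;

/** Hypothetical helper; illustrates the behavior only, not Pig's actual code. */
final class CounterWarning {
    private static final Logger LOG =
            Logger.getLogger(CounterWarning.class.getName());

    // IDs of jobs for which the warning has already been emitted.
    private final Set<String> warnedJobs = ConcurrentHashMap.newKeySet();

    /**
     * Called when the counters of a job are disabled.
     * Returns true if a message was logged by this call.
     */
    boolean counterDisabled(String jobId, boolean frontEnd) {
        if (!frontEnd) {
            return false; // back end: disable silently, never log
        }
        if (!warnedJobs.add(jobId)) {
            return false; // front end, but this job was already reported
        }
        LOG.warning("Multi-input/output counters are disabled for job " + jobId);
        return true;
    }

    public static void main(String[] args) {
        CounterWarning w = new CounterWarning();
        w.counterDisabled("job_1", true);  // logs
        w.counterDisabled("job_1", true);  // same job again: no log
        w.counterDisabled("job_2", false); // back end: no log
    }
}
```

Keeping the set of already-warned job IDs in one place means the message shows up at most once per job, no matter how many inputs or outputs trigger the check.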

> Restrict number of PIG generated Hadoop counters 
> -------------------------------------------------
>
>                 Key: PIG-2208
>                 URL: https://issues.apache.org/jira/browse/PIG-2208
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.8.1, 0.9.0
>            Reporter: Richard Ding
>            Assignee: Richard Ding
>             Fix For: 0.9.1
>
>         Attachments: PIG-2208.patch
>
>
> Pig 0.8.0 implemented Hadoop counters to track the number of records read for 
> each input and the number of records written for each output (PIG-1389 & 
> PIG-1299). However, Hadoop imposes a limit on the number of counters per job 
> (MAPREDUCE-1943), and jobs fail if their counters exceed that limit.
> Therefore we need a way to cap the number of Pig-generated counters.
> Here are the two options:
> 1. Add an integer property (e.g., pig.counter.limit) to the Pig properties 
> file, set to a value such as 20. If the number of inputs of a job exceeds 
> this number, the input counters are disabled. Similarly, if the number of 
> outputs of a job exceeds this number, the output counters are disabled.
> 2. Add a boolean property (e.g., pig.disable.counters) to the Pig properties 
> file (default: false). If this property is set to true, the Pig-generated 
> counters are disabled.
>   
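
To make the two options in the description concrete, here is a minimal sketch of how each check might look. It assumes the property names suggested in the issue text (pig.counter.limit and pig.disable.counters) and a default limit of 20. The class CounterPolicy and its methods are hypothetical and are not taken from the attached patch.

```java
import java.util.Properties;

/** Hypothetical helper; not Pig's actual implementation. */
final class CounterPolicy {
    // Property names follow the "e.g." suggestions in the issue description.
    static final String LIMIT_KEY = "pig.counter.limit";
    static final String DISABLE_KEY = "pig.disable.counters";
    static final int DEFAULT_LIMIT = 20; // assumed default, taken from the example value

    private CounterPolicy() {}

    /**
     * Option 1: counters stay enabled only while the number of inputs
     * (or outputs) does not exceed the configured limit.
     */
    static boolean enabledUnderLimit(Properties props, int count) {
        int limit = Integer.parseInt(
                props.getProperty(LIMIT_KEY, String.valueOf(DEFAULT_LIMIT)));
        return count <= limit;
    }

    /** Option 2: a single switch turns all Pig-generated counters off. */
    static boolean enabledUnlessDisabled(Properties props) {
        return !Boolean.parseBoolean(props.getProperty(DISABLE_KEY, "false"));
    }

    public static void main(String[] args) {
        Properties props = new Properties();
        props.setProperty(LIMIT_KEY, "20");
        System.out.println(enabledUnderLimit(props, 3));   // true: 3 inputs, limit 20
        System.out.println(enabledUnderLimit(props, 25));  // false: 25 inputs exceed 20
        props.setProperty(DISABLE_KEY, "true");
        System.out.println(enabledUnlessDisabled(props));  // false: counters switched off
    }
}
```

With option 1, the check would be applied separately to the input count and the output count of a job, so a job with many inputs but few outputs would keep its output counters.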

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
