[ https://issues.apache.org/jira/browse/PIG-1249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12838227#action_12838227 ]
Jeff Zhang commented on PIG-1249: --------------------------------- +1, And I find that hive can estimate the reducer number according the input size. This is a really useful feature. > Safe-guards against misconfigured Pig scripts without PARALLEL keyword > ---------------------------------------------------------------------- > > Key: PIG-1249 > URL: https://issues.apache.org/jira/browse/PIG-1249 > Project: Pig > Issue Type: Improvement > Reporter: Arun C Murthy > Priority: Critical > > It would be *very* useful for Pig to have safe-guards against naive scripts > which process a *lot* of data without the use of PARALLEL keyword. > We've seen a fair number of instances where naive users process huge > data-sets (>10TB) with badly mis-configured #reduces e.g. 1 reduce. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.