aggregate files for replicated join
-----------------------------------
Key: PIG-1458
URL: https://issues.apache.org/jira/browse/PIG-1458
Project: Pig
Issue Type: Improvement
Reporter: Olga Natkovich

We have noticed that when the smaller input of a replicated join consists of many files, it places an unnecessary burden on the name node. Pre-aggregating those files can improve the situation.
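
Until something like this exists inside Pig, a user-level workaround might look like the sketch below. It is not the change proposed in this issue, only an illustration of the idea. The paths, relation names, and schemas are hypothetical. It assumes that forcing a reduce phase with PARALLEL 1 is acceptable for the small input, so that the input is rewritten as a single part file before the join reads it.

```pig
-- Hypothetical paths and schemas, for illustration only.
small_raw = LOAD 'small_input_dir' AS (k:chararray, w:int);

-- ORDER forces a reduce phase; PARALLEL 1 gives one reducer, hence one output file.
small_sorted = ORDER small_raw BY k PARALLEL 1;
STORE small_sorted INTO 'small_input_merged';

-- The replicated join then reads the consolidated copy instead of many small files.
big = LOAD 'big_input_dir' AS (k:chararray, v:int);
small = LOAD 'small_input_merged' AS (k:chararray, w:int);
joined = JOIN big BY k, small BY k USING 'replicated';
STORE joined INTO 'join_output';
```

The extra sort costs one additional job, which is likely to pay off only when the small input is reused or is split across a large number of files.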
