[
https://issues.apache.org/jira/browse/PIG-3903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rohini Palaniswamy reassigned PIG-3903:
---------------------------------------
Assignee: Rohini Palaniswamy
> Configure mapred.min.split.size to be same as pig.maxCombinedSplitSize
> ----------------------------------------------------------------------
>
> Key: PIG-3903
> URL: https://issues.apache.org/jira/browse/PIG-3903
> Project: Pig
> Issue Type: Bug
> Reporter: Rohini Palaniswamy
> Assignee: Rohini Palaniswamy
>
> FileInputFormat calculates the split size as
> Math.max(minSize, Math.min(maxSize, blockSize));
> By default, pig.maxCombinedSplitSize is 128MB unless split combination is
> explicitly turned off via pig.noSplitCombination. We should set
> mapred.min.split.size (if not already set by the user) to the same value as
> pig.maxCombinedSplitSize, so that the underlying FileInputFormat itself gives
> us bigger splits when possible, instead of Pig combining smaller splits.
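
To illustrate the reasoning in the description, here is a minimal sketch of the split size expression with and without a raised minimum. The class and method names are hypothetical. The 64MB block size, the minimum of 1 byte, and the unset maximum (Long.MAX_VALUE) are assumptions chosen for illustration, not values taken from the issue.

```java
public class SplitSizeSketch {
    static final long MB = 1024L * 1024L;

    // Same expression as quoted in the issue description.
    static long computeSplitSize(long blockSize, long minSize, long maxSize) {
        return Math.max(minSize, Math.min(maxSize, blockSize));
    }

    public static void main(String[] args) {
        long blockSize = 64 * MB;        // assumed block size, for illustration only
        long maxSize = Long.MAX_VALUE;   // assumed: no maximum split size configured
        long combined = 128 * MB;        // pig.maxCombinedSplitSize per the description

        // Minimum left at an assumed 1 byte: the split size follows the block size.
        System.out.println(computeSplitSize(blockSize, 1L, maxSize) / MB + "MB");

        // Minimum raised to pig.maxCombinedSplitSize: the split size grows to match.
        System.out.println(computeSplitSize(blockSize, combined, maxSize) / MB + "MB");
    }
}
```

Under these assumptions, raising the minimum makes FileInputFormat produce 128MB splits directly, so there would be fewer small splits left for Pig to combine afterwards. When the block size is already at or above the minimum, the expression leaves the split size at the block size.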