Rohini Palaniswamy created PIG-3903:
---------------------------------------

             Summary: Configure mapred.min.split.size to be same as 
pig.maxCombinedSplitSize
                 Key: PIG-3903
                 URL: https://issues.apache.org/jira/browse/PIG-3903
             Project: Pig
          Issue Type: Bug
            Reporter: Rohini Palaniswamy


FileInputFormat calculates the split size as 
Math.max(minSize, Math.min(maxSize, blockSize));

By default pig.maxCombinedSplitSize is 128MB if pig.noSplitCombinaton is not 
specifically turned off. We should set the mapred.min.split.size (if not 
already set by the user) to same as pig.maxCombinedSplitSize, so the underlying 
FileInputFormat itself gives us bigger splits when possible instead of pig 
combining smaller splits.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to