Greg Malewicz created HCATALOG-506:
--------------------------------------

             Summary: desired number of input splits for large files
                 Key: HCATALOG-506
                 URL: https://issues.apache.org/jira/browse/HCATALOG-506
             Project: HCatalog
          Issue Type: Improvement
            Reporter: Greg Malewicz


Allow user to specify the desired number of input splits through a new 
configuration parameter hcatalog.desiredNumInputSplits. Two existing parameters 
may also need to be specified: mapred.min.split.size and mapred.max.split.size. 
This is useful when there are few but large input files that we want to split 
into many splits, so as to enhance the parallelizm of loading the splits.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to