Shubham Roy created HBASE-29811:
-----------------------------------

             Summary: Introduce capping of input splits based on provided 
config value for MR jobs on live tables.
                 Key: HBASE-29811
                 URL: https://issues.apache.org/jira/browse/HBASE-29811
             Project: HBase
          Issue Type: New Feature
          Components: mapreduce
    Affects Versions: 2.5.13, 2.6.4, 3.0.0
            Reporter: Shubham Roy
            Assignee: Shubham Roy


For merging input splits, we need a config parameter to set the number of 
mappers we want, say M. Then if we have N input splits, we will merge 
ceiling(N/M) consecutive splits. Let s be the start key of the first split in 
the merge list and e is the end key of the last split. Then the result of the 
merge will be one splits [s, e].



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to