[ https://issues.apache.org/jira/browse/HBASE-19226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16248779#comment-16248779 ]
Yun Zhao commented on HBASE-19226: ---------------------------------- Thanks for your review [~reidchan] The buckets used to record the number of regions for each partition. When offset is equal to the first bucket number, use the current startKey as a splitkey and start counting the next bucket. Ignores the startkey of the first bucket, replacing sorted.remove(sorted.first()); > Limit the reduce tasks number of incremental load > ------------------------------------------------- > > Key: HBASE-19226 > URL: https://issues.apache.org/jira/browse/HBASE-19226 > Project: HBase > Issue Type: Improvement > Reporter: Yun Zhao > Assignee: Yun Zhao > Priority: Minor > Attachments: HBASE-19226.master.001.patch, > HBASE-19226.master.002.patch > > > When using MapReduce job to perform an incremental load into a table,the > number of reduce tasks is the current number of regions. If there are too > many regions, will lead to network+disk I/O is too large, affecting the > real-time request. > Need to use a configuration to set a number or ratio? > Limit running reduce tasks since > [https://issues.apache.org/jira/browse/MAPREDUCE-5583], the old version can > only be set reduce number. -- This message was sent by Atlassian JIRA (v6.4.14#64029)