[ 
https://issues.apache.org/jira/browse/HBASE-19226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16248779#comment-16248779
 ] 

Yun Zhao commented on HBASE-19226:
----------------------------------

Thanks for your review [~reidchan]

The buckets used to record the number of regions for each partition. 
When offset is equal to the first bucket number, use the current startKey as a 
splitkey and start counting the next bucket.
Ignores the startkey of the first bucket, replacing  
sorted.remove(sorted.first());

> Limit the reduce tasks number of incremental load
> -------------------------------------------------
>
>                 Key: HBASE-19226
>                 URL: https://issues.apache.org/jira/browse/HBASE-19226
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Yun Zhao
>            Assignee: Yun Zhao
>            Priority: Minor
>         Attachments: HBASE-19226.master.001.patch, 
> HBASE-19226.master.002.patch
>
>
> When using MapReduce job to perform an incremental load into a table,the 
> number of reduce tasks is the current number of regions. If there are too 
> many regions, will lead to network+disk I/O is too large, affecting the 
> real-time request.
> Need to use a configuration to set a number or ratio?
> Limit running reduce tasks since 
> [https://issues.apache.org/jira/browse/MAPREDUCE-5583], the old version can 
> only be set reduce number.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to