[ 
https://issues.apache.org/jira/browse/HBASE-7006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13576304#comment-13576304
 ] 

Enis Soztutar commented on HBASE-7006:
--------------------------------------

Agreed that it is the middle ground. On region open, RS has to do a read on the 
index, and seek, and sequential read for each region. However, in your approach 
as you reported off-list, we are paying for re-locating the regions, and the 
rpc overhead instead of just streaming sequential writes to hdfs. I was just 
curious, given the current implementation, which one would be faster. I am not 
suggesting that we should prototype that as well, especially given that we can 
open the regions for writes in 1-2 secs with this. 

                
> [MTTR] Study distributed log splitting to see how we can make it faster
> -----------------------------------------------------------------------
>
>                 Key: HBASE-7006
>                 URL: https://issues.apache.org/jira/browse/HBASE-7006
>             Project: HBase
>          Issue Type: Bug
>            Reporter: stack
>            Assignee: Jeffrey Zhong
>             Fix For: 0.96.0
>
>         Attachments: LogSplitting Comparison.pdf, 
> ProposaltoimprovelogsplittingprocessregardingtoHBASE-7006.pdf
>
>
> Just saw interesting issue where a cluster went down  hard and 30 nodes had 
> 1700 WALs to replay.  Replay took almost an hour.  It looks like it could run 
> faster that much of the time is spent zk'ing and nn'ing.
> Putting in 0.96 so it gets a look at least.  Can always punt.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to