[ 
https://issues.apache.org/jira/browse/HBASE-20881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16588337#comment-16588337
 ] 

Duo Zhang commented on HBASE-20881:
-----------------------------------

In HBASE-21093 I increased the dispatch delay, to allow batching more requests 
to the RS at once. In the successful run, I could see repeated logs like this:

{noformat}
2018-08-21 16:56:40,273 INFO  [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000463,1534870587819.bd87e3924e0683edafb3168e3428d569.
2018-08-21 16:56:40,273 INFO  [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000496,1534870587819.34f74533f57eac827c94933a1a18e6fc.
2018-08-21 16:56:40,273 INFO  [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000467,1534870587819.ddb587667d4bdf8f9cec44a027fae399.
2018-08-21 16:56:40,273 INFO  [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000478,1534870587819.a539a4b1d41ed39c78cc39b32a504286.
2018-08-21 16:56:40,273 INFO  [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000488,1534870587819.60b3264aae27e8d3ab7d380253d13192.
2018-08-21 16:56:40,273 INFO  [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000490,1534870587819.7886e4ccdaaeeee7b245179dba32e384.
{noformat}

In the failed run, however, these log entries are spread out, several seconds 
apart. So I think the problem is that, on a slow machine, the default dispatch 
delay of 150ms is too short to batch the remote calls to the RS, since we only 
have 1 procedure worker for these tests and it submits the requests slowly. In 
HBASE-21093 I tried increasing the delay to 2s. Will commit later to see if it 
helps.
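
To illustrate the reasoning, here is a minimal sketch of a fixed-window batching model. It is not the actual dispatcher code in HBase; the class name, the `batch` helper and the 400ms spacing between requests are all assumptions made up for the example. A batch opens when its first request arrives and is sent once the delay has elapsed, so a slow single worker gets one RPC per region with a short delay and a single batched RPC with a long one.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical model, not HBase code: groups request arrival times into batches.
public class DispatchDelaySketch {

    // A batch opens when its first request arrives and is sent delayMs later.
    // Requests arriving before that deadline join the open batch.
    // arrivalsMs must be sorted in ascending order.
    public static List<List<Long>> batch(long[] arrivalsMs, long delayMs) {
        List<List<Long>> batches = new ArrayList<>();
        List<Long> current = null;
        long deadline = 0;
        for (long t : arrivalsMs) {
            if (current == null || t > deadline) {
                // Previous batch has already been sent, open a new one.
                current = new ArrayList<>();
                batches.add(current);
                deadline = t + delayMs;
            }
            current.add(t);
        }
        return batches;
    }

    public static void main(String[] args) {
        // Assumed slow machine: the single procedure worker submits one
        // open-region request every 400ms.
        long[] arrivals = {0, 400, 800, 1200, 1600, 2000};
        System.out.println("150ms delay:  " + batch(arrivals, 150).size() + " RPCs");
        System.out.println("2000ms delay: " + batch(arrivals, 2000).size() + " RPCs");
    }
}
```

Under these assumed arrival times, the 150ms delay yields 6 separate RPCs while the 2s delay yields 1, which matches the difference between the failed and successful runs described above.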

Thanks.

> Introduce a region transition procedure to handle all the state transition 
> for a region
> ---------------------------------------------------------------------------------------
>
>                 Key: HBASE-20881
>                 URL: https://issues.apache.org/jira/browse/HBASE-20881
>             Project: HBase
>          Issue Type: Sub-task
>          Components: amv2, proc-v2
>            Reporter: Duo Zhang
>            Assignee: Duo Zhang
>            Priority: Major
>             Fix For: 3.0.0, 2.2.0
>
>         Attachments: HBASE-20881-v1.patch, HBASE-20881-v10.patch, 
> HBASE-20881-v11.patch, HBASE-20881-v12.patch, HBASE-20881-v13.patch, 
> HBASE-20881-v13.patch, HBASE-20881-v14.patch, HBASE-20881-v14.patch, 
> HBASE-20881-v15.patch, HBASE-20881-v16.patch, HBASE-20881-v2.patch, 
> HBASE-20881-v3.patch, HBASE-20881-v4.patch, HBASE-20881-v4.patch, 
> HBASE-20881-v5.patch, HBASE-20881-v6.patch, HBASE-20881-v7.patch, 
> HBASE-20881-v7.patch, HBASE-20881-v8.patch, HBASE-20881-v9.patch, 
> HBASE-20881.patch
>
>
> Now we have an AssignProcedure, an UnassignProcedure, and also a 
> MoveRegionProcedure which schedules an AssignProcedure and an 
> UnassignProcedure to move a region. This makes the logic a bit complicated, as 
> MRP is not a RIT, so when SCP can not interrupt it directly...



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
