[ https://issues.apache.org/jira/browse/HBASE-20881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16588337#comment-16588337 ]
Duo Zhang commented on HBASE-20881:
-----------------------------------

In HBASE-21093 I increased the dispatch delay to allow batching more requests to the RS at once. In the successful run, I could see repeated logs like this:

{noformat}
2018-08-21 16:56:40,273 INFO [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000463,1534870587819.bd87e3924e0683edafb3168e3428d569.
2018-08-21 16:56:40,273 INFO [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000496,1534870587819.34f74533f57eac827c94933a1a18e6fc.
2018-08-21 16:56:40,273 INFO [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000467,1534870587819.ddb587667d4bdf8f9cec44a027fae399.
2018-08-21 16:56:40,273 INFO [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000478,1534870587819.a539a4b1d41ed39c78cc39b32a504286.
2018-08-21 16:56:40,273 INFO [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000488,1534870587819.60b3264aae27e8d3ab7d380253d13192.
2018-08-21 16:56:40,273 INFO [RpcServer.priority.FPBQ.Fifo.handler=5,queue=0,port=40911] regionserver.RSRpcServices(2012): Open TestMRegions,00000490,1534870587819.7886e4ccdaaeeee7b245179dba32e384.
{noformat}

But in the failed run, these logs are spread out, each appearing several seconds after the previous one. So I think the problem is that, on a slow machine, the default 150ms is too short for batching remote calls to the RS, as we only have 1 procedure worker for these tests. In HBASE-21093 I tried increasing it to 2s. Will commit later to see if it helps.

Thanks.
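To illustrate the reasoning above, here is a minimal, self-contained sketch of delay-window batching. It is not the HBase dispatcher code: the class name `DispatchDelaySketch`, the `batch` helper, and the 400 ms spacing between requests are all hypothetical, chosen only to mimic a single slow procedure worker. A batch opens when the first request arrives and is sent once the delay expires, carrying everything that arrived in the meantime.

```java
import java.util.ArrayList;
import java.util.List;

public class DispatchDelaySketch {

    /**
     * Groups sorted request arrival times (in ms) into batches. A batch opens at
     * the first pending arrival and closes delayMs later; any request arriving
     * after that deadline starts a new batch.
     */
    public static List<List<Long>> batch(long[] sortedArrivalsMs, long delayMs) {
        List<List<Long>> batches = new ArrayList<>();
        List<Long> current = new ArrayList<>();
        long deadline = 0;
        for (long t : sortedArrivalsMs) {
            // The open batch has already been sent by the time this request arrives.
            if (!current.isEmpty() && t > deadline) {
                batches.add(current);
                current = new ArrayList<>();
            }
            // The first request of a batch starts the delay window.
            if (current.isEmpty()) {
                deadline = t + delayMs;
            }
            current.add(t);
        }
        if (!current.isEmpty()) {
            batches.add(current);
        }
        return batches;
    }

    public static void main(String[] args) {
        // Assumed workload: one worker produces an open-region request every 400 ms.
        long[] arrivals = {0, 400, 800, 1200, 1600, 2000};
        System.out.println("150 ms delay -> " + batch(arrivals, 150).size() + " remote calls");
        System.out.println("2000 ms delay -> " + batch(arrivals, 2000).size() + " remote calls");
    }
}
```

Under these assumed arrival times, the 150 ms window never catches a second request, so every open goes out as its own remote call, while the 2 s window collects all six into one call. That matches the difference between the failed and successful runs described above.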
> Introduce a region transition procedure to handle all the state transition for a region
> ---------------------------------------------------------------------------------------
>
>                 Key: HBASE-20881
>                 URL: https://issues.apache.org/jira/browse/HBASE-20881
>             Project: HBase
>          Issue Type: Sub-task
>          Components: amv2, proc-v2
>            Reporter: Duo Zhang
>            Assignee: Duo Zhang
>            Priority: Major
>             Fix For: 3.0.0, 2.2.0
>
>         Attachments: HBASE-20881-v1.patch, HBASE-20881-v10.patch,
> HBASE-20881-v11.patch, HBASE-20881-v12.patch, HBASE-20881-v13.patch,
> HBASE-20881-v13.patch, HBASE-20881-v14.patch, HBASE-20881-v14.patch,
> HBASE-20881-v15.patch, HBASE-20881-v16.patch, HBASE-20881-v2.patch,
> HBASE-20881-v3.patch, HBASE-20881-v4.patch, HBASE-20881-v4.patch,
> HBASE-20881-v5.patch, HBASE-20881-v6.patch, HBASE-20881-v7.patch,
> HBASE-20881-v7.patch, HBASE-20881-v8.patch, HBASE-20881-v9.patch,
> HBASE-20881.patch
>
>
> Now we have an AssignProcedure, an UnassignProcedure, and also a
> MoveRegionProcedure which schedules an AssignProcedure and an
> UnassignProcedure to move a region. This makes the logic a bit complicated, as
> MRP is not a RIT, so when SCP can not interrupt it directly...

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)