[ https://issues.apache.org/jira/browse/HBASE-26590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17469506#comment-17469506 ]
Huaxiang Sun commented on HBASE-26590: -------------------------------------- I modified my testing case, excluding connection setup/teardown from the time counted. Here is the result for 1m random meta lookup. I added option to use BlockingRpcClient for meta lookup against the default NettyRpcClient. ||h5. ~Version~ ||h5. ~Meta Replica Load Balance Enabled~||h5. ~BlockingRpcClient~ ||h5. ~Time(ms)~|| ||h5. ~2.4.5-with-fixed~||h5. ~No~||h5. ~No~||h5. ~370814~|| ||h5. ~2.4.5-with-fixed~||h5. ~No~||h5. ~Yes~||h5. ~358931~|| ||h5. ~2.4.5-with-fixed~||h5. ~Yes~||h5. ~Yes~||h5. ~349485~ || ||h5. ~2.4.5~||h5. ~No~||h5. ~No~||h5. ~516640~ || ||h5. ~2.4.5~||h5. ~Yes~||h5. ~Yes~||h5. ~497509~|| ||h5. ~cdh-5.16.2~||h5. ~No~||h5. ~No~||h5. ~371540~|| When I did the Table.get() test. It is hard to draw a solid conclusion due to key distribution, most of the keys randomly created fall into the the last region and it is cached. BlockingRpcClient/NettyRpcClient difference is about 3% (Not as initially reported as 5 ~ 10%), so not a very big concern here. This difference here is not big as what we observed at the production cluster. I am going to put up the patch and will work with the team to see if it helps. > Hbase-client Meta lookup performance regression between hbase-1 and hbase-2 > --------------------------------------------------------------------------- > > Key: HBASE-26590 > URL: https://issues.apache.org/jira/browse/HBASE-26590 > Project: HBase > Issue Type: Improvement > Components: meta > Affects Versions: 2.4.0, 2.5.0, 2.3.7, 2.6.0 > Reporter: Huaxiang Sun > Assignee: Huaxiang Sun > Priority: Major > > One of our users complained higher latency after application upgrades from > hbase-1.2 client (CDH-5.16.2) to hbase-2.4.5 client with meta replica Load > Balance mode during app restart. I reproduced the regression by a test for > meta lookup. > At my test cluster, there are 160k regions for the test table, so there are > 160k entries in meta region. Used one thread to do 1 million meta lookup > against the meta region server. > > ||Version ||Meta Replica Load Balance Enabled||Time || > ||2.4.5-with-fixed||Yes||336458ms|| > ||2.4.5-with-fixed||No||333253ms|| > ||2.4.5||Yes||469980ms|| > ||2.4.5||No||470515ms|| > | *cdh-5.16.2*| *No* | *323412ms*| > -- This message was sent by Atlassian Jira (v8.20.1#820001)