Re: [DISCUSS]HBase2.1.0 is slower than HBase1.2.0

OpenInx Thu, 11 Jun 2020 03:40:00 -0700

Hi Zheng wang.

Hope this issue will be helpful for you.
https://issues.apache.org/jira/browse/HBASE-21657
Thanks.


On Tue, Jun 9, 2020 at 5:53 PM Anoop John <anoop.hb...@gmail.com> wrote:

> Thanks for the detailed analysis and update zheng wang.
> >The code line below in StoreScanner.next() cost about 100ms in v2.1, and
> it added from v2.0, see HBASE-17647.&nbsp
> So still there is some additional cost in 2.1 right? Do u have any other
> observation?  Are we doing more cell compares in 2.x?
>
> Anoop
>
>
> On Mon, Jun 8, 2020 at 1:50 AM zheng wang <18031...@qq.com> wrote:
>
> > Hi guys:
> >
> >
> > I did some test on my pc to find the reason as Jan Van Besien mentioned
> in
> > user channel.
> >
> >
> > #test env
> > OS : win10
> > JDK: 1.8
> > MEM: 8GB
> >
> >
> > #test data:
> > 1 million rows with only one columnfamily and one qualifier.
> >
> >
> > rowkey: rowkey-#index#
> > value: value-#index#
> >
> >
> > #test method:
> > just use client api to scan with default config several times, no pe, no
> > ycsb
> >
> >
> > #test result(avg):
> > v1.2.0: 800ms
> > v2.1.0: 1050ms
> >
> >
> > So, it is sure that v2.1 is slower than v1.2, after this, i did some
> > statistics on regionserver.
> > Then i find the partly reason is related to the size estimated.
> >
> >
> > The code line below in StoreScanner.next() cost about 100ms in v2.1, and
> > it added from v2.0, see HBASE-17647.&nbsp;
> > "int cellSize = PrivateCellUtil.estimatedSerializedSizeOf(cell);"
> >
> >
> > Should we support to disable the MaxResultSize limit(2MB by default now)
> > to get more efficient if user exactly knows their data and could limit
> > results only by setBatch and setLimit?&nbsp;
>

Re: [DISCUSS]HBase2.1.0 is slower than HBase1.2.0

Reply via email to