[
https://issues.apache.org/jira/browse/HBASE-12411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217447#comment-14217447
]
Lars Hofhansl commented on HBASE-12411:
---------------------------------------
Some numbers:
||scanners per region||pread forced||OS cache||time/s||
|1|yes|no|33|
|1|yes|yes|30|
|1|no|no|22|
|1|no|yes|20|
|5|no|yes|13|
|5|no|no|13.3|
|5|yes|yes|13.2|
|5|no|no|13.4|
|30|no|yes|12.9|
|30|yes|yes|12.8|
|30|yes|no|13.4|
|30|no|no|13.1|
Note: In the 1 scanner per region case I see no visible load on the data node
process. In all other cases, due to the involved p-reads I see considerable
load on the DN.
So it looks like p-read everywhere is not of much help.
> Optionally enable p-reads and private readers for compactions
> -------------------------------------------------------------
>
> Key: HBASE-12411
> URL: https://issues.apache.org/jira/browse/HBASE-12411
> Project: HBase
> Issue Type: Improvement
> Components: Performance
> Reporter: Lars Hofhansl
> Assignee: Lars Hofhansl
> Fix For: 2.0.0, 0.98.9, 0.99.2
>
> Attachments: 12411-v2.txt, 12411-v3.txt, 12411-v4.txt, 12411.txt
>
>
> In the light of HDFS-6735 we might want to consider refraining from seek +
> read completely and only perform preads.
> For example currently a compaction can lock out every other scanner over the
> file which the compaction is currently reading for compaction.
> At the very least we can introduce an option to avoid seek + read, so we can
> allow testing this in various scenarios.
> This will definitely be of great importance for projects like Phoenix which
> parallelize queries intra region (and hence readers will used concurrently by
> multiple scanner with high likelihood.)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)