[ 
https://issues.apache.org/jira/browse/HBASE-12411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217447#comment-14217447
 ] 

Lars Hofhansl commented on HBASE-12411:
---------------------------------------

Some numbers: 
||scanners per region||pread forced||OS cache||time/s||
|1|yes|no|33|
|1|yes|yes|30|
|1|no|no|22|
|1|no|yes|20|
|5|no|yes|13|
|5|no|no|13.3|
|5|yes|yes|13.2|
|5|no|no|13.4|
|30|no|yes|12.9|
|30|yes|yes|12.8|
|30|yes|no|13.4|
|30|no|no|13.1|

Note: In the 1 scanner per region case I see no visible load on the data node 
process. In all other cases, due to the involved p-reads I see considerable 
load on the DN.

So it looks like p-read everywhere is not of much help. 


> Optionally enable p-reads and private readers for compactions
> -------------------------------------------------------------
>
>                 Key: HBASE-12411
>                 URL: https://issues.apache.org/jira/browse/HBASE-12411
>             Project: HBase
>          Issue Type: Improvement
>          Components: Performance
>            Reporter: Lars Hofhansl
>            Assignee: Lars Hofhansl
>             Fix For: 2.0.0, 0.98.9, 0.99.2
>
>         Attachments: 12411-v2.txt, 12411-v3.txt, 12411-v4.txt, 12411.txt
>
>
> In the light of HDFS-6735 we might want to consider refraining from seek + 
> read completely and only perform preads.
> For example currently a compaction can lock out every other scanner over the 
> file which the compaction is currently reading for compaction.
> At the very least we can introduce an option to avoid seek + read, so we can 
> allow testing this in various scenarios.
> This will definitely be of great importance for projects like Phoenix which 
> parallelize queries intra region (and hence readers will used concurrently by 
> multiple scanner with high likelihood.)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to