Vishal Khandelwal created HBASE-16091:
-----------------------------------------

             Summary: Canary takes lot more time when there are delete markers 
in the table
                 Key: HBASE-16091
                 URL: https://issues.apache.org/jira/browse/HBASE-16091
             Project: HBase
          Issue Type: Bug
            Reporter: Vishal Khandelwal
            Assignee: Vishal Khandelwal


We have a table which has lot of delete markers and we running Canary test on a 
regular interval sometimes tests are timing out because to reading first row 
would skip all these delete markers. Since purpose of Canary is to find health 
of the region, i think keeping raw=true would not defeat the purpose but 
provide good perf improvement. 

Following are the example of one such scan where 
without changing code it took 62.3 sec for onre region scan
2016-06-23 08:49:11,670 INFO  [pool-2-thread-1] tool.Canary - read from region  
<tablename>.<region> column family 0 in 62338ms

whereas after setting raw=true, it reduced to 58ms
2016-06-23 08:45:20,259 INFO  [pool-2-thread-1] tests.Canary - read from region 
<tablename>.<region> column family 0 in 58ms

Taking this over multiple tables , with multiple region would be a good 
performance gain.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to