How to scan only Memstore from end point co-processor

2015-05-31 Thread Gautam Borah
Hi all, Here is our use case: we have a very write-heavy cluster. We also run periodic endpoint coprocessor jobs, every 10 minutes, that operate on the data written in the last 10-15 minutes. Is there a way to query only the MemStore from the endpoint coprocessor? The periodic job scans

Re: How to scan only Memstore from end point co-processor

2015-05-31 Thread ramkrishna vasudevan
We have a postScannerOpen hook in the CP, but it may not give you direct access to know which internal scanners are on the MemStore and which are on the store files. This is possible, but we may need to add some new hooks at the place where we explicitly add the internal scanners

Re: How to scan only Memstore from end point co-processor

2015-05-31 Thread Anoop John
If your scan has a time range specified, HBase internally checks it against the time range of the store files etc. and avoids those that are clearly outside the time range you are interested in. You don't have to do anything for this. Make sure you set the TimeRange for your read. -Anoop- On Mon,
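The pruning described above can be pictured with a small plain-Java model. This is only a sketch, not HBase code: `FileInfo`, `overlaps`, and `filesToRead` are hypothetical names invented for illustration, and the model assumes each store file records the minimum and maximum cell timestamp it holds. On the client side the corresponding step is setting the range on the scan, e.g. `scan.setTimeRange(minStamp, maxStamp)`, where the range is treated as [minStamp, maxStamp).

```java
import java.util.ArrayList;
import java.util.List;

// Plain-Java model of time-range pruning; no HBase classes are used.
final class TimeRangePruning {

    // Hypothetical stand-in for store file metadata:
    // smallest and largest cell timestamp in the file (both inclusive).
    static final class FileInfo {
        final String name;
        final long minTs;
        final long maxTs;

        FileInfo(String name, long minTs, long maxTs) {
            this.name = name;
            this.minTs = minTs;
            this.maxTs = maxTs;
        }
    }

    // True if the file range [fileMin, fileMax] intersects the scan
    // range [scanMin, scanMax). The scan's upper bound is exclusive.
    static boolean overlaps(long fileMin, long fileMax, long scanMin, long scanMax) {
        return fileMax >= scanMin && fileMin < scanMax;
    }

    // Returns the names of the files a scan with this time range would still read.
    static List<String> filesToRead(List<FileInfo> files, long scanMin, long scanMax) {
        List<String> kept = new ArrayList<>();
        for (FileInfo f : files) {
            if (overlaps(f.minTs, f.maxTs, scanMin, scanMax)) {
                kept.add(f.name);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        long now = System.currentTimeMillis();
        long windowStart = now - 15L * 60L * 1000L; // last 15 minutes

        List<FileInfo> files = new ArrayList<>();
        files.add(new FileInfo("old-file", windowStart - 7_200_000L, windowStart - 3_600_000L));
        files.add(new FileInfo("recent-file", windowStart - 60_000L, now - 1_000L));

        // Only files whose timestamp range touches the window are kept.
        System.out.println(filesToRead(files, windowStart, now));
    }
}
```

With a tight time range on a write-heavy table, most older store files would drop out this way, so a scan over the last 10-15 minutes touches mainly the MemStore and the most recently flushed files.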

Re: How to scan only Memstore from end point co-processor

2015-06-01 Thread Vladimir Rodionov
InternalScan has a constructor that takes a Scan object (see https://issues.apache.org/jira/browse/HBASE-12720). You can instantiate InternalScan from Scan, set checkOnlyMemStore, then open a RegionScanner, but the best approach is to cache data on write and run a regular RegionScanner from memstore and block cache. be
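As a rough picture of what a memstore-only flag does, here is a plain-Java model of the scanner selection step. It is a sketch under assumptions, not the HBase implementation: `CandidateScanner`, `Mode`, and `select` are hypothetical names, and the only thing taken from the thread is the sequence itself (build an InternalScan from a Scan, set checkOnlyMemStore, then open a RegionScanner).

```java
import java.util.ArrayList;
import java.util.List;

// Plain-Java model of restricting a scan to the MemStore; no HBase classes are used.
final class MemStoreOnlySketch {

    // Where a candidate scanner reads from.
    enum Source { MEMSTORE, STORE_FILE }

    // Hypothetical modes: read everything, or restrict to one kind of source.
    enum Mode { ALL, MEMSTORE_ONLY, STORE_FILES_ONLY }

    // Hypothetical stand-in for one internal scanner of a store.
    static final class CandidateScanner {
        final String name;
        final Source source;

        CandidateScanner(String name, Source source) {
            this.name = name;
            this.source = source;
        }
    }

    // Keeps only the scanners allowed by the mode and returns their names.
    static List<String> select(List<CandidateScanner> candidates, Mode mode) {
        List<String> kept = new ArrayList<>();
        for (CandidateScanner s : candidates) {
            boolean keep = mode == Mode.ALL
                    || (mode == Mode.MEMSTORE_ONLY && s.source == Source.MEMSTORE)
                    || (mode == Mode.STORE_FILES_ONLY && s.source == Source.STORE_FILE);
            if (keep) {
                kept.add(s.name);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<CandidateScanner> candidates = new ArrayList<>();
        candidates.add(new CandidateScanner("memstore", Source.MEMSTORE));
        candidates.add(new CandidateScanner("hfile-1", Source.STORE_FILE));
        candidates.add(new CandidateScanner("hfile-2", Source.STORE_FILE));

        // With the memstore-only mode, the store file scanners are dropped.
        System.out.println(select(candidates, Mode.MEMSTORE_ONLY));
    }
}
```

One thing to keep in mind with this approach: data written inside the 10-15 minute window may already have been flushed to a store file, in which case a memstore-only scan would not see it.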

Re: How to scan only Memstore from end point co-processor

2015-06-01 Thread Gautam Borah
Thanks Vladimir. We will try this out soon.
Regards,
Gautam
On Mon, Jun 1, 2015 at 12:22 AM, Vladimir Rodionov wrote:
> InternalScan has ctor from Scan object
>
> See https://issues.apache.org/jira/browse/HBASE-12720
>
> You can instantiate InternalScan from Scan, set checkOnlyMemStore, then
>