[ 
https://issues.apache.org/jira/browse/HBASE-3247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12935574#action_12935574
 ] 

Jonathan Gray commented on HBASE-3247:
--------------------------------------

Scanning requires you to look at all the data (or at least, more than just the 
data you need).  I think that would prove far to inefficient for something like 
keeping a search index up to date which you expect to be as "realtime" as 
possible.

This is about only needing to see the deltas.

> Changes API: API for pulling edits from HBase
> ---------------------------------------------
>
>                 Key: HBASE-3247
>                 URL: https://issues.apache.org/jira/browse/HBASE-3247
>             Project: HBase
>          Issue Type: Task
>            Reporter: stack
>
> Talking to Shay from Elastic Search, he was asking where the Changes API is 
> in HBase.  Talking more -- there was a bit of beer involved so apologize up 
> front -- he wants to be able to bootstrap an index and thereafter ask HBase 
> for changes since time t.  We thought he could tie into the replication 
> stream, but rather he wants to be able to pull rather than have it pushed to 
> him (in case he crashes, etc. so on recovery he can start pulling again from 
> last good edit received).  He could do the bootstrap with a Scan.  
> Thereafter, requests to pull from hbase would pass a marker of some  sort.  
> HBase would then give out edits that came in after this marker, in batches, 
> along with an updated marker.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to