Hi All,

I have a use case where I need to replicate data from HBase into
Elasticsearch. I've found two implementations of an HBase River that
accomplishes this.

One uses timestamps to do a timerange scan of the table (since last sync)
and replicates data across. For many reasons this is not desirable.

The other hooks into the HBase replication mechanism to get update from
WALEdits. However, it was written against 0.94 and we're running 0.96. I'm
trying to update/rewrite the river, but I just don't know where to start.
Can anyone give me some guidance for writing a custom HBase replicator?

Thanks,
Pradeep

P.S: For the short term, we're probably going to start using the first even
with it's downsides, but we'll need to migrate off that quickly.

Reply via email to