Hi,

Do you have any suggestions on how to track HBase updates that are less
intrusive that accessing the WAL log on the HDFS directly?

Thanks.

On Thu, Jun 7, 2018 at 3:33 PM, Sergio Esteves <sroeste...@gmail.com> wrote:

> OK, I am starting with the connector that will rely on an abstract event
> listener. I can also work in parallel on the Beam integration, just let me
> know how you want to proceed.
>
> Best,
> Sergio.
>
> On Thu, Jun 7, 2018 at 10:19 AM, Von Gosling <vongosl...@apache.org>
> wrote:
>
>> IMO, Co-processor is seldom used in the production consider its heavy
>> cost and security problem. for simplicity, you could design and implement
>> the connector for HBase.
>>
>> BTW, do we plan to raise the priority of Beam Integration?
>>
>>
>> Best Regards,
>> Von Gosling
>>
>> 在 2018年6月5日,21:41,Sergio Esteves <sroeste...@gmail.com> 写道:
>>
>> n that keeps a database in synch with a rocketmq topic.
>> The first way that comes to my mind to listen to HBase updates on a
>> table, is by using co-processors (akin to BigTable Observers):
>> https://blogs.apache.org/hbase/entry/coprocessor_introduction
>> With co-processors it is possible to insert hooks for data manipulation
>> events (e.g., put, delete). It is also possible to intercept WAL
>> (Write-Ahead Log that keeps track of updates on a table) writing of events.
>> The co-processors reside on the server side and are distributed
>> automatically alongside HBase region servers. Co-processor custom
>> implementations require modifications in the HBase configuration. By not
>> using co-processors, it would be possible to access directly the WAL on
>> HDFS (in case WAL writing is not disabled).
>>
>> These are the two main ways (co-processors or accessing WAL directly) I
>> can see to listen to HBase table updates. Using co-processors seems the
>> most natural way to go. For example, implementing a WALObserver and keep
>> track of the last position that was replicated to the rocketmq topic. Do
>> you agree?
>>
>>
>>
>

Reply via email to