[ https://issues.apache.org/jira/browse/FLINK-10245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16608642#comment-16608642 ]
Shimin Yang commented on FLINK-10245: ------------------------------------- Hi [~hequn8128], For the comments you mentioned last time, I looked into the HBase client implementation and think that I can add a scheduler to flush the data periodically by the time set by user. I am not very sure about should I replace the api with Hbase batch api since it already provided buffer and flush functionality. And if I stick with this api, I think it's hard to deduplicate data using rowkey as it is buffered in the BufferedMutator in HBase client and there's no deletion of Mutator function provided. What do you think? Best Shimin > Add DataStream HBase Sink > ------------------------- > > Key: FLINK-10245 > URL: https://issues.apache.org/jira/browse/FLINK-10245 > Project: Flink > Issue Type: Sub-task > Components: Streaming Connectors > Reporter: Shimin Yang > Assignee: Shimin Yang > Priority: Major > Labels: pull-request-available > > Design documentation: > [https://docs.google.com/document/d/1of0cYd73CtKGPt-UL3WVFTTBsVEre-TNRzoAt5u2PdQ/edit?usp=sharing] -- This message was sent by Atlassian JIRA (v7.6.3#76005)