[
https://issues.apache.org/jira/browse/FLINK-2055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15509756#comment-15509756
]
ASF GitHub Bot commented on FLINK-2055:
---------------------------------------
Github user fhueske commented on the issue:
https://github.com/apache/flink/pull/2332
Hi @delding, I'm sorry that I did not mention this earlier, but I just
noticed that the `HBaseSink` does not implement any logic for checkpointing and
fault-tolerance.
The checkpointing logic of a sink and the guarantees of the external data
store define what kind of emission guarantees a sink function can offer
(exactly once, at-least once). In order to make this PR mergable, we need to
figure out what kind of guarantees it provides under which assumptions / in
which scenarios. As an example see the [documentation of the Cassandra
sink](https://ci.apache.org/projects/flink/flink-docs-master/dev/connectors/cassandra.html).
This is very important because users will rely on the stated guarantees.
I am not very familiar with HBase and what kind of write semantics and
transnational guarantees it provides. Can you describe which kind of guarantees
an `HBaseSink` can achieve and how it would be integrated with Flink's
Checkpointing mechanism?
Thanks, Fabian
> Implement Streaming HBaseSink
> -----------------------------
>
> Key: FLINK-2055
> URL: https://issues.apache.org/jira/browse/FLINK-2055
> Project: Flink
> Issue Type: New Feature
> Components: Streaming, Streaming Connectors
> Affects Versions: 0.9
> Reporter: Robert Metzger
> Assignee: Erli Ding
>
> As per :
> http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Write-Stream-to-HBase-td1300.html
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)