[ https://issues.apache.org/jira/browse/STORM-960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14638755#comment-14638755 ]
Aaron Dossett commented on STORM-960:
-------------------------------------

I realize this PR also contains the commit for STORM-951, which I was testing with. I can remove the STORM-951 commit if necessary.

> Hive-Bolt can lose tuples when flushing data
> --------------------------------------------
>
>                 Key: STORM-960
>                 URL: https://issues.apache.org/jira/browse/STORM-960
>             Project: Apache Storm
>          Issue Type: Improvement
>          Components: external
>            Reporter: Aaron Dossett
>            Assignee: Aaron Dossett
>            Priority: Minor
>
> In HiveBolt's execute method tuples are ack'd as they are received. When a
> batchsize of tuples has been received, the writers are flushed. However, if
> the flush fails only the most recent tuple will be marked as failed. All
> prior tuples will already have been ack'd. This creates a window for data
> loss.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
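
For readers following the issue description, here is a minimal sketch of one way to close the window it describes: hold tuples until the batch has been flushed, then ack all of them on success or fail all of them on error. It is not the code from the PR and does not use the Storm API. The names RecordingCollector, FlakyWriter, BatchingBolt and run are hypothetical stand-ins, and tuples are modeled as plain strings so the example runs on its own.

```java
import java.util.ArrayList;
import java.util.List;

public class BatchAckSketch {

    /** Hypothetical stand-in for a Storm output collector; it only records calls. */
    static class RecordingCollector {
        final List<String> acked = new ArrayList<>();
        final List<String> failed = new ArrayList<>();
        void ack(String tuple) { acked.add(tuple); }
        void fail(String tuple) { failed.add(tuple); }
    }

    /** Hypothetical writer whose flush can be forced to fail. */
    static class FlakyWriter {
        private final boolean failOnFlush;
        FlakyWriter(boolean failOnFlush) { this.failOnFlush = failOnFlush; }
        void write(String tuple) { /* a real writer would buffer the record here */ }
        void flush() {
            if (failOnFlush) {
                throw new IllegalStateException("simulated flush failure");
            }
        }
    }

    /** Defers ack/fail until the whole batch has been flushed. */
    static class BatchingBolt {
        private final int batchSize;
        private final FlakyWriter writer;
        private final RecordingCollector collector;
        private final List<String> pending = new ArrayList<>();

        BatchingBolt(int batchSize, FlakyWriter writer, RecordingCollector collector) {
            this.batchSize = batchSize;
            this.writer = writer;
            this.collector = collector;
        }

        void execute(String tuple) {
            writer.write(tuple);
            pending.add(tuple);          // remember the tuple, do not ack it yet
            if (pending.size() < batchSize) {
                return;
            }
            try {
                writer.flush();
                for (String t : pending) {
                    collector.ack(t);    // safe to ack: the batch is durable
                }
            } catch (RuntimeException e) {
                for (String t : pending) {
                    collector.fail(t);   // every tuple in the batch can be replayed
                }
            } finally {
                pending.clear();
            }
        }
    }

    /** Feeds exactly one batch through the bolt and returns what was recorded. */
    static RecordingCollector run(int batchSize, boolean failOnFlush) {
        RecordingCollector collector = new RecordingCollector();
        BatchingBolt bolt = new BatchingBolt(batchSize, new FlakyWriter(failOnFlush), collector);
        for (int i = 1; i <= batchSize; i++) {
            bolt.execute("tuple-" + i);
        }
        return collector;
    }

    public static void main(String[] args) {
        RecordingCollector ok = run(3, false);
        System.out.println("flush ok:     acked=" + ok.acked + " failed=" + ok.failed);
        RecordingCollector bad = run(3, true);
        System.out.println("flush failed: acked=" + bad.acked + " failed=" + bad.failed);
    }
}
```

With acking deferred this way, a failed flush leaves no tuple in the batch ack'd, so the spout can replay all of them. The trade-off in this sketch is that tuples stay pending until the batch fills, so a real bolt would also need to flush before the topology's tuple timeout expires.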