Matt Burgess created NIFI-12731:
-----------------------------------

             Summary: GetHBase should save state whenever the session is committed
                 Key: NIFI-12731
                 URL: https://issues.apache.org/jira/browse/NIFI-12731
             Project: Apache NiFi
          Issue Type: Bug
          Components: Extensions
            Reporter: Matt Burgess
             Fix For: 2.0.0, 1.26.0

Currently, GetHBase commits the session after each set of 500 rows/FlowFiles (so as not to run out of memory buffering millions of rows/FlowFiles), but it does not update the processor state at that point. If an error occurs partway through processing the table, the state has not been updated even though FlowFiles have already been sent downstream, so restarting the processor results in duplicate data. GetHBase should save the current state whenever the session is committed.
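
The failure mode can be illustrated with a small, self-contained Java simulation. This is only a sketch under stated assumptions and does not use the NiFi or HBase APIs: the class `CheckpointSketch`, its fields, and the row numbering are all hypothetical stand-ins (a `long` for the processor state, a list for FlowFiles already committed downstream). It compares saving state only at the end of the scan with saving state on every batch commit.

```java
import java.util.ArrayList;
import java.util.List;

public class CheckpointSketch {
    static final int BATCH_SIZE = 500;

    // Hypothetical stand-in for persisted processor state: last row known to be delivered.
    long savedState = 0;
    // Hypothetical stand-in for FlowFiles that have been committed and sent downstream.
    final List<Long> downstream = new ArrayList<>();

    // Scans rows savedState+1..totalRows. failAt simulates an error just before
    // that row is read (0 means no failure).
    void scan(long totalRows, long failAt, boolean saveStateOnCommit) {
        List<Long> batch = new ArrayList<>();
        for (long row = savedState + 1; row <= totalRows; row++) {
            if (row == failAt) {
                // Rows still in the uncommitted batch are lost, as with a rolled-back session.
                throw new IllegalStateException("simulated failure at row " + row);
            }
            batch.add(row);
            if (batch.size() == BATCH_SIZE) {
                commit(batch, saveStateOnCommit);
            }
        }
        // End of scan: commit the remainder and always save state.
        commit(batch, false);
        savedState = Math.max(savedState, totalRows);
    }

    void commit(List<Long> batch, boolean saveState) {
        if (batch.isEmpty()) {
            return;
        }
        downstream.addAll(batch); // "session commit": rows are now visible downstream
        if (saveState) {
            savedState = batch.get(batch.size() - 1); // checkpoint alongside the commit
        }
        batch.clear();
    }

    // Runs a scan that fails at row 1101 of 1200, "restarts", and counts duplicate rows.
    static long duplicatesAfterRestart(boolean saveStateOnCommit) {
        CheckpointSketch p = new CheckpointSketch();
        try {
            p.scan(1200, 1101, saveStateOnCommit);
        } catch (IllegalStateException e) {
            // Simulated restart: the next scan resumes from savedState.
        }
        p.scan(1200, 0, saveStateOnCommit);
        return p.downstream.size() - p.downstream.stream().distinct().count();
    }

    public static void main(String[] args) {
        System.out.println("duplicates, state saved only at end: " + duplicatesAfterRestart(false));
        System.out.println("duplicates, state saved per commit:  " + duplicatesAfterRestart(true));
    }
}
```

In this simulation, the two batches committed before the failure (rows 1 to 1000) are delivered again after the restart when state is saved only at the end, giving 1000 duplicates; with state saved on every commit, the restart resumes at row 1001 and no duplicates are produced.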