[ https://issues.apache.org/jira/browse/NIFI-10553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joe Witt updated NIFI-10553: ---------------------------- Fix Version/s: (was: 1.18.0) > MergeContent Prematurely Evicts Bins > ------------------------------------ > > Key: NIFI-10553 > URL: https://issues.apache.org/jira/browse/NIFI-10553 > Project: Apache NiFi > Issue Type: Bug > Components: Core Framework > Affects Versions: 1.14.0, 1.16.3 > Reporter: Eric Secules > Priority: Major > > When NiFi's merge processors are configured to defragment, the user wants > flowfiles merged in a specific way according to the `fragment.` attributes. > Hoever, when MergeDocuments is handling many unique values for > `fragment.identifier` it opens up one bin per value until it reaches the > `MAX_BIN_COUNT` parameter configured on this processor. This parameter is > there to limit memory used by merging too many things all at once. It is not > certain that the user will be able to set this to an appropriate value for > every flow, and the consequence is that evicting a partially filled bin will > result in possible downstream issues and flowfiles stuck in the input > connection of MergeDocuments. > > Instead of this behaviour, the merge processor should penalize and requeue > flowfiles that don't fit in any of the existing bins if we have reached the > max number of bins already. Penalizing non-matching flowfiles will give time > for the ones needed to complete the existing bins to arrive. -- This message was sent by Atlassian Jira (v8.20.10#820010)