[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15826193#comment-15826193 ]

ASF GitHub Bot commented on NIFI-2861:

Github user jskora commented on the issue:

    https://github.com/apache/nifi/pull/1128

    Closing per commit a3d95dc1582f2edfd7997c5d8a23105e88729d11 by @mosermw.

> ControlRate should accept more than one flow file per execution
> ---
>
> Key: NIFI-2861
> URL: https://issues.apache.org/jira/browse/NIFI-2861
> Project: Apache NiFi
> Issue Type: Bug
> Components: Core Framework
> Affects Versions: 1.0.0, 0.7.0
> Reporter: Joe Skora
> Assignee: Joe Skora
> Fix For: 0.8.0, 1.2.0
>
>
> The {{ControlRate}} processor implements a {{FlowFileFilter}} that returns
> the {{FlowFileFilter.ACCEPT_AND_TERMINATE}} result if the {{FlowFile}} fits
> within the rate limit, effectively limiting it to one {{FlowFile}} per
> {{ControlRate.onTrigger()}} invocation. This is a significant bottleneck when
> processing very large quantities of small files, making it unlikely to hit the
> rate limits.
> It should allow multiple files, perhaps with a configurable maximum, per
> {{ControlRate.onTrigger()}} invocation by issuing the
> {{FlowFileFilter.ACCEPT_AND_CONTINUE}} result until the limits are reached.
> In a preliminary test this eliminated the bottleneck.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
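The change requested in the issue can be sketched without any NiFi dependency. This is a minimal illustration, not the committed fix: `SketchResult` stands in for NiFi's filter result values, and `BudgetThrottle`, `BatchingFilterSketch`, and `drain` are hypothetical names invented here. The sketch also assumes a fixed budget with no time window and ends the batch on the first rejection; the real processor may handle rejections differently.

```java
/** Stand-in for NiFi's filter result values; only the ones this sketch needs. */
enum SketchResult { ACCEPT_AND_CONTINUE, ACCEPT_AND_TERMINATE, REJECT_AND_TERMINATE }

/** Hypothetical throttle: admits values until a fixed budget is used up (no time window). */
final class BudgetThrottle {
    private final long budget;
    private long used;

    BudgetThrottle(final long budget) {
        this.budget = budget;
    }

    boolean tryAdd(final long value) {
        if (used + value > budget) {
            return false;
        }
        used += value;
        return true;
    }
}

/** Accepts FlowFiles until the throttle refuses one or the batch is full. */
final class BatchingFilterSketch {
    private final BudgetThrottle throttle;
    private final int maxPerBatch;
    private int accepted; // plain int: this sketch uses one filter per invocation

    BatchingFilterSketch(final BudgetThrottle throttle, final int maxPerBatch) {
        this.throttle = throttle;
        this.maxPerBatch = maxPerBatch;
    }

    SketchResult filter(final long flowFileSize) {
        if (!throttle.tryAdd(flowFileSize)) {
            return SketchResult.REJECT_AND_TERMINATE;
        }
        accepted++;
        // Keep pulling while the batch has room; stop once it is full.
        return accepted < maxPerBatch
                ? SketchResult.ACCEPT_AND_CONTINUE
                : SketchResult.ACCEPT_AND_TERMINATE;
    }

    /** Mimics one invocation: offers queued sizes in order, returns how many were accepted. */
    static int drain(final BatchingFilterSketch filter, final long[] queuedSizes) {
        int count = 0;
        for (final long size : queuedSizes) {
            final SketchResult result = filter.filter(size);
            if (result != SketchResult.REJECT_AND_TERMINATE) {
                count++;
            }
            if (result != SketchResult.ACCEPT_AND_CONTINUE) {
                break;
            }
        }
        return count;
    }
}
```

With `maxPerBatch` set to 1 the sketch reproduces the one-FlowFile-per-invocation behaviour described in the issue; larger values let a single invocation drain several small files before the throttle or the batch limit stops it.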
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15826194#comment-15826194 ]

ASF GitHub Bot commented on NIFI-2861:

Github user jskora closed the pull request at:

    https://github.com/apache/nifi/pull/1128
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15822060#comment-15822060 ]

Michael Moser commented on NIFI-2861:

[~jskora] This has been fully merged. Can you close the https://github.com/apache/nifi/pull/1128 PR?
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821221#comment-15821221 ]

ASF GitHub Bot commented on NIFI-2861:

Github user mosermw commented on the issue:

    https://github.com/apache/nifi/pull/1128

    reviewing
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821216#comment-15821216 ]

ASF GitHub Bot commented on NIFI-2861:

Github user jskora commented on the issue:

    https://github.com/apache/nifi/pull/1127

    @mosermw, I will close and resubmit a clean request with squashed commits.
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15821217#comment-15821217 ]

ASF GitHub Bot commented on NIFI-2861:

Github user jskora closed the pull request at:

    https://github.com/apache/nifi/pull/1127
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15764738#comment-15764738 ]

Michael Moser commented on NIFI-2861:

[~markap14] I will pick this up for review if you don't have the time, because I've been in the ControlRate code before. Let me know.
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15705502#comment-15705502 ]

ASF GitHub Bot commented on NIFI-2861:

Github user jskora commented on the issue:

    https://github.com/apache/nifi/pull/1127

    @markap14, any other thoughts? Is this dead?
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15584304#comment-15584304 ]

ASF GitHub Bot commented on NIFI-2861:

Github user jskora commented on the issue:

    https://github.com/apache/nifi/pull/1127

    @markap14 except for removing the limit, review comments have been addressed. The FlowFile limit property has been renamed "Max FlowFiles per Batch" to reflect a batching metaphor instead of onTrigger references, which should be less confusing to users.
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15573404#comment-15573404 ]

ASF GitHub Bot commented on NIFI-2861:

Github user jskora commented on a diff in the pull request:

    https://github.com/apache/nifi/pull/1127#discussion_r83325779

    --- Diff: nifi-nar-bundles/nifi-standard-bundle/nifi-standard-processors/src/main/java/org/apache/nifi/processors/standard/ControlRate.java ---
    @@ -381,6 +392,14 @@ public boolean tryAdd(final long value) {
         private class ThrottleFilter implements FlowFileFilter {

    +        private final long flowFilesPerTrigger;
    +        private final AtomicLong flowFilesFiltered = new AtomicLong(0L);
    --- End diff --

    I mean that I agree it should be an int.
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571998#comment-15571998 ]

ASF GitHub Bot commented on NIFI-2861:

Github user markap14 commented on a diff in the pull request:

    https://github.com/apache/nifi/pull/1127#discussion_r83218149

    --- Diff: nifi-nar-bundles/nifi-standard-bundle/nifi-standard-processors/src/main/java/org/apache/nifi/processors/standard/ControlRate.java ---
    @@ -381,6 +392,14 @@ public boolean tryAdd(final long value) {
         private class ThrottleFilter implements FlowFileFilter {

    +        private final long flowFilesPerTrigger;
    +        private final AtomicLong flowFilesFiltered = new AtomicLong(0L);
    +
    +        ThrottleFilter(final String ffPerTrigger) {
    +            super();
    +            flowFilesPerTrigger = ffPerTrigger == null ? 1L : Long.parseLong(ffPerTrigger);
    --- End diff --

    Should probably be passed in an int or a long, rather than a String
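One way to read this review comment is to parse the configured value once, at the point where configuration is read, and hand the filter a primitive. The sketch below is only an illustration under that reading: `ThrottleFilterSketch`, `parseLimit`, and `limit` are hypothetical names, and the default of 1 for a missing value simply mirrors the `1L` fallback in the diff above.

```java
/** Hypothetical filter that takes its per-trigger limit as an int instead of a String. */
final class ThrottleFilterSketch {
    private final int flowFilesPerTrigger;

    ThrottleFilterSketch(final int flowFilesPerTrigger) {
        if (flowFilesPerTrigger < 1) {
            throw new IllegalArgumentException("limit must be at least 1");
        }
        this.flowFilesPerTrigger = flowFilesPerTrigger;
    }

    int limit() {
        return flowFilesPerTrigger;
    }

    /** Converts the raw property text once, at the configuration boundary. */
    static int parseLimit(final String raw) {
        // A missing value falls back to 1, matching the default in the reviewed diff.
        return raw == null ? 1 : Integer.parseInt(raw.trim());
    }
}
```

A caller would then write `new ThrottleFilterSketch(ThrottleFilterSketch.parseLimit(rawValue))`, so string handling and its failure modes stay outside the filter itself.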
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571996#comment-15571996 ]

ASF GitHub Bot commented on NIFI-2861:

Github user markap14 commented on a diff in the pull request:

    https://github.com/apache/nifi/pull/1127#discussion_r83220874

    --- Diff: nifi-nar-bundles/nifi-standard-bundle/nifi-standard-processors/src/main/java/org/apache/nifi/processors/standard/ControlRate.java ---
    @@ -381,6 +392,14 @@ public boolean tryAdd(final long value) {
         private class ThrottleFilter implements FlowFileFilter {

    +        private final long flowFilesPerTrigger;
    +        private final AtomicLong flowFilesFiltered = new AtomicLong(0L);
    +
    +        ThrottleFilter(final String ffPerTrigger) {
    +            super();
    --- End diff --

    The parent class here is Object. I don't think there's a need to call super()
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571997#comment-15571997 ]

ASF GitHub Bot commented on NIFI-2861:

Github user markap14 commented on a diff in the pull request:

    https://github.com/apache/nifi/pull/1127#discussion_r83217567

    --- Diff: nifi-nar-bundles/nifi-standard-bundle/nifi-standard-processors/src/main/java/org/apache/nifi/processors/standard/ControlRate.java ---
    @@ -115,6 +115,13 @@
                 .addValidator(StandardValidators.NON_EMPTY_VALIDATOR)
                 .expressionLanguageSupported(false)
                 .build();
    +    public static final PropertyDescriptor MAX_FF_PER_TRIGGER = new PropertyDescriptor.Builder()
    --- End diff --

    @jskora I'm not sure that this needs to be configurable. This is an implementation detail that feels a bit leaky to me. Users do not know what an 'onTrigger() call' is. We should probably just cap it at, say, 1000 and at no more than the maximum number of FlowFiles to transfer per 'Time Duration'.
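The fixed cap proposed in this comment amounts to taking the smaller of two numbers. A minimal sketch, assuming the configured rate is expressed as a FlowFile count per time duration (for other rate criteria only the hard cap would apply); `BatchCapSketch`, `HARD_CAP`, and `batchCap` are hypothetical names, and 1000 is the "say 1000" figure from the comment, not a confirmed constant in the code base.

```java
/** Hypothetical helper deriving a per-invocation batch size without a user-facing property. */
final class BatchCapSketch {
    /** Upper bound floated in the review comment; an assumption, not a NiFi constant. */
    static final int HARD_CAP = 1000;

    /** Returns the smaller of the hard cap and the configured FlowFiles per time duration. */
    static int batchCap(final long maxFlowFilesPerDuration) {
        if (maxFlowFilesPerDuration < 1) {
            throw new IllegalArgumentException("rate must be at least 1");
        }
        // The result never exceeds HARD_CAP, so the narrowing cast is safe.
        return (int) Math.min(HARD_CAP, maxFlowFilesPerDuration);
    }
}
```

For example, a rate of 50 FlowFiles per duration would give a batch cap of 50, while a rate of 5000 would be capped at 1000.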
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572000#comment-15572000 ]

ASF GitHub Bot commented on NIFI-2861:

Github user markap14 commented on a diff in the pull request:

    https://github.com/apache/nifi/pull/1127#discussion_r83220736

    --- Diff: nifi-nar-bundles/nifi-standard-bundle/nifi-standard-processors/src/main/java/org/apache/nifi/processors/standard/ControlRate.java ---
    @@ -228,12 +238,13 @@ public void onScheduled(final ProcessContext context) {
             rateControlAttribute = context.getProperty(RATE_CONTROL_ATTRIBUTE_NAME).getValue();
             maximumRateStr = context.getProperty(MAX_RATE).getValue().toUpperCase();
             groupingAttributeName = context.getProperty(GROUPING_ATTRIBUTE_NAME).getValue();
    +        maxFlowFilePerTrigger = context.getProperty(MAX_FF_PER_TRIGGER).getValue();
    --- End diff --

    This should probably be defined as an int, rather than a String, and can then just use context.getProperty().asInteger(). But I would really prefer to remove this property altogether.
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571999#comment-15571999 ]

ASF GitHub Bot commented on NIFI-2861:

Github user markap14 commented on a diff in the pull request:

    https://github.com/apache/nifi/pull/1127#discussion_r83218107

    --- Diff: nifi-nar-bundles/nifi-standard-bundle/nifi-standard-processors/src/main/java/org/apache/nifi/processors/standard/ControlRate.java ---
    @@ -381,6 +392,14 @@ public boolean tryAdd(final long value) {
         private class ThrottleFilter implements FlowFileFilter {

    +        private final long flowFilesPerTrigger;
    +        private final AtomicLong flowFilesFiltered = new AtomicLong(0L);
    --- End diff --

    This filter is not thread-safe... don't think we need an AtomicLong here. Can just use an int.
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15571004#comment-15571004 ]

ASF GitHub Bot commented on NIFI-2861:

GitHub user jskora opened a pull request:

    https://github.com/apache/nifi/pull/1128

    NIFI-2861 ControlRate should accept more than one flow file per execution (0.x)

    Thank you for submitting a contribution to Apache NiFi.

    In order to streamline the review of the contribution we ask you to ensure the following steps have been taken:

    ### For all changes:
    - [x] Is there a JIRA ticket associated with this PR? Is it referenced in the commit message?
    - [x] Does your PR title start with NIFI-XXXX where XXXX is the JIRA number you are trying to resolve? Pay particular attention to the hyphen "-" character.
    - [x] Has your PR been rebased against the latest commit within the target branch (typically master)?
    - [x] Is your initial contribution a single, squashed commit?

    ### For code changes:
    - [ ] Have you ensured that the full suite of tests is executed via mvn -Pcontrib-check clean install at the root nifi folder?
    - [ ] Have you written or updated unit tests to verify your changes?
    - [ ] If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under [ASF 2.0](http://www.apache.org/legal/resolved.html#category-a)?
    - [ ] If applicable, have you updated the LICENSE file, including the main LICENSE file under nifi-assembly?
    - [ ] If applicable, have you updated the NOTICE file, including the main NOTICE file found under nifi-assembly?
    - [ ] If adding new Properties, have you added .displayName in addition to .name (programmatic access) for each of the new properties?

    ### For documentation related changes:
    - [ ] Have you ensured that format looks appropriate for the output in which it is rendered?

    ### Note:
    Please ensure that once the PR is submitted, you check travis-ci for build issues and submit an update to your PR as soon as possible.

    * Support multiple files per onTrigger call. (0.x branch)

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jskora/nifi NIFI-2861-0.x

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nifi/pull/1128.patch

To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message:

    This closes #1128

commit 9d4a6e742bc67838176756bb7ef600e54d2904df
Author: Joe Skora
Date: 2016-10-13T06:18:23Z

    NIFI-2861 ControlRate should accept more than one flow file per execution

    * Support multiple files per onTrigger call. (0.x branch)
[jira] [Commented] (NIFI-2861) ControlRate should accept more than one flow file per execution
[ https://issues.apache.org/jira/browse/NIFI-2861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15570913#comment-15570913 ]

ASF GitHub Bot commented on NIFI-2861:

GitHub user jskora opened a pull request:

    https://github.com/apache/nifi/pull/1127

    NIFI-2861 ControlRate should accept more than one flow file per execution (1.x)

    Thank you for submitting a contribution to Apache NiFi.

    In order to streamline the review of the contribution we ask you to ensure the following steps have been taken:

    ### For all changes:
    - [X] Is there a JIRA ticket associated with this PR? Is it referenced in the commit message?
    - [X] Does your PR title start with NIFI-XXXX where XXXX is the JIRA number you are trying to resolve? Pay particular attention to the hyphen "-" character.
    - [X] Has your PR been rebased against the latest commit within the target branch (typically master)?
    - [X] Is your initial contribution a single, squashed commit?

    ### For code changes:
    - [X] Have you ensured that the full suite of tests is executed via mvn -Pcontrib-check clean install at the root nifi folder?
    - [ ] Have you written or updated unit tests to verify your changes?
    - [ ] If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under [ASF 2.0](http://www.apache.org/legal/resolved.html#category-a)?
    - [ ] If applicable, have you updated the LICENSE file, including the main LICENSE file under nifi-assembly?
    - [ ] If applicable, have you updated the NOTICE file, including the main NOTICE file found under nifi-assembly?
    - [ ] If adding new Properties, have you added .displayName in addition to .name (programmatic access) for each of the new properties?

    ### For documentation related changes:
    - [ ] Have you ensured that format looks appropriate for the output in which it is rendered?

    ### Note:
    Please ensure that once the PR is submitted, you check travis-ci for build issues and submit an update to your PR as soon as possible.

    * Support multiple files per onTrigger call.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jskora/nifi NIFI-2861-1.x

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nifi/pull/1127.patch

To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message:

    This closes #1127

commit d66dd6d0f59784da14a26fce50542224e1cf4b07
Author: Joe Skora
Date: 2016-10-13T05:14:23Z

    NIFI-2861 ControlRate should accept more than one flow file per execution

    * Support multiple files per onTrigger call.