[ 
https://issues.apache.org/jira/browse/NIFI-4872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16367513#comment-16367513
 ] 

ASF GitHub Bot commented on NIFI-4872:
--------------------------------------

Github user joewitt commented on a diff in the pull request:

    https://github.com/apache/nifi/pull/2475#discussion_r168796980
  
    --- Diff: 
nifi-nar-bundles/nifi-amqp-bundle/nifi-amqp-processors/src/main/java/org/apache/nifi/amqp/processors/PublishAMQP.java
 ---
    @@ -62,6 +64,7 @@
             + "and Queue is not set up, the message will have no final 
destination and will return (i.e., the data will not make it to the queue). If "
             + "that happens you will see a log in both app-log and bulletin 
stating to that effect. Fixing the binding "
             + "(normally done by AMQP administrator) will resolve the issue.")
    +@HighResourceUsageScenario(resource = SystemResource.MEMORY)
    --- End diff --
    
    We need to be able to articulate the memory usage.  Is it that every 
message published is fully loaded into memory in a byte[] therefore large 
messages will consume large amounts of heap?  Same for a lot of items below.  
We need to be able to let the developer explain.  In some cases we have 
processors that operate on batches of things and people will worry it is the 
batch that is the problem.  But in reality it is that if any single 
event/record is large within a batch that single event will be in mem/etc...


> NIFI component high resource usage annotation
> ---------------------------------------------
>
>                 Key: NIFI-4872
>                 URL: https://issues.apache.org/jira/browse/NIFI-4872
>             Project: Apache NiFi
>          Issue Type: New Feature
>          Components: Core Framework, Core UI
>    Affects Versions: 1.5.0
>            Reporter: Jeff Storck
>            Assignee: Jeff Storck
>            Priority: Critical
>
> NiFi Processors currently have no means to relay whether or not they have may 
> be resource intensive or not. The idea here would be to introduce an 
> Annotation that can be added to Processors that indicate they may cause high 
> memory, disk, CPU, or network usage. For instance, any Processor that reads 
> the FlowFile contents into memory (like many XML Processors for instance) may 
> cause high memory usage. What ultimately determines if there is high 
> memory/disk/cpu/network usage will depend on the FlowFiles being processed. 
> With many of these components in the dataflow, it increases the risk of 
> OutOfMemoryErrors and performance degradation.
> The annotation should support one value from a fixed list of: CPU, Disk, 
> Memory, Network.  It should also allow the developer to provide a custom 
> description of the scenario that the component would fall under the high 
> usage category.  The annotation should be able to be specified multiple 
> times, for as many resources as it has the potential to be high usage.
> By marking components with this new Annotation, we can update the generated 
> Processor documentation to include this fact.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to