[ https://issues.apache.org/jira/browse/NIFI-9598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17642883#comment-17642883 ]
Denis Jakupovic commented on NIFI-9598: --------------------------------------- Any news on this? > Load Balancing on labeled nodes and/or fixed amount of usable nodes in > process groups > ------------------------------------------------------------------------------------- > > Key: NIFI-9598 > URL: https://issues.apache.org/jira/browse/NIFI-9598 > Project: Apache NiFi > Issue Type: Improvement > Affects Versions: 1.15.3 > Reporter: Denis Jakupovic > Priority: Trivial > > One of NiFi's great features is its linear scalability by adding just more > nodes. However by only having the distribute load processor or by round > robin, load balance by attribute name or to a single node feature in the > connection, we could need a more granular form of distributing flowfiles > through the cluster. > Let's assume we have a 10 node NiFi Cluster. > Round Robin: Each node would get 1/10 of the flowfiles. > Single Node: Only one node would process all FF. Chance that other process > groups distribute to same node is 1/10 > By Attribute: 1-10 nodes could get the data, not evenly partitioned > Distribute Load Processor: Manual and fixed process, cannot scale with adding > more nodes to the cluster and needs > By having several dataflows with different use cases with enormous variance > in computation, one or a few dataflows can slow down all other data flows. > Therefore a solution could be partitioning the data to labeled nodes or by > setting the maximum allowed nodes to use for FF partitioning/load balancing > on process groups or a connection. > In the cluster configuration each node could be labeled. Distributing the FF > by round robin would only be distributed to the labeled nodes with the proper > label. A distribution by attribute name would mean to build the attribute > accordingly and cannot be build dynamically. > Another great feature would be the maximum amount of nodes a process group > can use to distribute nodes. -- This message was sent by Atlassian Jira (v8.20.10#820010)