Sophie Blee-Goldman created KAFKA-9282:
------------------------------------------

             Summary: Consider more flexible node grouping for Pattern 
subscription
                 Key: KAFKA-9282
                 URL: https://issues.apache.org/jira/browse/KAFKA-9282
             Project: Kafka
          Issue Type: Improvement
          Components: streams
            Reporter: Sophie Blee-Goldman


The current grouping for input topics using pattern subscription creates a 
single node group for all matching topics, meaning the number of tasks scales 
with the maximum partition count across all topics. This reduces overhead and 
is suitable for some scenarios, but limits the ability to scale out and 
prevents easily parallelized processing of completely independent partitions. 
We should consider making it possible for the number of tasks to instead scale 
with the total number of partitions summed across all matching input topics.

Ideally Streams could just autoscale based on some heuristic and the currently 
available resources, but we would have to be careful if those things change. 
Alternatively we could just leave this up to the user to decide, potentially by 
augmenting the Pattern-based source KStream method with a new overload allowing 
this grouping to be specified. For example
{code:java}
StreamsBuilder {
  public KStream stream(topicPattern, numTasks);
}
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to