Kanak Biscuitwala created KAFKA-3966:
----------------------------------------

             Summary: Consumer briefly ignores partitions on backlogs
                 Key: KAFKA-3966
                 URL: https://issues.apache.org/jira/browse/KAFKA-3966
             Project: Kafka
          Issue Type: Bug
            Reporter: Kanak Biscuitwala
         Attachments: screenshot-1.png

Setup:
Kafka 0.10.0.0
9 brokers
36 partitions
12 members in the consumer group
5 producers, partitioning data roughly evenly
max.poll.records = 10
~1500 events/sec
~500 bytes/message
KafkaConsumer API

In the following setup, 3 partitions are assigned to each consumer (and 12 are 
served by each broker). I've noticed that the third of the three partitions 
tends to be ignored in favor of the first two on each consumer, even though the 
third partition has data available. Only until the first two partitions are 
fully caught up does the consumer return back messages from the third. This 
causes a frustrating imbalance in which the same partitions always fall behind.

As a side note, this is true for all of our topics, regardless of partitioning 
strategy. The problem goes away if there are exactly as many consumers as 
partitions.

I can attach a screenshot showing the same partitions falling behind (verified 
that they're each assigned to different nodes), if that is helpful.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to