[ https://issues.apache.org/jira/browse/KAFKA-4779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rajini Sivaram reassigned KAFKA-4779:
-------------------------------------

    Assignee: Rajini Sivaram

> Failure in kafka/tests/kafkatest/tests/core/security_rolling_upgrade_test.py
> ----------------------------------------------------------------------------
>
>                 Key: KAFKA-4779
>                 URL: https://issues.apache.org/jira/browse/KAFKA-4779
>             Project: Kafka
>          Issue Type: Bug
>            Reporter: Apurva Mehta
>            Assignee: Rajini Sivaram
>
> This test failed on 01/29, on both trunk and 0.10.2, error message:
> {noformat}
> The consumer has terminated, or timed out, on node ubuntu@worker3.
> Traceback (most recent call last):
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.2/kafka/venv/local/lib/python2.7/site-packages/ducktape-0.6.0-py2.7.egg/ducktape/tests/runner_client.py", line 123, in run
>     data = self.run_test()
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.2/kafka/venv/local/lib/python2.7/site-packages/ducktape-0.6.0-py2.7.egg/ducktape/tests/runner_client.py", line 176, in run_test
>     return self.test_context.function(self.test)
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.2/kafka/venv/local/lib/python2.7/site-packages/ducktape-0.6.0-py2.7.egg/ducktape/mark/_mark.py", line 321, in wrapper
>     return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.2/kafka/tests/kafkatest/tests/core/security_rolling_upgrade_test.py", line 148, in test_rolling_upgrade_phase_two
>     self.run_produce_consume_validate(self.roll_in_secured_settings, client_protocol, broker_protocol)
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.2/kafka/tests/kafkatest/tests/produce_consume_validate.py", line 100, in run_produce_consume_validate
>     self.stop_producer_and_consumer()
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.2/kafka/tests/kafkatest/tests/produce_consume_validate.py", line 87, in stop_producer_and_consumer
>     self.check_alive()
>   File "/var/lib/jenkins/workspace/system-test-kafka-0.10.2/kafka/tests/kafkatest/tests/produce_consume_validate.py", line 79, in check_alive
>     raise Exception(msg)
> Exception: The consumer has terminated, or timed out, on node ubuntu@worker3.
> {noformat}
> Looks like the console consumer times out:
> {noformat}
> [2017-01-30 04:56:00,972] ERROR Error processing message, terminating consumer process: (kafka.tools.ConsoleConsumer$)
> kafka.consumer.ConsumerTimeoutException
>     at kafka.consumer.NewShinyConsumer.receive(BaseConsumer.scala:90)
>     at kafka.tools.ConsoleConsumer$.process(ConsoleConsumer.scala:120)
>     at kafka.tools.ConsoleConsumer$.run(ConsoleConsumer.scala:75)
>     at kafka.tools.ConsoleConsumer$.main(ConsoleConsumer.scala:50)
>     at kafka.tools.ConsoleConsumer.main(ConsoleConsumer.scala)
> {noformat}
> A bunch of these security_rolling_upgrade tests failed, and in all cases, the
> producer produced ~15k messages, of which ~7k were acked, and the consumer
> only got around ~2600 before timing out.
> There are a lot of messages like the following for different request types on
> the producer and consumer:
> {noformat}
> [2017-01-30 05:13:35,954] WARN Received unknown topic or partition error in produce request on partition test_topic-0. The topic/partition may not exist or the user may not have Describe access to it (org.apache.kafka.clients.producer.internals.Sender)
> {noformat}

--
This message was sent by Atlassian JIRA (v6.3.15#6346)