Andrew Olson created KAFKA-4599:
-----------------------------------

             Summary: KafkaConsumer encounters SchemaException when Kafka broker stopped
                 Key: KAFKA-4599
                 URL: https://issues.apache.org/jira/browse/KAFKA-4599
             Project: Kafka
          Issue Type: Bug
          Components: consumer
            Reporter: Andrew Olson


We recently observed an issue in production that can apparently occur a small 
percentage of the time when a Kafka broker is stopped. We're using version 
0.9.0.1 for all brokers and clients.

During a recent episode, 3 KafkaConsumer instances (out of approximately 100) 
ran into the following SchemaException within a few seconds of instructing the 
broker to shut down.

{noformat}
2017-01-04 14:46:19 org.apache.kafka.common.protocol.types.SchemaException: Error reading field 'responses': Error reading array of size 2774863, only 62 bytes available
        at org.apache.kafka.common.protocol.types.Schema.read(Schema.java:71)
        at org.apache.kafka.clients.NetworkClient.handleCompletedReceives(NetworkClient.java:439)
        at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:265)
        at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.clientPoll(ConsumerNetworkClient.java:320)
        at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.poll(ConsumerNetworkClient.java:213)
        at org.apache.kafka.clients.consumer.internals.ConsumerNetworkClient.poll(ConsumerNetworkClient.java:193)
        at org.apache.kafka.clients.consumer.KafkaConsumer.pollOnce(KafkaConsumer.java:908)
        at org.apache.kafka.clients.consumer.KafkaConsumer.poll(KafkaConsumer.java:853)
{noformat}

The exception message was slightly different for one consumer:
{{Error reading field 'responses': Error reading array of size 2774863, only 260 bytes available}}
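
For readers unfamiliar with this error text: a message of this shape is consistent with a 4-byte length prefix being read from a response buffer and the decoded element count exceeding the bytes that remain. The Java sketch below reproduces that arithmetic with a plain ByteBuffer. The class name ArraySizeCheckSketch and the helper readArraySize are hypothetical, and IllegalStateException stands in for SchemaException. This is a simplified illustration under those assumptions, not Kafka's actual parsing code, and it says nothing about why the buffer held these bytes.

```java
import java.nio.ByteBuffer;

public class ArraySizeCheckSketch {

    // Hypothetical helper: reads a 4-byte big-endian element count and rejects it
    // when the count is larger than the number of bytes left in the buffer.
    static int readArraySize(ByteBuffer buffer) {
        int size = buffer.getInt();
        if (size > buffer.remaining()) {
            throw new IllegalStateException("Error reading array of size " + size
                    + ", only " + buffer.remaining() + " bytes available");
        }
        return size;
    }

    public static void main(String[] args) {
        // 66 bytes in total: a 4-byte prefix that decodes to 2774863,
        // followed by 62 bytes of (zeroed) payload.
        ByteBuffer buffer = ByteBuffer.allocate(66);
        buffer.putInt(2774863);
        buffer.rewind(); // back to position 0 so the prefix can be read

        try {
            readArraySize(buffer);
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

With a 66-byte buffer the sketch reports 62 bytes available. A 264-byte buffer with the same prefix would report 260, matching the variation between the two messages above.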

The exception was not caught and caused the Storm Executor thread to restart, 
so it's not clear whether the error would have been transient or fatal for the 
KafkaConsumer.

Here are the initial broker shutdown logs:

{noformat}
2017-01-04 14:46:15,869 INFO kafka.server.KafkaServer: [Kafka Server 4], shutting down
2017-01-04 14:46:16,298 INFO kafka.server.ReplicaFetcherThread: [ReplicaFetcherThread-1-40], Shutting down
2017-01-04 14:46:18,364 INFO kafka.server.ReplicaFetcherThread: [ReplicaFetcherThread-1-40], Stopped
2017-01-04 14:46:18,364 INFO kafka.server.ReplicaFetcherThread: [ReplicaFetcherThread-1-40], Shutdown completed
2017-01-04 14:46:18,612 INFO kafka.server.ReplicaFetcherThread: [ReplicaFetcherThread-3-30], Shutting down
2017-01-04 14:46:19,547 INFO kafka.server.KafkaServer: [Kafka Server 4], Controlled shutdown succeeded
2017-01-04 14:46:19,554 INFO kafka.network.SocketServer: [Socket Server on Broker 4], Shutting down
2017-01-04 14:46:19,593 INFO kafka.network.SocketServer: [Socket Server on Broker 4], Shutdown completed
{noformat}

We've found one very similar reported occurrence:
http://mail-archives.apache.org/mod_mbox/kafka-users/201605.mbox/%3CCAGnq0kFPm%2Bd0Xdm4tY_O7MnV3_LqLU10uDhPwxzv-T7UnHy08g%40mail.gmail.com%3E



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
