[
https://issues.apache.org/jira/browse/KAFKA-609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13496374#comment-13496374
]
John Fung commented on KAFKA-609:
---------------------------------
* Testcase : 9001
* Hudson Build : # 250
* Functional Test Group : Migration Tool (No bouncing)
* Failure : ConsoleConsumer doesn't receive any messages as it terminated after
4 "Rebalancing attempt failed":
* Testcase archive :
<host_0996>:/export/home/kafka/kafka_hudson_test_output_logs/test_1352766410/testcase_9001/logs
1. ConsoleConsumer log4j messages showing 4 attempts failed:
[~/kafka_hudson_test_output_logs/test_1352766410/testcase_9001/logs] grep
"Rebalancing attempt failed" console_consumer-8/console_consumer.log
[2012-11-13 00:31:25,017] INFO
[console-consumer-15395_host0997-1352766684787-46c22f38], Rebalancing attempt
failed. Clearing the cache before the next rebalancing operation is triggered
(kafka.consumer.ZookeeperConsumerConnector)
[2012-11-13 00:31:27,052] INFO
[console-consumer-15395_host0997-1352766684787-46c22f38], Rebalancing attempt
failed. Clearing the cache before the next rebalancing operation is triggered
(kafka.consumer.ZookeeperConsumerConnector)
[2012-11-13 00:31:29,064] INFO
[console-consumer-15395_host0997-1352766684787-46c22f38], Rebalancing attempt
failed. Clearing the cache before the next rebalancing operation is triggered
(kafka.consumer.ZookeeperConsumerConnector)
[2012-11-13 00:31:31,076] INFO
[console-consumer-15395_host0997-1352766684787-46c22f38], Rebalancing attempt
failed. Clearing the cache before the next rebalancing operation is triggered
(kafka.consumer.ZookeeperConsumerConnector)
. . .
[2012-11-13 00:31:31,076] INFO
[console-consumer-15395_host0997-1352766684787-46c22f38], Cleared all relevant
queues for this fetcher (kafka.consumer.ZookeeperConsumerConnector)
[2012-11-13 00:31:31,076] INFO
[console-consumer-15395_host0997-1352766684787-46c22f38], Cleared the data
chunks in all the consumer message iterators
(kafka.consumer.ZookeeperConsumerConnector)
[2012-11-13 00:31:31,076] INFO
[console-consumer-15395_host0997-1352766684787-46c22f38], Committing all
offsets after clearing the fetcher queues
(kafka.consumer.ZookeeperConsumerConnector)
[2012-11-13 00:31:33,078] ERROR Error processing message, stopping consumer:
(kafka.consumer.ConsoleConsumer$)
kafka.common.ConsumerRebalanceFailedException:
console-consumer-15395_host0997-1352766684787-46c22f38 can't rebalance after 4
retries
at
kafka.consumer.ZookeeperConsumerConnector$ZKRebalancerListener.syncedRebalance(ZookeeperConsumerConnector.scala:388)
at
kafka.consumer.ZookeeperConsumerConnector.kafka$consumer$ZookeeperConsumerConnector$$reinitializeConsumer(ZookeeperConsumerConnector.scala:690)
at
kafka.consumer.ZookeeperConsumerConnector$WildcardStreamsHandler.<init>(ZookeeperConsumerConnector.scala:720)
at
kafka.consumer.ZookeeperConsumerConnector.createMessageStreamsByFilter(ZookeeperConsumerConnector.scala:133)
at kafka.consumer.ConsoleConsumer$.main(ConsoleConsumer.scala:182)
at kafka.consumer.ConsoleConsumer.main(ConsoleConsumer.scala)
[2012-11-13 00:31:33,079] INFO
[console-consumer-15395_host0997-1352766684787-46c22f38], ZKConsumerConnector
shutting down (kafka.consumer.ZookeeperConsumerConnector)
[2012-11-13 00:31:33,080] INFO Forcing shutdown of Kafka scheduler
(kafka.utils.KafkaScheduler)
[2012-11-13 00:31:33,080] INFO [ConsumerFetcherManager-1352766684865] shutting
down (kafka.consumer.ConsumerFetcherManager)
2. All 3 brokers started without any errors nor exceptions until stopped by
System Test:
. . .
[2012-11-13 00:31:20,807] INFO Closing socket connection to /x.x.x.x.
(kafka.network.Processor)
[2012-11-13 00:31:20,807] INFO Closing socket connection to /x.x.x.x.
(kafka.network.Processor)
[2012-11-13 00:31:20,807] INFO Closing socket connection to /x.x.x.x.
(kafka.network.Processor)
[2012-11-13 00:31:44,529] INFO Shutting down Kafka server
(kafka.server.KafkaServer)
[2012-11-13 00:31:44,530] INFO shutdown scheduler kafka-logcleaner-
(kafka.utils.KafkaScheduler)
[2012-11-13 00:31:44,537] INFO shutdown scheduler kafka-logflusher-
(kafka.utils.KafkaScheduler)
[2012-11-13 00:31:44,538] INFO Closing zookeeper client...
(kafka.server.KafkaZooKeeper)
3. Zookeeper log4j messages in Source cluster works fine until terminated by
System Test:
[2012-11-13 00:30:39,802] INFO Got user-level KeeperException when processing
sessionid:0x13af72bbc290003 type:setData cxid:0xb1 zxid:0xfffffffffffffffe
txntype:unknown reqpath:n/a Error
Path:/consumers/test-consumer-group/offsets/test_1/2-3 Error:KeeperErrorCode =
NoNode for /consumers/test-consumer-group/offsets/test_1/2-3
(org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:30:39,803] INFO Got user-level KeeperException when processing
sessionid:0x13af72bbc290003 type:create cxid:0xb2 zxid:0xfffffffffffffffe
txntype:unknown reqpath:n/a Error
Path:/consumers/test-consumer-group/offsets/test_1 Error:KeeperErrorCode =
NodeExists for /consumers/test-consumer-group/offsets/test_1
(org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:31:44,540] INFO Processed session termination for sessionid:
0x13af72bbc290000 (org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:31:44,542] INFO Closed socket connection for client
/x.x.x.x:46477 which had sessionid 0x13af72bbc290000
(org.apache.zookeeper.server.NIOServerCnxn)
[2012-11-13 00:31:44,736] INFO Processed session termination for sessionid:
0x13af72bbc290002 (org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:31:44,737] INFO Closed socket connection for client
/x.x.x.125:39520 which had sessionid 0x13af72bbc290002
(org.apache.zookeeper.server.NIOServerCnxn)
[2012-11-13 00:31:44,855] INFO Processed session termination for sessionid:
0x13af72bbc290001 (org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:31:44,856] INFO Closed socket connection for client
/x.x.x.124:36697 which had sessionid 0x13af72bbc290001
(org.apache.zookeeper.server.NIOServerCnxn)
4. Zookeeper log4j messages in Target cluster :
[2012-11-13 00:31:24,904] INFO Processed session termination for sessionid:
0x13af72bd8660001 (org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:31:24,906] INFO Closed socket connection for client
/x.x.x.123:51194 which had sessionid 0x13af72bd8660001
(org.apache.zookeeper.server.NIOServerCnxn)
[2012-11-13 00:31:24,933] INFO Got user-level KeeperException when processing
sessionid:0x13af72bd8660000 type:create cxid:0x2 zxid:0xfffffffffffffffe
txntype:unknown reqpath:n/a Error Path:/consumers/console-consumer-15395/ids
Error:KeeperErrorCode = NoNode for /consumers/console-consumer-15395/ids
(org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:31:24,934] INFO Got user-level KeeperException when processing
sessionid:0x13af72bd8660000 type:create cxid:0x3 zxid:0xfffffffffffffffe
txntype:unknown reqpath:n/a Error Path:/consumers/console-consumer-15395
Error:KeeperErrorCode = NoNode for /consumers/console-consumer-15395
(org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:31:24,936] INFO Got user-level KeeperException when processing
sessionid:0x13af72bd8660000 type:create cxid:0x4 zxid:0xfffffffffffffffe
txntype:unknown reqpath:n/a Error Path:/consumers Error:KeeperErrorCode =
NoNode for /consumers (org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:31:33,085] INFO Processed session termination for sessionid:
0x13af72bd8660000 (org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:31:33,088] INFO Closed socket connection for client
/x.x.x.123:51193 which had sessionid 0x13af72bd8660000
(org.apache.zookeeper.server.NIOServerCnxn)
[2012-11-13 00:31:33,964] INFO Accepted socket connection from /x.x.x.123:51196
(org.apache.zookeeper.server.NIOServerCnxn)
[2012-11-13 00:31:33,965] INFO Client attempting to establish new session at
/x.x.x.123:51196 (org.apache.zookeeper.server.NIOServerCnxn)
[2012-11-13 00:31:33,967] INFO Established session 0x13af72bd8660002 with
negotiated timeout 30000 for client /x.x.x.123:51196
(org.apache.zookeeper.server.NIOServerCnxn)
[2012-11-13 00:31:33,977] INFO Processed session termination for sessionid:
0x13af72bd8660002 (org.apache.zookeeper.server.PrepRequestProcessor)
[2012-11-13 00:31:33,979] INFO Closed socket connection for client
/x.x.x.123:51196 which had sessionid 0x13af72bd8660002
(org.apache.zookeeper.server.NIOServerCnxn)
5. In 0.7 producer side, the log segment files have data:
[~/kafka_hudson_test_output_logs/test_1352766410/testcase_9001/logs]
.../kafka_08_replication_system_test/system_test/migration_tool_testsuite/0.7/bin/kafka-run-class.sh
kafka.tools.DumpLogSegments -noprint
broker-3/kafka_server_3_logs/test_1-4/00000000000000170959.kafka
Starting offset: 170959
offset: 170959 isvalid: true payloadsize: 76 magic: 1 compresscodec:
NoCompressionCodec
offset: 171045 isvalid: true payloadsize: 61 magic: 1 compresscodec:
NoCompressionCodec
offset: 171116 isvalid: true payloadsize: 160 magic: 1 compresscodec:
NoCompressionCodec
offset: 171286 isvalid: true payloadsize: 253 magic: 1 compresscodec:
NoCompressionCodec
offset: 171549 isvalid: true payloadsize: 419 magic: 1 compresscodec:
NoCompressionCodec
. . .
> System Test Transient Failure 9001 (Migration tool) - ConsoleConsumer
> terminates after "can't rebalance after 4 retries"
> ------------------------------------------------------------------------------------------------------------------------
>
> Key: KAFKA-609
> URL: https://issues.apache.org/jira/browse/KAFKA-609
> Project: Kafka
> Issue Type: Bug
> Reporter: John Fung
>
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira