[ 
https://issues.apache.org/jira/browse/KAFKA-2891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15034892#comment-15034892
 ] 

Rajini Sivaram commented on KAFKA-2891:
---------------------------------------

[~benstopford] The logs from my failing test runs all show the same pattern: 
the ISR shrinks to a single replica, and messages are acked while the leader 
is the only member of the ISR. When the test then kills the leader, those 
messages are lost, as you would expect. The test was intended to run with 
min.insync.replicas set to 2, but due to a bug in the way min.insync.replicas 
was being set for topics, it was left at the default of 1. All tests that 
currently set min.insync.replicas have copied the same config, with the 
result that the setting is never applied. I have updated the PR for KAFKA-2642 
with a fix for the min.insync.replicas setting in all the tests that set it. 
I have scheduled a build with the fix and will check the results in the 
morning.
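
To illustrate why the default setting loses acked messages here, the following 
is a toy model only, not Kafka code and not the test itself. The function 
names, the two-replica layout, and the assumption that the out-of-sync 
follower takes over after the leader is killed are all hypothetical 
simplifications made for the sketch.

```python
# Toy model of acking against an ISR; all names here are hypothetical.

def can_ack(isr, min_insync_replicas):
    """A write is acked only if the ISR has at least min_insync_replicas members."""
    return len(isr) >= min_insync_replicas


def simulate(min_insync_replicas):
    """Return the messages that were acked but are missing after the leader dies."""
    logs = {"leader": [], "follower": []}
    isr = {"leader"}  # assumption: the follower has already dropped out of the ISR
    acked = []

    for msg in range(3):
        if can_ack(isr, min_insync_replicas):
            # Only replicas in the ISR receive the message before the ack.
            for replica in isr:
                logs[replica].append(msg)
            acked.append(msg)
        # Otherwise the write is rejected and the message is never acked.

    # Assumption: the leader is killed and the follower's log is what survives.
    surviving_log = logs["follower"]
    return [m for m in acked if m not in surviving_log]


lost_with_default = simulate(min_insync_replicas=1)
lost_with_two = simulate(min_insync_replicas=2)
print("acked but lost with min.insync.replicas=1:", lost_with_default)
print("acked but lost with min.insync.replicas=2:", lost_with_two)
```

In this model, a setting of 2 means no message is acked while the leader is 
alone in the ISR, so nothing that was acked can go missing. With a setting of 
1, every message is acked and then lost with the leader.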

> Gaps in messages delivered by new consumer after Kafka restart
> --------------------------------------------------------------
>
>                 Key: KAFKA-2891
>                 URL: https://issues.apache.org/jira/browse/KAFKA-2891
>             Project: Kafka
>          Issue Type: Bug
>          Components: consumer
>    Affects Versions: 0.9.0.0
>            Reporter: Rajini Sivaram
>            Priority: Critical
>
> Replication tests when run with the new consumer with SSL/SASL were failing 
> very often because messages were not being consumed from some topics after a 
> Kafka restart. The fix in KAFKA-2877 has made this a lot better. But I am 
> still seeing some failures (less often now) because a small set of messages 
> is not received after a Kafka restart. This failure looks slightly different 
> from the one before the fix for KAFKA-2877 was applied, hence the new defect. 
> The test fails because not all acked messages are received by the consumer, 
> and the number of missing messages is quite small.
> [~benstopford] Are the upgrade tests working reliably with KAFKA-2877 now?
> Not sure if any of these log entries are important:
> {quote}
> [2015-11-25 14:41:12,342] INFO SyncGroup for group test-consumer-group failed 
> due to NOT_COORDINATOR_FOR_GROUP, will find new coordinator and rejoin 
> (org.apache.kafka.clients.consumer.internals.AbstractCoordinator)
> [2015-11-25 14:41:12,342] INFO Marking the coordinator 2147483644 dead. 
> (org.apache.kafka.clients.consumer.internals.AbstractCoordinator)
> [2015-11-25 14:41:12,958] INFO Attempt to join group test-consumer-group 
> failed due to unknown member id, resetting and retrying. 
> (org.apache.kafka.clients.consumer.internals.AbstractCoordinator)
> [2015-11-25 14:41:42,437] INFO Fetch offset null is out of range, resetting 
> offset (org.apache.kafka.clients.consumer.internals.Fetcher)
> {quote}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
