[ 
https://issues.apache.org/jira/browse/KAFKA-9543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17047502#comment-17047502
 ] 

Brian Jones commented on KAFKA-9543:
------------------------------------

I notice the following commit: 
[https://github.com/apache/kafka/commit/a48b5d900c6b5c9c52a97124a1b51aff3636c32c#diff-ffa8861e850121997a534ebdde2929c6L1436]
 changed some code and removed the following comment:

{{// If the fetch occurs on the active segment, there might be a race condition 
where two fetch requests occur after
// the message is appended but before the nextOffsetMetadata is updated. In 
that case the second fetch may
// cause OffsetOutOfRangeException. To solve that, we cap the reading up to 
exposed position instead of the log
// end of the active segment.}}

Any chance that this could have re-introduced the race condition described?

> Consumer offset reset after new segment rolling
> -----------------------------------------------
>
>                 Key: KAFKA-9543
>                 URL: https://issues.apache.org/jira/browse/KAFKA-9543
>             Project: Kafka
>          Issue Type: Bug
>    Affects Versions: 2.4.0
>            Reporter: Rafał Boniecki
>            Priority: Major
>         Attachments: Untitled.png
>
>
> After upgrade from kafka 2.1.1 to 2.4.0, I'm experiencing unexpected consumer 
> offset resets.
> Consumer:
> {code:java}
> 2020-02-12T11:12:58.402+01:00 hostname 4a2a39a35a02 
> [2020-02-12T11:12:58,402][INFO 
> ][org.apache.kafka.clients.consumer.internals.Fetcher] [Consumer 
> clientId=logstash-1, groupId=logstash] Fetch offset 1632750575 is out of 
> range for partition stats-5, resetting offset
> {code}
> Broker:
> {code:java}
> 2020-02-12 11:12:58:400 CET INFO  
> [data-plane-kafka-request-handler-1][kafka.log.Log] [Log partition=stats-5, 
> dir=/kafka4/data] Rolled new log segment at offset 1632750565 in 2 ms.{code}
> All resets are perfectly correlated to rolling new segments at the broker - 
> segment is rolled first, then, couple of ms later, reset on the consumer 
> occurs. Attached is grafana graph with consumer lag per partition. All sudden 
> spikes in lag are offset resets due to this bug.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to