Kamal Chandraprakash created KAFKA-17637:
--------------------------------------------

             Summary: Invert the search for LIST_OFFSETS request for remote 
storage topic
                 Key: KAFKA-17637
                 URL: https://issues.apache.org/jira/browse/KAFKA-17637
             Project: Kafka
          Issue Type: Task
            Reporter: Kamal Chandraprakash


The timestamp in the records are non-monotonic so we begin the search from 
earliest to latest offset for LIST_OFFSETS request.

When tiered storage is enabled for a topic, then we begin the search from 
remote to local storage. There can be possible concurrency issue that can 
happen, when the search moves from remote to local storage, some of the 
local-log segments might get uploaded to remote and deleted from local in the 
meantime. This can lead to loss of precision in returning the offset for the 
given timestamp. If this issue happens, then we might silently search for the 
timestamp in the next/available local-log segment. 

One way to fix this issue, is-to trigger the search in local-log first, then 
move to remote-log, and compare the result. The approach is explained in:

https://github.com/apache/kafka/pull/16602#discussion_r1759757001



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to