Dhruvil Shah created KAFKA-12520:
------------------------------------

             Summary: Producer state is needlessly rebuilt on startup
                 Key: KAFKA-12520
                 URL: https://issues.apache.org/jira/browse/KAFKA-12520
             Project: Kafka
          Issue Type: Bug
            Reporter: Dhruvil Shah


When we find a {{.swap}} file on startup, we typically want to rename and 
replace it as {{.log}}, {{.index}}, {{.timeindex}}, etc. as a way to complete 
any ongoing replace operations. These swap files are usually known to have been 
flushed to disk before the replace operation begins.

One flaw in the current logic is that when we recover these swap files on 
startup, we end up truncating the producer state and rebuild it from scratch. 
This is unneeded as the replace operation does not mutate the producer state by 
itself. It is only meant to replace the {{.log}} file along with corresponding 
indices.

Because of this unneeded producer state rebuild operation, we have seen 
multi-hour startup times for clusters that have large compacted topics.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to