Dhruvil Shah created KAFKA-12520: ------------------------------------ Summary: Producer state is needlessly rebuilt on startup Key: KAFKA-12520 URL: https://issues.apache.org/jira/browse/KAFKA-12520 Project: Kafka Issue Type: Bug Reporter: Dhruvil Shah
When we find a {{.swap}} file on startup, we typically want to rename and replace it as {{.log}}, {{.index}}, {{.timeindex}}, etc. as a way to complete any ongoing replace operations. These swap files are usually known to have been flushed to disk before the replace operation begins. One flaw in the current logic is that when we recover these swap files on startup, we end up truncating the producer state and rebuild it from scratch. This is unneeded as the replace operation does not mutate the producer state by itself. It is only meant to replace the {{.log}} file along with corresponding indices. Because of this unneeded producer state rebuild operation, we have seen multi-hour startup times for clusters that have large compacted topics. -- This message was sent by Atlassian Jira (v8.3.4#803005)