jiafu1115 opened a new pull request, #20007: URL: https://github.com/apache/kafka/pull/20007
We found that one broker's local segment on disk never get removed forever no matter how long it stored. The disk always keep increasing.  note: Partition 2's node is the exception node. After we trouble shooting. we find if one broker is very slow to startup it will cause the TopicBasedRemoteLogMetadataManager#initializeResources's fail sometime. Thus it won't stop the server so that the server still run just with some exception log but not shutdown. It won't upload to remote for the local so that the local segment never to deleted. So propose the change to shutdown the broker to avoid the silence critical error which caused the disk keep increasing forever. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org