jiafu1115 opened a new pull request, #20007:
URL: https://github.com/apache/kafka/pull/20007

   We found that one broker's local segment on disk never get removed forever 
no matter how long it stored. The disk always keep increasing. 
   
   
![image](https://github.com/user-attachments/assets/42129bb6-7d07-481b-923f-971da3ab12da)
   note: Partition 2's node is the exception node.
   
   After we trouble shooting. we find if one broker is very slow to startup it 
will cause the TopicBasedRemoteLogMetadataManager#initializeResources's fail 
sometime. Thus it won't stop the server so that the server still run just with 
some exception log but not shutdown.  It won't upload to remote for the local 
so that the local segment never to deleted.
   
   So propose the change to shutdown the broker to avoid the silence critical 
error which caused the disk keep increasing forever.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to