Dao Quang Minh created KAFKA-7569:
-------------------------------------

             Summary: Kafka doesnt appear to cleanup dangling partitions
                 Key: KAFKA-7569
                 URL: https://issues.apache.org/jira/browse/KAFKA-7569
             Project: Kafka
          Issue Type: Bug
    Affects Versions: 1.0.0
            Reporter: Dao Quang Minh


In our current cluster running kafka 1.0.0, we recently observed that Kafka 
doesnt cleanup dangling partitions ( i.e. partion data on disk, but partition 
is not assigned to the current broker anymore ).

For example of the dangling partition data, we have:

{code}
total 26G
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:19 00000000352433304663.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:16 00000000352414164340.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:24 00000000352466972892.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:23 00000000352457368236.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:17 00000000352423709566.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:21 00000000352447702369.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:20 00000000352442921890.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:22 00000000352452551548.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:17 00000000352418945305.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:18 00000000352428477361.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:15 00000000352409416538.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:24 00000000352462192103.log
-rw-r--r-- 1 kafka kafka 1.9G Aug 6 16:20 00000000352438136012.log
-rw-r--r-- 1 kafka kafka 1.8G Aug 6 17:43 00000000352471757311.log
-rw-r--r-- 1 kafka kafka 10M Oct 16 21:44 00000000352471757311.index
-rw-r--r-- 1 kafka kafka 10M Oct 16 21:44 00000000352471757311.timeindex
drwxr-xr-x 2 kafka kafka 4.0K Oct 8 15:27 .
drwxr-xr-x 49 kafka kafka 4.0K Oct 30 11:21 ..
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352414164340.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352423709566.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352433304663.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352447702369.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352457368236.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352466972892.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352409416538.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352418945305.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352428477361.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352438136012.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352442921890.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352452551548.timeindex
-rw-r--r-- 1 kafka kafka 2.3K Oct 16 21:44 00000000352462192103.timeindex
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352414164340.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352423709566.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352433304663.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352447702369.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352457368236.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352466972892.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352409416538.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352418945305.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352428477361.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352438136012.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352442921890.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352452551548.index
-rw-r--r-- 1 kafka kafka 1.5K Oct 16 21:44 00000000352462192103.index
-rw-r--r-- 1 kafka kafka 20 Aug 6 16:23 leader-epoch-checkpoint
-rw-r--r-- 1 kafka kafka 10 Aug 6 16:24 00000000352466972892.snapshot
-rw-r--r-- 1 kafka kafka 10 Aug 6 16:24 00000000352471757311.snapshot
-rw-r--r-- 1 kafka kafka 10 Oct 8 15:27 00000000352476186724.snapshot
{code}

I'm unsure how we ended up in this situation as partition data should be marked 
as removed and eventually remove when it's not assigned to the broker anymore. 
But in this edge case, should Kafka detect that automatically when it loads the 
partition and re-mark it as to be deleted again ?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to