[ 
https://issues.apache.org/jira/browse/HDFS-729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12792683#action_12792683
 ] 

Hairong Kuang commented on HDFS-729:
------------------------------------

I have two concerns about the approach:
1. If the neededReplicationQueue changes while issuing getBadFiles calls, 
consecutive calls may not be able to return all bad files;
2. Because neededReplicationQueue stores blocks that may belong to the same 
file, so two consecutive badFiles calls may contain duplicate files. 

> fsck option to list only corrupted files
> ----------------------------------------
>
>                 Key: HDFS-729
>                 URL: https://issues.apache.org/jira/browse/HDFS-729
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: dhruba borthakur
>            Assignee: dhruba borthakur
>         Attachments: badFiles.txt, badFiles2.txt, corruptFiles.txt
>
>
> An option to fsck to list only corrupted files will be very helpful for 
> frequent monitoring.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to