[ https://issues.apache.org/jira/browse/HDFS-729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12792683#action_12792683 ]
Hairong Kuang commented on HDFS-729:
------------------------------------

I have two concerns about the approach:
1. If the neededReplicationQueue changes while getBadFiles calls are being issued, consecutive calls may not return all bad files;
2. Because neededReplicationQueue stores blocks, several of which may belong to the same file, two consecutive getBadFiles calls may return duplicate files.

> fsck option to list only corrupted files
> ----------------------------------------
>
>                 Key: HDFS-729
>                 URL: https://issues.apache.org/jira/browse/HDFS-729
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: dhruba borthakur
>            Assignee: dhruba borthakur
>         Attachments: badFiles.txt, badFiles2.txt, corruptFiles.txt
>
>
> An option to fsck to list only corrupted files will be very helpful for frequent monitoring.
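
To make the second concern in the comment concrete, here is a minimal sketch of paging through a block-level queue and removing duplicate file names on the caller's side. It is not the code from the attached patches: BlockEntry, the getBadFiles(queue, offset, limit) signature, and the offset-based cursor are all hypothetical stand-ins chosen for illustration.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

public class BadFilePagingSketch {
    /** Hypothetical stand-in for one queue entry: a block and the file that owns it. */
    static final class BlockEntry {
        final long blockId;
        final String file;

        BlockEntry(long blockId, String file) {
            this.blockId = blockId;
            this.file = file;
        }
    }

    /** Hypothetical paged call: owning files of up to 'limit' blocks, starting at 'offset'. */
    static List<String> getBadFiles(List<BlockEntry> queue, int offset, int limit) {
        List<String> files = new ArrayList<>();
        int end = Math.min(queue.size(), offset + limit);
        for (int i = offset; i < end; i++) {
            files.add(queue.get(i).file);
        }
        return files;
    }

    /** Caller-side loop: pages through the queue and keeps each file name once. */
    static Set<String> listDistinctBadFiles(List<BlockEntry> queue, int pageSize) {
        if (pageSize <= 0) {
            throw new IllegalArgumentException("pageSize must be positive");
        }
        Set<String> seen = new LinkedHashSet<>(); // preserves first-seen order
        for (int offset = 0; offset < queue.size(); offset += pageSize) {
            seen.addAll(getBadFiles(queue, offset, pageSize));
        }
        return seen;
    }

    public static void main(String[] args) {
        List<BlockEntry> queue = new ArrayList<>();
        queue.add(new BlockEntry(1L, "/a")); // two blocks of the same file
        queue.add(new BlockEntry(2L, "/a"));
        queue.add(new BlockEntry(3L, "/b"));

        System.out.println(getBadFiles(queue, 0, 1));        // [/a]
        System.out.println(getBadFiles(queue, 1, 1));        // [/a]  (duplicate across calls)
        System.out.println(listDistinctBadFiles(queue, 1));  // [/a, /b]
    }
}
```

Deduplicating in the caller only addresses the second concern. The first one still applies to this sketch: with an offset-based cursor, entries that leave the queue between two calls shift the remaining entries forward, so a later call can skip files.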