Lin Yiqun created HDFS-9819: ------------------------------- Summary: FsVolume should tolerate few times check-dir failed due to deletion by mistake Key: HDFS-9819 URL: https://issues.apache.org/jira/browse/HDFS-9819 Project: Hadoop HDFS Issue Type: Bug Reporter: Lin Yiqun Assignee: Lin Yiqun Fix For: 2.7.1
FsVolume should tolerate few times check-dir failed because sometimes we will do a delete dir/file operation by mistake in datanode data-dirs. Then the {{DataNode#startCheckDiskErrorThread}} will invoking checkDir method periodicity and find dir not existed, throw exception. The checked volume will be added to failed volume list. The blocks on this volume will be replicated again. But actually, this is not needed to do. We should let volume can be tolerated few times check-dir failed like config {{dfs.datanode.failed.volumes.tolerated}}. -- This message was sent by Atlassian JIRA (v6.3.4#6332)