Istvan Fajth created HDFS-15303:
-----------------------------------

             Summary: Provide a tool that can validate/fix the block file placement in DataNode data directories
                 Key: HDFS-15303
                 URL: https://issues.apache.org/jira/browse/HDFS-15303
             Project: Hadoop HDFS
          Issue Type: Improvement
            Reporter: Istvan Fajth

We recently ran into an issue where, during an upgrade from Hadoop2 to Hadoop3, the filesystem under the DataNode data directories was left in an intermediate state: some of the directories were in Hadoop2 format and others in Hadoop3 format.

We first had to roll back the upgrade, and after the rollback we started to see FileNotFoundExceptions for particular block files. The exception was logged on the DataNodes and sometimes failed jobs as well. The HDFS NameNode did not show any missing blocks, and we found both the block files and the meta files in the DataNode's data directories, but at a different location than expected.

This was the point when we realized that something had gone wrong during the rollback: some of the data directories had blocks placed according to Hadoop3 rules, while others had blocks placed according to Hadoop2 rules. We suspect a premature DataNode shutdown or an unknown failure during the rollback, but by the time we understood what the issue was and could investigate, the logs that could have shown us the cause were no longer available.

This JIRA suggests two new commands to help administrators in this situation: one to validate a data directory and ensure that blocks are placed correctly according to the rules, and one to fix the data directory layout if needed.
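
To illustrate what the validation command could check, here is a rough Python sketch, not the proposed implementation. It assumes that a block's directory is derived from its block ID as subdir<(id >> 16) & mask>/subdir<(id >> 8) & mask>, with mask 0xFF for the older 256x256 layout and 0x1F for the newer 32x32 layout, and that block and meta files are named blk_<id> and blk_<id>_<genstamp>.meta. These rules, the function names and the layout labels are assumptions made for illustration and should be verified against the DataNode code of the versions involved.

```python
import re
from pathlib import Path

# Assumed file naming: blk_<id> for block files, blk_<id>_<genstamp>.meta for meta files.
BLOCK_FILE = re.compile(r"^blk_(-?\d+)(?:_\d+\.meta)?$")

# Assumed masks: 0xFF for the 256x256 layout, 0x1F for the 32x32 layout.
LAYOUT_MASKS = {"256x256": 0xFF, "32x32": 0x1F}


def expected_subdir(block_id: int, mask: int) -> Path:
    """Return the relative directory a block ID maps to under the given mask."""
    d1 = (block_id >> 16) & mask
    d2 = (block_id >> 8) & mask
    return Path(f"subdir{d1}") / f"subdir{d2}"


def find_misplaced(finalized_dir: Path, mask: int):
    """Yield (actual_path, expected_path) for every block or meta file
    that is not in the directory its block ID maps to."""
    for path in sorted(finalized_dir.rglob("blk_*")):
        match = BLOCK_FILE.match(path.name)
        if match is None or not path.is_file():
            continue
        block_id = int(match.group(1))
        expected = finalized_dir / expected_subdir(block_id, mask) / path.name
        if path != expected:
            yield path, expected
```

A validate command would only report the pairs that find_misplaced yields. A fix command could move each file to its expected path while the DataNode is stopped, creating the target directories as needed and keeping each block file together with its meta file.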