[ 
https://issues.apache.org/jira/browse/HDFS-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Patrick Kling updated HDFS-1476:
--------------------------------

    Attachment: HDFS-1476.5.patch

- Updated patch to apply to current trunk.
- In BlockManager.markBlockAsCorrupt() only update needed replication queues if 
they have been initialized

ant test-patch results:
{code}
     [exec] +1 overall.  
     [exec] 
     [exec]     +1 @author.  The patch does not contain any @author tags.
     [exec] 
     [exec]     +1 tests included.  The patch appears to include 6 new or 
modified tests.
     [exec] 
     [exec]     +1 javadoc.  The javadoc tool did not generate any warning 
messages.
     [exec] 
     [exec]     +1 javac.  The applied patch does not increase the total number 
of javac compiler warnings.
     [exec] 
     [exec]     +1 findbugs.  The patch does not introduce any new Findbugs 
(version 1.3.9) warnings.
     [exec] 
     [exec]     +1 release audit.  The applied patch does not increase the 
total number of release audit warnings.
     [exec] 
     [exec]     +1 system test framework.  The patch passed system test 
framework compile.
{code}

ant test failures (same as on clean trunk):
{code}
 [junit] Test org.apache.hadoop.hdfs.server.datanode.TestBlockRecovery FAILED
    [junit] Test org.apache.hadoop.hdfs.TestHDFSServerPorts FAILED
    [junit] Test org.apache.hadoop.hdfs.TestHDFSTrash FAILED (timeout)
    [junit] Test org.apache.hadoop.hdfs.server.namenode.TestBackupNode FAILED
    [junit] Test org.apache.hadoop.hdfs.server.namenode.TestStorageRestore 
FAILED
    [junit] Test org.apache.hadoop.hdfs.TestFileConcurrentReader FAILED 
(timeout)
    [junit] Test 
org.apache.hadoop.hdfs.server.namenode.TestLargeDirectoryDelete FAILED (timeout)
    [junit] Test org.apache.hadoop.hdfs.server.datanode.TestBlockRecovery FAILED
{code}

> listCorruptFileBlocks should be functional while the name node is still in 
> safe mode
> ------------------------------------------------------------------------------------
>
>                 Key: HDFS-1476
>                 URL: https://issues.apache.org/jira/browse/HDFS-1476
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Patrick Kling
>            Assignee: Patrick Kling
>         Attachments: HDFS-1476.2.patch, HDFS-1476.3.patch, HDFS-1476.4.patch, 
> HDFS-1476.5.patch, HDFS-1476.patch
>
>
> This would allow us to detect whether missing blocks can be fixed using Raid 
> and if that is the case exit safe mode earlier.
> One way to make listCorruptFileBlocks available before the name node has 
> exited from safe mode would be to perform a scan of the blocks map on each 
> call to listCorruptFileBlocks to determine if there are any blocks with no 
> replicas. This scan could be parallelized by dividing the space of block IDs 
> into multiple intervals than can be scanned independently.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to