[ 
https://issues.apache.org/jira/browse/HDFS-5247?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13797045#comment-13797045
 ] 

Suresh Srinivas commented on HDFS-5247:
---------------------------------------

This problem is rare enough that it can be worked around by monitoring the 
available disk space. This part of the code has been quite brittle: some earlier 
changes in this area introduced more serious bugs and required follow-up fixes 
to stabilize it. My preference is to leave this as is, since monitoring disk 
space avoids the issue.
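
As a rough illustration of the disk-space monitoring mentioned above, here is a 
minimal sketch, not part of Hadoop or of the attached patch. The class name, 
method name, and the 1 GiB threshold are hypothetical; it only uses the 
standard java.io.File.getUsableSpace() call and would need to be run 
periodically (for example from cron) against each dfs.name.dir directory.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: flags name directories that are running low on space.
public class NameDirSpaceCheck {

    // Returns the directories whose usable space is below minFreeBytes.
    static List<File> lowSpaceDirs(List<File> dirs, long minFreeBytes) {
        List<File> low = new ArrayList<>();
        for (File dir : dirs) {
            // getUsableSpace() returns 0 for a path that does not exist,
            // so a missing directory is reported as low on space too.
            if (dir.getUsableSpace() < minFreeBytes) {
                low.add(dir);
            }
        }
        return low;
    }

    public static void main(String[] args) {
        long minFreeBytes = 1024L * 1024L * 1024L; // assumed threshold: 1 GiB
        List<File> dirs = new ArrayList<>();
        for (String arg : args) {
            dirs.add(new File(arg));
        }
        for (File dir : lowSpaceDirs(dirs, minFreeBytes)) {
            System.err.println("Low disk space: " + dir
                    + " (" + dir.getUsableSpace() + " bytes usable)");
        }
    }
}
```

It could be invoked as "java NameDirSpaceCheck /volume1/nn/dfs", passing one 
argument per configured name directory. A suitable threshold depends on the 
size of the edits and image files on the cluster in question.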

> Namenode should close editlog and unlock storage when removing failed storage 
> dir
> ---------------------------------------------------------------------------------
>
>                 Key: HDFS-5247
>                 URL: https://issues.apache.org/jira/browse/HDFS-5247
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: namenode
>    Affects Versions: 1.2.1
>            Reporter: zhaoyunjiong
>            Assignee: zhaoyunjiong
>             Fix For: 1.2.1
>
>         Attachments: HDFS-5247-branch-1.2.patch
>
>
> When one of the dfs.name.dir directories failed, the namenode did not close 
> the editlog or unlock the storage:
> java    24764 hadoop   78uW  REG 252,32        0 393219 /volume1/nn/dfs/in_use.lock (deleted)
> java    24764 hadoop  107u   REG 252,32  1155072 393229 /volume1/nn/dfs/current/edits.new (deleted)
> java    24764 hadoop  119u   REG 252,32        0 393238 /volume1/nn/dfs/current/fstime.tmp
> java    24764 hadoop  140u   REG 252,32  1761805 393239 /volume1/nn/dfs/current/edits
> If space in this directory is limited, restoring this storage may then fail.



--
This message was sent by Atlassian JIRA
(v6.1#6144)
