bcc: hdfs-user@hadoop.apache.org
+ cdh-u...@cloudera.org

Hey Adam,

Thanks a lot for the bug report. I've added cdh-user@ to this email, which
may be a more appropriate list for this question.

Best,
Aaron

--
Aaron T. Myers
Software Engineer, Cloudera



On Thu, Mar 24, 2011 at 10:47 AM, Adam Phelps <a...@opendns.com> wrote:

> For reference, this is running hadoop 0.20.2 from the CDH3B4 distribution.
>
> - Adam
>
>
> On 3/24/11 10:30 AM, Adam Phelps wrote:
>
>> We have a bad disk on one of our datanode machines, and while we have
>> dfs.datanode.failed.volumes.tolerated set to 2 and didn't see any
>> problem while the DataNode process was running we are seeing a problem
>> when we needed to restart the DataNode process:
>>
>> 2011-03-24 16:50:20,071 WARN org.apache.hadoop.util.DiskChecker:
>> Incorrect permissions were set on /var/lib/stats/hdfs/4, expected:
>> rwxr-xr-x, while actual: ---------. Fixing...
>> 2011-03-24 16:50:20,089 INFO org.apache.hadoop.util.NativeCodeLoader:
>> Loaded the native-hadoop library
>> 2011-03-24 16:50:20,091 ERROR
>> org.apache.hadoop.hdfs.server.datanode.DataNode: EPERM: Operation not
>> permitted
>>
>> In this case /var/lib/stats/hdfs/4 is the mount point for the bad disk.
>> It gets that permission error because we have the mount directory set to
>> be immutable:
>>
>> root@s3:/var/log/hadoop# lsattr /var/lib/stats/hdfs/
>> ------------------- /var/lib/stats/hdfs/2
>> ----i------------e- /var/lib/stats/hdfs/4
>> ------------------- /var/lib/stats/hdfs/3
>> ------------------- /var/lib/stats/hdfs/1
>>
>> As we'd previously seen HDFS just write to the local disk when a disk
>> couldn't be mounted.
>>
>> HDFS is supposed to be able to handle failed disk, but it doesn't seem
>> to be doing the right thing in this case. Is this a known problem, or is
>> there some other way we should be configuring things to allow the
>> DataNode to come up in this situation?
>>
>> (clearly we can remove the mount point from hdfs-site.xml, but that
>> doesn't feel like the correct solution)
>>
>> Thanks
>> - Adam
>>
>>
>

Reply via email to