[jira] Commented: (HDFS-1104) Fsck triggers full GC on NameNode

Hairong Kuang (JIRA) Tue, 27 Apr 2010 12:49:04 -0700

    [ 
https://issues.apache.org/jira/browse/HDFS-1104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12861513#action_12861513
 ]


Hairong Kuang commented on HDFS-1104:
-------------------------------------

I know that most file systems enable access time by default. But I also know
that in practice many administrators turn it off for performance reasons. I
agree that a correct access time is sometimes useful, but in most cases it is
unnecessary and only adds I/O to the system. Turning it off by default will
benefit most of our users.

For fsck, I propose that it never update access time, with no configuration
option to turn the update back on. This will greatly improve fsck performance.
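
To make the fsck proposal concrete, here is a minimal sketch of an open path
in which the caller decides whether a read counts as an access. It is only an
illustration: AccessTimeSketch, FileRecord, open, and the updateAccessTime
flag are hypothetical names, not the NameNode's actual classes or signatures.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a NameNode; these names are not from the HDFS code base. */
class AccessTimeSketch {
    /** Minimal file record holding only an access time. */
    static final class FileRecord {
        long accessTime;
    }

    /** Edit-log entries that would be written for access-time changes. */
    final List<String> pendingEdits = new ArrayList<>();

    /**
     * Looks up block locations for a file. The caller decides whether the
     * read counts as an access; a checker such as fsck would pass false.
     */
    void open(FileRecord file, long now, boolean updateAccessTime) {
        if (updateAccessTime) {
            file.accessTime = now;
            pendingEdits.add("SET_ATIME " + now); // one transaction per open
        }
        // The block-location lookup itself is omitted from this sketch.
    }

    public static void main(String[] args) {
        AccessTimeSketch nn = new AccessTimeSketch();
        FileRecord f = new FileRecord();
        nn.open(f, 1000L, true);   // ordinary client read: logs one edit
        nn.open(f, 2000L, false);  // fsck-style read: logs nothing
        System.out.println(nn.pendingEdits.size() + " edit(s), atime=" + f.accessTime);
    }
}
```

With the flag hard-wired to false on the fsck path, a check over any number
of files adds no access-time transactions to the edit log.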

Yes, #4 is a must. I will open a different jira for this.


> Fsck triggers full GC on NameNode
> ---------------------------------
>
>                 Key: HDFS-1104
>                 URL: https://issues.apache.org/jira/browse/HDFS-1104
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: name-node
>            Reporter: Hairong Kuang
>            Assignee: Hairong Kuang
>            Priority: Blocker
>             Fix For: 0.20.3, 0.21.0, 0.22.0
>
>
> A NameNode at one of our clusters fell into full GC while fsck was performed. 
> Digging into the problem shows that it is caused by how NameNode handles the 
> access time of a file.
> Fsck calls open on every file in the checked directory to get the file's 
> block locations. Each open changes the file's access time and then leads to 
> writing a transaction entry to the edit log. The current code optimizes open
> so that it returns without syncing the edit log to disk. It happened that no
> other jobs were running in our cluster while fsck was performed, so no edit
> log sync was ever called and all open transactions were kept in memory. When
> the edit log buffer got full, it automatically doubled its space by
> allocating a new buffer. Full GC happened when no contiguous space was found
> for the new, bigger buffer.
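
The buffer growth described in the issue can be modeled with a short sketch.
This is a simplified, hypothetical model and not the HDFS edit log code:
EditLogBufferSketch, the 1024-byte initial capacity, and the 64-byte entry
size are all assumptions chosen for illustration. It compares a log that is
never synced with one that is synced every ten transactions.

```java
/** Hypothetical model of an in-memory edit-log buffer; not the HDFS implementation. */
class EditLogBufferSketch {
    private byte[] buf;
    private int used = 0;
    int reallocations = 0;

    EditLogBufferSketch(int initialCapacity) {
        buf = new byte[initialCapacity];
    }

    /** Appends one transaction, doubling the buffer while the entry does not fit. */
    void logTransaction(byte[] entry) {
        while (used + entry.length > buf.length) {
            // Each growth step needs one contiguous array twice the old size.
            byte[] bigger = new byte[buf.length * 2];
            System.arraycopy(buf, 0, bigger, 0, used);
            buf = bigger;
            reallocations++;
        }
        System.arraycopy(entry, 0, buf, used, entry.length);
        used += entry.length;
    }

    /** Models a sync: buffered bytes are considered flushed and the space is reused. */
    void sync() {
        used = 0;
    }

    int capacity() {
        return buf.length;
    }

    public static void main(String[] args) {
        byte[] entry = new byte[64]; // assumed size of one access-time transaction
        EditLogBufferSketch unsynced = new EditLogBufferSketch(1024);
        EditLogBufferSketch synced = new EditLogBufferSketch(1024);
        for (int i = 0; i < 100_000; i++) {
            unsynced.logTransaction(entry);   // fsck-only workload: never synced
            synced.logTransaction(entry);
            if (i % 10 == 9) {
                synced.sync();                // other traffic forces periodic syncs
            }
        }
        System.out.println("unsynced capacity: " + unsynced.capacity());
        System.out.println("synced capacity:   " + synced.capacity());
    }
}
```

In this model the synced buffer never grows, while the unsynced one keeps
requesting ever larger contiguous arrays, which is the allocation pattern the
issue associates with the full GC.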

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
