[ 
https://issues.apache.org/jira/browse/HDFS-8710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer updated HDFS-8710:
-----------------------------------
    Resolution: Won't Fix
        Status: Resolved  (was: Patch Available)

> Always read DU value from the cached "dfsUsed" file on datanode startup
> -----------------------------------------------------------------------
>
>                 Key: HDFS-8710
>                 URL: https://issues.apache.org/jira/browse/HDFS-8710
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Xinwei Qin 
>            Assignee: Xinwei Qin 
>         Attachments: HDFS-8710.001.patch
>
>
> Currently, DataNode will cache DU value in "dfsUsed" file termly. When 
> DataNode starts or restarts, it will read in the cached DU value from 
> "dfsUsed" file if the value is less than 600 seconds old, otherwise, it will 
> run DU command, which is a very time-consuming operation(may up to dozens of 
> minutes) when DataNode has huge number of blocks.
> Since slight imprecision of dfsUsed is not critical, and the DU value will be 
> updated every 600 seconds (the default DU interval) after DataNode started, 
> we can always read DU value from the cached file (Regardless of whether this 
> value is less than 600 seconds old or not) and skip DU operation on DataNode 
> startup to significantly shorten the startup time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to