[ 
https://issues.apache.org/jira/browse/HDFS-5498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13825886#comment-13825886
 ] 

Kihwal Lee commented on HDFS-5498:
----------------------------------

For quick restart (e.g. rolling upgrade), the initialization of 
{{BlockPoolSlice}} can be made fater by cashing {{dfsUsed}}.  Currently DU 
unconditionally runs the du command in its constructor. Filed HADOOP-10111 to 
add an option to specify an initial "used" value and delay the initial scan.

When I incorporated this change in {{BlockPoolSlice}}, the initialization time 
for each volume became several milliseconds on restart. It looks for the cache 
file and use the value if it is less than 5 minutes old (the default DU 
interval).

Another part of start-up time improvement will come from parallelizing  the 
scanning & popuating {{ReplicaMap}}.

> Improve datanode startup time
> -----------------------------
>
>                 Key: HDFS-5498
>                 URL: https://issues.apache.org/jira/browse/HDFS-5498
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Kihwal Lee
>
> Similarly to HDFS-5027, an improvement  can be made for getVomeMap(). This is 
> the phase in which ReplicaMap.is populated.  But it will be even better if 
> datanode scans only once and do both.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to