[
https://issues.apache.org/jira/browse/HADOOP-2816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12628160#action_12628160
]
Suresh Srinivas commented on HADOOP-2816:
-----------------------------------------
For reporting, the following information needs to be considered:
Total capacity - Capacity of all the data directories
Reserved space - Space reserved for non DFS usage
dfs.datanode.du.pct - When calculating DFS remaining space, only use this
percentage of the real available space
Here is how DFS remaining space is calculated:
Available space = the minimum of (Available space on local file system) and
(Total capacity - DFS used space - Reserved space)
DFS remaining = (dfs.datanode.du.pct) * Available space
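The calculation above can be sketched in a few lines of Python. This is only an illustration of the formula as described in this comment, not the actual datanode code: the function and parameter names are hypothetical, du_pct is taken as a fraction between 0 and 1, and clamping the result at zero is an added assumption.

```python
def dfs_remaining(capacity, dfs_used, reserved, fs_available, du_pct):
    """Sketch of the DFS remaining formula; all sizes share one unit (e.g. GB)."""
    # Space left after subtracting DFS usage and the non-DFS reserve.
    by_capacity = capacity - dfs_used - reserved
    # Take the smaller of that and what the local file system reports.
    # Clamping at zero is an assumption made for this sketch.
    available = max(0, min(fs_available, by_capacity))
    # Only du_pct of the available space is reported as DFS remaining.
    return du_pct * available
```

For example, dfs_remaining(1000, 600, 100, 350, 0.75) gives 225.0: the capacity-based bound (300) is smaller than the local file system figure (350), and 75% of 300 is reported.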
The current proposal does not consider the dfs.datanode.du.pct factor. I am not
sure why du.pct is being used. If its purpose is to reduce the disk space
available to DFS, to account for factors such as disk fragmentation, it does
not serve that purpose. Available space keeps decreasing, and the percentage is
applied to the shrinking available space. Eventually DFS ends up using all the
available space anyway (in theory), so du.pct serves no purpose.
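A small simulation makes the shrinking-space argument concrete. It is a simplified model, not a description of real cluster behaviour: it assumes that in each round DFS writes exactly the amount reported as remaining, and that the local file system is never the tighter limit. The function name and the numbers are hypothetical.

```python
def simulate_fill(capacity, reserved, du_pct, rounds):
    """Return DFS used space after repeatedly filling the reported remaining space."""
    dfs_used = 0.0
    for _ in range(rounds):
        # Available space shrinks as DFS usage grows.
        available = capacity - dfs_used - reserved
        # Assumption: DFS consumes everything reported as remaining.
        dfs_used += du_pct * available
    return dfs_used
```

With capacity 1000, reserved 200 and du_pct 0.5, simulate_fill(1000, 200, 0.5, 10) returns 799.21875, which is already within 0.1% of the full 800 units left after the reserve. The unused portion shrinks by a factor of (1 - du_pct) each round, so no fixed share of the disk is ever held back.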
My proposal:
1) Remove du.pct configuration option
or
2) If du.pct is kept, calculate it on Total capacity rather than on available
space. This helps set aside a percentage of total capacity.
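One possible reading of option 2 is sketched below. The comment does not spell out the exact formula, so the arithmetic here is an assumption: du_pct (as a fraction) scales the total capacity first, and DFS used space and reserved space are then subtracted from that scaled figure. All names are hypothetical.

```python
def dfs_remaining_on_capacity(capacity, dfs_used, reserved, fs_available, du_pct):
    """Sketch of option 2: apply du_pct to total capacity, not to available space."""
    # Only du_pct of the total capacity is ever offered to DFS (assumed reading).
    usable = du_pct * capacity
    by_capacity = usable - dfs_used - reserved
    # Still bounded by the local file system, and never negative.
    return max(0, min(fs_available, by_capacity))
```

With capacity 1000 and du_pct 0.75, at most 750 units count toward DFS. Once DFS used plus reserved reaches 750, the function reports 0 remaining, so the other 250 units stay set aside no matter how full the disk gets.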
> Cluster summary at name node web has confusing report for space utilization
> ---------------------------------------------------------------------------
>
> Key: HADOOP-2816
> URL: https://issues.apache.org/jira/browse/HADOOP-2816
> Project: Hadoop Core
> Issue Type: Improvement
> Components: dfs
> Affects Versions: 0.15.3
> Reporter: Robert Chansler
> Assignee: Suresh Srinivas
>
> In one example:
> Cluster Summary
> Capacity : 1.15 PB
> DFS Remaining : 192 TB
> DFS Used : 717 TB
> DFS Used% : 62 %
> Why is Capacity not equal to Used plus Remaining?
> (The answer is that there is an estimated reserve for local files.)
> The presentation should be easily understood by the user.