[
https://issues.apache.org/jira/browse/HADOOP-2816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Chansler updated HADOOP-2816:
------------------------------------
Release Note: Improved space reporting for NameNode Web UI. Applications
that parse the Web UI output should be reviewed.
Hadoop Flags: [Incompatible change, Reviewed] (was: [Reviewed,
Incompatible change])
This fix changes the following:
1) The Capacity reported in the DataNode heartbeat is changed. Previously,
Capacity was the sum of the disk space of all data directories. With this
change, it is that sum minus the reserved space configured with the
dfs.datanode.du.reserved config param. The change is reflected by bumping
the protocol version from 17 to 18.
2) The NameNode Web UI is changed accordingly, as detailed below.
Cluster Summary
Capacity : Currently the sum of the file system capacity of all the data
directories. Changed to the sum of the file system capacity of all the data
directories minus reserved space. Renamed to "Configured Capacity".
Present Capacity : Newly added. Represents the capacity presently available
for DFS use, computed as the sum of DFS Remaining and DFS Used given below.
DFS Remaining : Unchanged.
DFS Used : Unchanged.
DFS Used% : Changed. It is now calculated from Present Capacity rather than
Configured Capacity.
Live Nodes : Unchanged.
Dead Nodes : Unchanged.
The node table currently prints these columns:
Node | Last Contact | Admin State | Size (TB) | Used (%) | Used (%) | Remaining (TB) | Blocks
It will be changed to:
Node | Last Contact | Admin State | Capacity (TB) | Present Capacity (TB) | Used (%) | Used (%) | Remaining (TB) | Blocks
The Size column is renamed to Total Capacity. Previously it was calculated as
the sum of the file system capacity of all the data directories. It now
excludes reserved space and is calculated as (sum of file system capacity of
all the data directories - reserved space).
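
The capacity rules described in this note can be sketched as follows. This is only an illustration, not the code from the HADOOP-2816 patch. The function names are hypothetical, the reserve is assumed to be passed in as a single total (the note does not spell out how dfs.datanode.du.reserved is applied across directories), and the zero-capacity guard is an added assumption.

```python
def configured_capacity(dir_capacities, reserved):
    """Sum of the data-directory capacities minus the reserved space."""
    return sum(dir_capacities) - reserved


def present_capacity(dfs_used, dfs_remaining):
    """Capacity presently available for DFS use: DFS Used plus DFS Remaining."""
    return dfs_used + dfs_remaining


def dfs_used_percent(dfs_used, dfs_remaining):
    """DFS Used% relative to Present Capacity, not Configured Capacity."""
    present = present_capacity(dfs_used, dfs_remaining)
    # Assumption: report 0% when nothing is available, to avoid dividing by zero.
    if present == 0:
        return 0.0
    return 100.0 * dfs_used / present


# Usage with made-up numbers (all in TB): two 4 TB directories, 1 TB reserved.
print(configured_capacity([4, 4], 1))   # 7
print(present_capacity(3, 3))           # 6
print(dfs_used_percent(3, 3))           # 50.0
```

Because DFS Used% is derived from Present Capacity, it stays consistent with the DFS Used and DFS Remaining values shown beside it.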
> Cluster summary at name node web has confusing report for space utilization
> ---------------------------------------------------------------------------
>
> Key: HADOOP-2816
> URL: https://issues.apache.org/jira/browse/HADOOP-2816
> Project: Hadoop Core
> Issue Type: Improvement
> Components: dfs
> Reporter: Robert Chansler
> Assignee: Suresh Srinivas
> Fix For: 0.19.0
>
> Attachments: HADOOP-2816.patch, HADOOP-2816.patch, HADOOP-2816.patch
>
>
> In one example:
> Cluster Summary
> Capacity : 1.15 PB
> DFS Remaining : 192 TB
> DFS Used : 717 TB
> DFS Used% : 62 %
> Why is Capacity not equal to Used plus Remaining?
> (The answer is that there is an estimated reserve for local files.)
> The presentation should be easily understood by the user.
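
Applying the Present Capacity definition to the figures quoted in the issue shows why the original summary was confusing. This is a rough sketch with illustrative variable names. It assumes 1.15 PB is 1150 TB, since the report does not state the unit base.

```python
capacity_tb = 1150        # assumption: 1.15 PB taken as 1150 TB
dfs_remaining_tb = 192
dfs_used_tb = 717

# Present Capacity is the sum of DFS Used and DFS Remaining.
present_capacity_tb = dfs_used_tb + dfs_remaining_tb

# Space counted in Capacity but in neither Used nor Remaining.
gap_tb = capacity_tb - present_capacity_tb

# Used% against the reported Capacity versus against Present Capacity.
used_pct_of_capacity = 100 * dfs_used_tb / capacity_tb
used_pct_of_present = 100 * dfs_used_tb / present_capacity_tb

print(present_capacity_tb)             # 909
print(gap_tb)                          # 241
print(round(used_pct_of_capacity, 1))  # 62.3
print(round(used_pct_of_present, 1))   # 78.9
```

Under these assumptions the reported 62 % matches Used divided by Capacity, while Used and Remaining together account for only 909 TB. The remaining 241 TB or so corresponds to what the issue describes as the estimated reserve for local files.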