Hi Keith,

The "fs -du" command sums the lengths of the files, so it does not report
the replicated on-disk size. The HDFS disk utilization shown by the
NameNode, OTOH, is a plain report of used/free disk space across the
cluster, so it does include replicated data.
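
To make the difference concrete, here is a rough sketch of the arithmetic. The helper name and the sample file sizes are made up for illustration, and it assumes every file uses the same replication factor; real clusters can mix factors, and the utilization figure also reflects whatever else the cluster is storing.

```python
def logical_and_raw_bytes(file_lengths, replication=3):
    """Hypothetical helper: return (logical, raw) byte counts.

    logical = sum of file lengths, the kind of figure "fs -du" adds up
    raw     = logical * replication, roughly what shows up as used disk
              space (assumes one uniform replication factor)
    """
    logical = sum(file_lengths)
    raw = logical * replication
    return logical, raw


# Two made-up files of 128 MiB and 64 MiB with 3X replication.
sizes = [128 * 1024**2, 64 * 1024**2]
logical, raw = logical_and_raw_bytes(sizes, replication=3)
print("logical bytes:", logical)
print("raw bytes:    ", raw)
```

So with 3X replication the used-space number should come out at about three times the "fs -du" total for the same data.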

On Mon, Apr 22, 2013 at 10:49 PM, Keith Wiley <kwi...@keithwiley.com> wrote:
> Simple question: When I issue a "hadoop fs -du" command and/or when I view 
> the namenode web UI to see HDFS disk utilization (which the namenode reports 
> both as bytes and percentage), should I expect to see disk use reported as 
> "true data size" or as replicated size (e.g. with 3X replication, should I 
> expect reported values to be three times higher than the actual underlying 
> data itself)?
>
> Thanks.
>
> ________________________________________________________________________________
> Keith Wiley     kwi...@keithwiley.com     keithwiley.com    
> music.keithwiley.com
>
> "I used to be with it, but then they changed what it was.  Now, what I'm with
> isn't it, and what's it seems weird and scary to me."
>                                            --  Abe (Grandpa) Simpson
> ________________________________________________________________________________
>



-- 
Harsh J