[ 
https://issues.apache.org/jira/browse/HDFS-6604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Colin Patrick McCabe updated HDFS-6604:
---------------------------------------

    Attachment: HDFS-6604.001.patch

The CacheCleaner thread is supposed to time out replicas that haven't been used 
for a while (5 minutes by default).  It looks like this thread has not been 
timing out the non-mmapped entries as it should, due to a typo.  This patch 
should fix it.
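
For readers unfamiliar with the mechanism, here is a minimal, hypothetical sketch of the kind of time-based cleanup described above. It is not the actual ShortCircuitCache code and does not reproduce the typo or the patch. The class name, method names, and the two-map layout are assumptions made for illustration only. The point it shows is that a cleaner pass has to walk both the non-mmapped and the mmapped idle lists, each against its own timeout.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical sketch of an idle-replica cleaner; names are illustrative. */
public class ExpiringReplicaCache {
    // Keyed by the time (ms) at which the replica became idle, oldest first.
    private final TreeMap<Long, String> evictable = new TreeMap<>();
    private final TreeMap<Long, String> evictableMmapped = new TreeMap<>();
    private final long maxIdleMs;
    private final long maxMmappedIdleMs;

    public ExpiringReplicaCache(long maxIdleMs, long maxMmappedIdleMs) {
        this.maxIdleMs = maxIdleMs;
        this.maxMmappedIdleMs = maxMmappedIdleMs;
    }

    /** Record that a replica is no longer referenced by any reader. */
    public synchronized void markIdle(String replica, boolean mmapped, long nowMs) {
        TreeMap<Long, String> map = mmapped ? evictableMmapped : evictable;
        long key = nowMs;
        while (map.containsKey(key)) {
            key++; // keep keys unique when two replicas go idle at the same instant
        }
        map.put(key, replica);
    }

    /** One cleaner pass: expire entries from BOTH lists, not just one of them. */
    public synchronized List<String> runCleaner(long nowMs) {
        List<String> closed = new ArrayList<>();
        purgeExpired(evictable, maxIdleMs, nowMs, closed);
        purgeExpired(evictableMmapped, maxMmappedIdleMs, nowMs, closed);
        return closed;
    }

    private static void purgeExpired(TreeMap<Long, String> map, long maxIdle,
                                     long nowMs, List<String> closed) {
        Iterator<Map.Entry<Long, String>> it = map.entrySet().iterator();
        while (it.hasNext()) {
            Map.Entry<Long, String> e = it.next();
            if (nowMs - e.getKey() < maxIdle) {
                break; // entries are sorted by idle time, so the rest are newer
            }
            closed.add(e.getValue()); // a real cache would close the file descriptors here
            it.remove();
        }
    }
}
```

In this sketch, dropping either purgeExpired call would leave that list's entries (and their open file descriptors) in place until something else evicts them, which is the general shape of the symptom reported in this issue.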

We probably should also have the DN notify the client when a block that the 
client is reading is unlinked.  It could do this via the existing shared memory 
segment code, in the same way we handle uncaching now.

> Disk space leak with shortcircuit
> ---------------------------------
>
>                 Key: HDFS-6604
>                 URL: https://issues.apache.org/jira/browse/HDFS-6604
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: hdfs-client
>    Affects Versions: 2.4.0
>         Environment: CentOS 6.5 and distribution Hortonworks Data Platform 
> v2.1
>            Reporter: Giuseppe Reina
>            Priority: Critical
>         Attachments: HDFS-6604.001.patch
>
>
> When HDFS short-circuit reads are enabled, the file descriptors of deleted HDFS 
> blocks are kept open until the cache is full. This prevents the operating 
> system from freeing the space on disk. 
> More details on the [mailing list 
> thread|http://mail-archives.apache.org/mod_mbox/hbase-user/201406.mbox/%3CCAPjB-CA3RV=slhuhwue5cv3pc4+rffz10-tkydbfs9rt2de...@mail.gmail.com%3E]
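
The underlying operating-system behavior in the report can be reproduced outside HDFS. The following is a small illustrative sketch, assuming a POSIX-style filesystem such as Linux; the class and method names are made up for this example. It deletes a file while a descriptor to it is still open and then reads from that descriptor, which only works because the kernel keeps the data (and the disk space) until the last descriptor is closed.

```java
import java.io.FileInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

/** Illustrative only: an open descriptor keeps a deleted file's data alive. */
public class DeletedFileDemo {
    /** Returns the number of bytes read from the file after it was unlinked. */
    public static int readAfterDelete() throws IOException {
        Path p = Files.createTempFile("blk_demo", ".dat");
        Files.write(p, "block data".getBytes(StandardCharsets.US_ASCII));
        try (FileInputStream in = new FileInputStream(p.toFile())) {
            Files.delete(p); // the name is gone, but the open descriptor still pins the data
            byte[] buf = new byte[64];
            return in.read(buf); // still readable on POSIX systems
        } // closing the stream is what finally lets the OS reclaim the space
    }
}
```

On Linux, such files typically still count against disk usage while no longer appearing in directory listings, which matches the leak described in the report.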



--
This message was sent by Atlassian JIRA
(v6.2#6252)
