[ 
https://issues.apache.org/jira/browse/HDFS-11661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15979551#comment-15979551
 ] 

Sean Mackrory commented on HDFS-11661:
--------------------------------------

+1 to the revert - I too would still like to see the original problem fixed, 
but this is worse. It does indeed require global context to do correctly, so 
it'll require some cleverness to make sure we do that without using tons of 
space or locking for a long time. 

[~jojochuang] - to revert cleanly we can revert HDFS-11515 (unless I'm missing 
something and that patch does more than just correct the original changes in 
HDFS-10797) first and then HDFS-10797. As [~xiaochen] is not available right 
now, would you be able to commit the revert when we're satisfied? I'll run 
tests with the reverts committed locally...

> GetContentSummary uses excessive amounts of memory
> --------------------------------------------------
>
>                 Key: HDFS-11661
>                 URL: https://issues.apache.org/jira/browse/HDFS-11661
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: namenode
>    Affects Versions: 2.8.0, 3.0.0-alpha2
>            Reporter: Nathan Roberts
>            Priority: Blocker
>         Attachments: Heap growth.png
>
>
> ContentSummaryComputationContext::nodeIncluded() is being used to keep track 
> of all INodes visited during the current content summary calculation. This 
> can be all of the INodes in the filesystem, making for a VERY large hash 
> table. This simply won't work on large filesystems. 
> We noticed this after upgrading a namenode with ~100Million filesystem 
> objects was spending significantly more time in GC. Fortunately this system 
> had some memory breathing room, other clusters we have will not run with this 
> additional demand on memory.
> This was added as part of HDFS-10797 as a way of keeping track of INodes that 
> have already been accounted for - to avoid double counting.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to