Re: patch to 0.3.0 ExtensionFactoryMap for serious memory leak

Chris Berry Tue, 29 Apr 2008 11:47:47 -0700

Thanks for responding Dan.

I may be wrong, but (IIRC) I think 0.3.0 was patched a few times inplace (not that its a best practice ;-)

Would 0.3.1 have to go thru the same process as a minor patch release??
If so,  it probably is too much work.
Thanks,

-- Chris

On Apr 29, 2008, at 1:37 PM, Dan Diephouse wrote:

Hi Chris, due to the high amount of work it takes to get a releaseout of the incubator, I really don't see this happening. I don'tknow what you mean by applying this patch to 0.3.0, but we can'treally go back and change the release. It all has to go through theincubator process. And I'm going to guess that if we're goingthrough that process we're going to focus on either 1.0-RC or 0.4.1.
Dan

Chris Berry wrote:
Greetings,
Some time ago we reported a very serious memory leak in Abdera 0.3.0.
It was reported to the abdera users list on Nov 7, 2007 as "memoryleak".
That message is replayed below...
At the time we could workaround the bug in our client, but overtime this became impossible, and we had to hack ExtensionFactoryMap.
Attached is a patch to org.apache.abdera.factory.ExtensionFactoryMap
This patch simply removes the problem -- the WeakHashMap.
This change has a negligible effect on performance (as measured inload testing) But without it, abdera 0.3.0 will getOutOfMemoryExceptions quite quickly under load, rendering itunusable.
BTW: this code has been in Production for many weeks...

We were hoping that you could apply this patch to 0.3.0??
Or better;  produce 0.3.1
We are not able to upgrade to 0.4.0 yet, and would rather not havea hacked copy of 0.3.0.Also, others not able to upgrade yet (since the upgrade is notentirely transparent) may benefit from the patch
I will also open a JIRA on this.

Thanks,
-- Chris------------------------------------------------------------------------
*From: * [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>

*Subject: * *memory leak*

*Date: * November 5, 2007 3:49:24 PM CST
*To: * [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>*Reply-To: * [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>
                   Hey All -
I'm working on a data service based on Abdera (working with ChrisBerry, who's a regular on these lists...) When we were running ourfirst battery of serious load testing on our system, we encounteredmemory-leaky behavior, and a profiler showed us that we were indeedleaking hundreds of megabytes a minute, all traceable back to thewrappers field on org.apache.abdera.factory.ExtensionFactoryMap.This field is a map from elements to their wrappers, if any. Atfirst, I was puzzled by the memory leak, as the field isinitialized thusly:
this.wrappers = Collections.synchronizedMap( newWeakHashMap<Element,Element>());
clearly, the implementor took care to make sure that this cachewould not leak by making it a WeakHashMap, which generallyguarantees that the map itself will not keep a key and itscorresponding entry from being garbage collected. I dug throughoutour application code to find if we were actually holding otherreferences to these objects, and I googled for anyone havingproblems with esoteric interactions betweenCollections.synchronizedMap and WeakHashMaps - found nothingthere. Then I went back to square one and re-read the WeakHashMapjavadoc very carefully. Here's the relevant section:
Implementation note: The value objects in a WeakHashMap are held byordinary strong references. Thus care should be taken to ensurethat value objects do not strongly refer to their own keys, eitherdirectly or indirectly, since that will prevent the keys from beingdiscarded. Note that a value object may refer indirectly to its keyvia the WeakHashMap itself; that is, a value object may stronglyrefer to some other key object whose associated value object, inturn, strongly refers to the key of the first value object. One wayto deal with this is to wrap values themselves withinWeakReferences before inserting, as in: m.put(key, newWeakReference(value)), and then unwrapping upon each get.
This is why there is a memory leak - the map is a mapping fromelements to their wrappers - by the very nature of the object beinga wrapper of the element, it will usually have a strong referenceto the element itself, which is the key! You can verify that Abderawrappers, in general, will do this by looking atorg.apache.abdera.model.ElementWrapper, which takes the elementbeing wrapped as its constructor argument, and holds a strongreference to it as an instance variable.
This map is an optimization to memoize the calls togetElementWrapper() and not reconstruct them more than is necessary- it is not needed for abdera to function properly, so we havetemporarily worked around the problem in our own application likeso - we used to acquire our FOMFactory by callingabdera.getFactory() on our org.apache.abdera.Abdera instance, andre-using that singleton throughout our application. Now weconstruct a new FOMFactory with new FOMFactory(abdera) once perrequest to the server, and since the only appreciable state on thefactory is this map itself, this is a valid work-around.
I'd initially planned to really fix this issue and submit a patchalong with this message, but digging a little deeper, I'm not surethat the correct fix is crystal clear... We could do as thejavadoc above suggests, and wrap the values with WeakReferences toplug the leak, or we could use a LinkedHashMap configured as an LRUcache to just bound the cache, so it can't grow out of control -but right now, I don't think that either of those solutions wouldbe correct, because it seems to me that none of the objects in thehierarchy rooted at FOMElement define equals() and/or hashCode()methods, so all of the objects are cached based on their actualobject identity. It seems that in the all likely use cases,instances of FOMElement and its descendants are re-parsed on everyrequest to a server running abdera, and so what we will see iscache misses virtually 100% of the time, so even though we'd haveplugged the leak, strictly speaking, we would have ignored theunderlying issue that we're caching data on every request that willbe fundamentally unable to be retrieved on subsequent requests.This is based only on my reading over the code for a few hours, soI could be missing something, and I also might be forgetting abouta use case that demands and makes proper use of this memoization,but as it stands right now, my recommended fix would probably be tojust cut out the cache altogether, and allow for wrappers to getconstructed fresh every time they are requested. One morepossibility is that the cache is actually a useful optimization,but only during the scope of one request - in which case the "work-around" we are using now is actually the best practice, and the fixwould be to remove the factory instance on the Abdera class...
I'd like to hear from the Abdera developers what their thoughts areon this issue, and what the best resolution is likely to be. Thisis no longer a pressing issue for our team, but it is potentially atime bomb waiting to blow up for any project dependent on Abdera.
thanks! (and thanks for Abdera, generally - we're easily a yearahead of where we'd be on this project without it!)
-Bryon (long-time listener, first-time caller)
--
Dan Diephouse
MuleSource
http://mulesource.com | http://netzooid.com

Re: patch to 0.3.0 ExtensionFactoryMap for serious memory leak

Reply via email to