Re: Question in understanding ClassValue better

Jochen Theodorou Thu, 19 May 2016 16:34:23 -0700

On 19.05.2016 21:32, Peter Levart wrote:
[...]

a ClassValue instance can be thought of as a component of a compound
key. Together with a Class, they form a tuple (aClass, aClassValue) that
can be associated with an "associated value", AV. And yes, the AVs
associated with tuple containing a particular Class are strongly
reachable from that Class.

I see, I was mixing ClassValue and AV, I was more talking about AV, thanClassValue itself, though ClassValue surely plays an important role inhere. Anyway, I am going for that AV is the value computed by aClassValue.

You said that the AV is strongly reachable from aClass. Can I furtherassume, that aClassValue is not strongly reachable from aClass? And thataClassValue can be collected independent of aClass? Can I furtherassume, that aClassValue can be collected even if AVs for it continue toexist?


[...]

An AV associated with a tuple (Integer.TYPE, aClassValue) -> AV can be
garbage collected. But only if aClassValue can be garbage collected 1st.

hmm... so in (aClass, aClassValue)->AV if aClassValue can be collected,AV can, but not the other way around... what about aClass? if nothingbut AV is referencing aClass, can AV be garbage collected, even ifaClassValue cannot? Can I extend your statement to AV can be collectedonly if either aClass or aClassValue can be garbage collected first?


Let us assume this is the case for now.

This is the most tricky part to get right in order to prevent leaks. If
in above example, aClassValue is reachable from the AV, then we have a
leak. The reachability of a ClassValue instance from the associated
value AV is not always obvious. One has to take into account the
following non-obvious references:

1 - each object instance has an implicit reference to its implementing class
2 - each class has a reference to its defining ClassLoader
3 - each ClassLoader has a reference to all classes defined by it
(except VM annonymous classes)
4 - each ClassLoader has a reference to all its predecessors (that it
delegates to)

Since a ClassValue instance is typically assigned to a static final
field, such instance is reachable from the class that declares the field.

I think you can get the picture from that.

yeah... that is actually problematic. Because if I keep no hardreference the ClassValue can be collected, even if the AVs stillexist... meaning they would become unreachable. And if I keep one I havea memory leak... well more about in the program you have shown me later on.


[...]

Ok, let's set up the stage. If I understand you correctly, then:

Groovy runtime is loaded by whatever class loader is loading the
application (see the comment in MetaClass constructor if this is not
true).  This is either the ClassLoader.getSystemClassLoader() (the APP
class loader) if started from command line or for example Web App class
loader in a Web container.

Well, actually... if you start a script on the command line, the loaderis a child to the app class loader, when used as library it could be theapp loader (for example if the groovy program is precompiled) and in atomcat like scenario it could be either the class loader for the webapp, or the loader for all web apps. But let's go with the cases youmentioned first ;)

MetaClass(es) are objects implemented by Groovy runtime class(es). Let's
call them simply MetaClass.


good

Here's how I would do that:


public class MetaClass {

     // this list keeps MetaClass instances strongly reachable from the 
MetaClass
     // class(loader) since they are only weakly reachable from their associated
     // Class(es)
     private static final ArrayList<MetaClass> META_CLASS_LIST = new 
ArrayList<>();

     // this WeakReference is constructed so that it keeps a strong reference
     // to a referent until releaseStrong() is called
     private static final class WeakEntry extends WeakReference<MetaClass> {
         private final AtomicReference<MetaClass> strong;

         WeakEntry(MetaClass mc) {
             super(mc);
             strong = new AtomicReference<>(mc);
         }

         boolean releaseStrong() {
             MetaClass mc = strong.get();
             return mc != null && strong.compareAndSet(mc, null);
         }
     }

     private static final ClassValue<WeakEntry> WEAK_ENTRY_CV =
         new ClassValue<WeakEntry>() {
             @Override
             protected WeakEntry computeValue(Class<?> type) {
                 return new WeakEntry(new MetaClass(type));
             }
         };

     // the public API
     public MetaClass getInstanceFor(Class<?> type) {
         WeakEntry entry = WEAK_ENTRY_CV.get(type);
         MetaClass mc = entry.get();
         if (entry.releaseStrong()) {
             synchronized (META_CLASS_LIST) {
                 META_CLASS_LIST.add(mc);
             }
         }
         return mc;
     }

     MetaClass(Class<?> type) {
         // derive it from 'type', but don't reference it
         // strongly if Groovy runtime is loaded by a parent
         // class loader of the application class loader
         // that loads application classes  you want
         // MetaClass(es) to be derived from.
     }
}


This will keep MetaClass instances isolated from the associated classes
but still keep them reachable as long as the
MetaClass.class.getClassLoader() is alive.

Is this going to help?

partially... MetaClass will realistically have a strong reference to theclass the meta class is for and a lot of other classes. I will need toreference methods and fields by reflection and at some point keep themreferenced. Through the declaring class I should then again have astrong reference to the class.


1_000_000.times { Eval.me("42") }

this will create 1 million classes, in 1 million classloaders, which area child to the class loader for the Groovy runtime. Each of them willget a meta class... ignoring the list, if the strong reference to thescript class in MetaClass is here enough to prevent collection, thenthis is a problem. The list of course is also a problem, since keepingthem alive as long as MetaClass.class.getClassLoader is alive, isactually not the semantics I need.

The semantics I need is something like lifetime(AV) is roughlymin(lifetime(aClass),lifetime(aClassValue)), with AV referencing aClassnot influencing it being collectable, as long as this association is theonly reference to AV. If it is the max of them I am in trouble.

And seeing how aClass kind of has a strong reference to AV and that ifaClassValue and AV are from the same loader and if AV has a strongreference to aClass as well, it will cause aClass not being collectablefor as long as the loader for aClassValue and AV exists... becauseaClassValue is a static value in one of the classes, and AV is notcollected before aClassValue is collected and aClass is not notcollected because of the reference in AV.

The "old" solution in Groovy is to keep a "global" map with weak classkeys and soft references to MetaClass. This allows in theory the classbeing collectable, but only if the MetaClass has been collected before..and some VMs really do not like soft referenced classes much (forced usto use the mark and sweep gc for example). Also soft references used notalways to be clean up when permgen space is low. But if they are notcleaned up, then MetaClass prevents the garbage collection of the class,causing permgen problems. Not sure how that is with meta space, but fromwhat I have seen so far, this can still happen. And of course usingSoftReference means we garbage collect relatively late, and cause a biggarbage collection. I cannot use weak references, because tend to becollected to fast and creating the meta class is not exactly a cheapoperation.


bye Jochen

_______________________________________________
mlvm-dev mailing list
[email protected]
http://mail.openjdk.java.net/mailman/listinfo/mlvm-dev

Re: Question in understanding ClassValue better

Reply via email to