Re: RFR: JDK-8149925 We don't need jdk.internal.ref.Cleaner any more

Roger Riggs Thu, 31 Mar 2016 12:14:08 -0700

Hi Peter,

It would be simpler to understand the changes if we solve the problems one at a time,

at least for review purposes.


To your question in the 2nd part about the Cleaner. (webrev.11.part2)

I don't think the communication between the memory reserving thread and the unreserving thread should be mixed into the Cleaner design or implementation. The logic for the communication between the reserveMemory and unreserveMemory methods should be in those two methods and isolated to Bits.java. I understand the intent for the reserving thread to poll for available memory, and it might as well do something useful while it is waiting and get a hint about unreserved memory.

But it mixes together the implementations. (too much)
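
A minimal sketch of what keeping that hand-off inside the two methods could look like. This is not the actual java.nio.Bits code: the class name MemoryReservation, the timeout parameter, and the reserved() accessor are assumptions made for illustration only. The reserving thread waits on a condition that only unreserveMemory signals, so no Cleaner is involved in the communication.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.Condition;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical stand-in for the reservation bookkeeping in Bits.java.
final class MemoryReservation {
    private final ReentrantLock lock = new ReentrantLock();
    private final Condition unreserved = lock.newCondition();
    private final long max;   // assumed fixed limit on reserved bytes
    private long reserved;    // guarded by lock

    MemoryReservation(long max) {
        this.max = max;
    }

    // Reserves 'size' bytes, waiting up to timeoutMillis for another
    // thread to call unreserveMemory. Returns false on timeout or interrupt.
    boolean reserveMemory(long size, long timeoutMillis) {
        long nanos = TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
        lock.lock();
        try {
            while (size > max - reserved) {
                if (nanos <= 0L) {
                    return false;
                }
                // awaitNanos returns the remaining wait time
                nanos = unreserved.awaitNanos(nanos);
            }
            reserved += size;
            return true;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        } finally {
            lock.unlock();
        }
    }

    // Releases 'size' bytes and wakes every waiting reserver.
    void unreserveMemory(long size) {
        lock.lock();
        try {
            reserved -= size;
            unreserved.signalAll();
        } finally {
            lock.unlock();
        }
    }

    long reserved() {
        lock.lock();
        try {
            return reserved;
        } finally {
            lock.unlock();
        }
    }
}
```

In this shape a Deallocator would only call unreserveMemory, and the decision about when to trigger System.gc() or give up stays with the caller of reserveMemory.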

Having an arbitrary thread (the one trying to allocate a DirectBuffer) help with the cleaning puts an unknown thread, perhaps with limited stack or AccessControlContext, in place to call the cleaning functions, which is unappealing at best. The cleaning functions are less predictable than the Reference enqueuing functions already discussed but are not much more complex. In most cases they are about the complexity of the Deallocator in Direct-X-Buffer, etc.


Can the pieces be disentangled and still pass the DirectBufferAllocTest?

Roger




On 3/28/2016 1:18 PM, Peter Levart wrote:

Hi Mandy, Kim, Per and Roger
I'd like to continue the discussion about the 2nd part of removing jdk.internal.ref.Cleaner in this discussion thread.
There was some discussion about whether to synchronize with the ReferenceHandler thread and wait for it to enqueue the Reference(s), or simply detect that there are no more pending Reference(s) by timing out on waiting for cleanup actions, in discussion thread: "Re: Analysis on JDK-8022321 java/lang/ref/OOMEInReferenceHandler.java fails intermittently". Based on that discussion, I have prepared a webrev that uses an approach where the detection is performed using timeout:
http://cr.openjdk.java.net/~plevart/jdk9-dev/removeInternalCleaner/webrev.10.part2/
While this webrev passes the DirectBufferAllocTest, I don't have a good feeling about this approach since it is not very robust. I can imagine situations where it would not behave optimally - it would either trigger reference discovery (System.gc()) more frequently than necessary or it would cause delays in execution. So I still prefer the approach where allocating thread(s) explicitly synchronize with the ReferenceHandler thread and wait for it to enqueue pending Reference(s). Luckily this can be performed in an easy way (as I will show you shortly). Waiting on discovery of pending references by the ReferenceHandler thread and handing them to it could be moved to native code so that no notification would have to be performed in native code from the ReferenceHandler thread to the allocating thread(s).
But first, let me reply to Mandy's comments...


On 03/25/2016 11:20 PM, Mandy Chung wrote:
On Mar 19, 2016, at 7:00 AM, Peter Levart<[email protected]>  wrote:

Here's the webrev:

     
http://cr.openjdk.java.net/~plevart/jdk9-dev/removeInternalCleaner/webrev.08.part2/
On 03/07/2016 07:35 PM, Mandy Chung wrote:
I studied webrev.06priv and the history of JDK-6857566.

I’m not comfortable for any arbitrary thread to handle the enqueuing of the 
pending references (this change is more about the fix for JDK-6857566).
Why? A Thread is a Thread is a Thread... When legacy Cleaner is removed, 
ReferenceHandler thread will be left with swapping pointers only - no custom 
code will be involved. The only things I can think of against using arbitrary 
thread are:
:
My uncomfort was the fix for JDK-6857566 - both enqueuing pending ref and 
invoking the cleaning code in an arbitrary thread.

Looking at it again - enqueuing the pending reference is not so much of a 
concern (simply updating the link) but the common cleaner could be used by 
other code that may only expect to be invoked in system thread that’s still my 
concern (thinking of thread locals).
As you'll see in the webrev below, enqueueing is performed solely by the ReferenceHandler thread. Allocating thread(s) just wait for it to do its job. There's a little synchronization action performed at the end of enqueueing a chunk of pending references that notifies waiters (allocating threads) so that they can continue. This actually improves throughput (compared to helping enqueue Reference(s) one by one) because there's not much actual work to be done (just swapping pointers) so synchronization dominates. The goal here is to minimize synchronization among threads, and executing the enqueuing of the whole bunch of pending references in private by a single thread achieves a reduction in synchronization when lots of Reference(s) are discovered at once - precisely the situation when it matters.
OTOH helping the Cleaner thread is beneficial as cleanup actions take time to execute and this is the easiest way to retry allocation while there's still a chance it will succeed. As the common Cleaner is using InnocuousThread, cleanup actions can't rely on any thread locals to be preserved from invocation to invocation anyway - they are cleared after each cleanup action so each action gets empty thread locals. We could simulate this in threads that help execute cleanup actions by saving thread-locals to local variables, clearing thread-locals, executing the cleanup action and then restoring thread-locals from local variables. Mandy, if you think this is important I'll add such save/clear/restore code to the appropriate place.
   On the other hand, invoking Deallocator::run (deallocating the native 
memory) in arbitrary threads has no problem.  Consider me being paranoid of the 
fix for JDK-6857566.  The current list of clients using CleanerFactory::cleaner 
may be safe being called from arbitrary threads but I can’t say what will be 
added in the future.
Right, save/clear/restore thread locals then (left for next webrev)...
The allocating thread may do a System.gc() that may discover phantom reachable 
references.  All it’s interested is only the direct byte buffer ones so that it 
can deallocate the native memory.  What is the downside of having a dedicated 
Cleaner for direct byte buffer that could special case for it?
A dedicated Cleaner for direct buffers might be a good idea if other uses of 
shared Cleaner in JDK become heavy. So that helping process Cleanable(s) does 
not involve other unrelated Cleanable(s). But it comes with a price of another 
dedicated background thread.
Perhaps provide one Cleaner specific for native memory deallocation or anything 
safe to be called in arbitrary thread.  It could provide the entry point for 
the allocating thread to assist the cleaning (i.e. Bits::reserveMemory could 
call it).  That will make it explicit that this cleaner provides explicit 
control for other threads to assist the cleaning action (and JavaLangRefAccess 
would only be used by this special cleaner and not in NIO).

All clients of Unsafe.freeMemory could use that special cleaner for native 
memory deallocation use such as IOVecWrapper, DirectByteBuffer, Marlin’s 
OffHeapArray.

The common cleaner would be kept for other things to use and it should be 
lazily created to avoid another thread.

Does this sound reasonable?

Mandy
Of course. Having specialized Cleaner(s) with additional capability requires an extension to the Cleaner API for some cleaners. Unfortunately java.lang.ref.Cleaner is a final class.
Here's what I propose: by transforming java.lang.ref.Cleaner into an interface implemented by a class in a concealed package (jdk.internal.ref.CleanerImpl) the public API can be left unchanged while the implementation is actually simplified (there's no injection of Cleaner.impl access function into CleanerImpl class needed any more). The result of that transformation is also the ability to specify an extension interface (ExtendedCleaner) located in a concealed package so it can only be used by system code (java.base and modules to which jdk.internal.ref is explicitly exported) and the ability to extend the functionality of the implementation by subclassing it (CleanerImpl.ExtendedImpl). The guts of the previous CleanerImpl are simply moved into a private nested class CleanerImpl.Task:
http://cr.openjdk.java.net/~plevart/jdk9-dev/removeInternalCleaner/webrev.11.part2/
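
For readers without access to the webrev, the shape of that transformation can be sketched roughly as follows. This is only an illustration under stated assumptions, not the webrev code: the Sketch-prefixed type names, the helpClean() method and the field names are hypothetical, everything lives in one package here instead of a public and a concealed one, and the background thread (CleanerImpl.Task) is left out entirely.

```java
import java.lang.ref.PhantomReference;
import java.lang.ref.Reference;
import java.lang.ref.ReferenceQueue;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Public API as an interface (stand-in for java.lang.ref.Cleaner).
interface SketchCleaner {
    interface Cleanable {
        void clean();
    }

    Cleanable register(Object referent, Runnable action);

    static SketchCleaner create() {
        return new SketchCleanerImpl();
    }
}

// Extension interface; in the proposal this would sit in a concealed package.
interface SketchExtendedCleaner extends SketchCleaner {
    // Hypothetical entry point: run one pending cleanup action in the
    // calling thread; returns false if none was pending.
    boolean helpClean();

    static SketchExtendedCleaner create() {
        return new SketchCleanerImpl.ExtendedImpl();
    }
}

// Implementation class (stand-in for jdk.internal.ref.CleanerImpl).
class SketchCleanerImpl implements SketchCleaner {
    final ReferenceQueue<Object> refQueue = new ReferenceQueue<>();
    // Keeps registered cleanables strongly reachable until they are cleaned.
    final Set<SketchCleaner.Cleanable> registered = ConcurrentHashMap.newKeySet();

    final class PhantomCleanable extends PhantomReference<Object>
            implements SketchCleaner.Cleanable {
        private final Runnable action;

        PhantomCleanable(Object referent, Runnable action) {
            super(referent, refQueue);
            this.action = action;
            registered.add(this);
        }

        @Override
        public void clean() {
            // remove() succeeds only once, so the action runs at most once
            if (registered.remove(this)) {
                clear();
                action.run();
            }
        }
    }

    @Override
    public SketchCleaner.Cleanable register(Object referent, Runnable action) {
        return new PhantomCleanable(referent, action);
    }

    // Subclass adding the extended capability.
    static class ExtendedImpl extends SketchCleanerImpl
            implements SketchExtendedCleaner {
        @Override
        public boolean helpClean() {
            Reference<?> ref = refQueue.poll();
            if (ref == null) {
                return false;
            }
            ((SketchCleaner.Cleanable) ref).clean();
            return true;
        }
    }
}
```

Callers that only see SketchCleaner cannot reach helpClean(), which is the point of placing the extension interface in a package that is exported only to selected modules.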
I'm interested in what Roger has to say about this transformation. It is source compatible, but not binary compatible (invokevirtual vs. invokeinterface). So it can be safely performed only before JDK 9 ships.
I packed the entire retry-while-helping mechanics into the implementation of this ExtendedCleaner interface. java.nio.Bits is consequently much simplified. The common cleaner is now an ExtendedCleaner as other usages besides handling deallocation of native memory are minor and are not problematic from the standpoint of arbitrary threads helping with cleanup, especially when saving/clearing/restoring of thread-locals is implemented. It would not be a problem to provide another instance, a simple java.lang.ref.Cleaner this time, for other usages if needed.
And now a few words about the ReferenceHandler thread and synchronization with it (for Kim and Per mostly). I think it should not be a problem to move the following two java.lang.ref.Reference methods to native code if desired:
    static Reference<?> getPendingReferences(int[] discoveryPhaseHolder)
    static int getDiscoveryPhase()
The 1st one is only invoked by the ReferenceHandler thread while the 2nd is invoked by an arbitrary thread. The difference between this and webrev.09.part2 is that there's no need any more for the ReferenceHandler thread to notify the thread executing the 2nd method and that there's no need for the 2nd method to perform any waiting. It just needs to obtain the lock briefly so that it can read the consistent state of two fields. Those two fields are Java static fields currently: Reference.pending & Reference.discoveryPhase, and those two methods are Java methods, but they could be moved to native code if desired to make the protocol between VM and Java code more robust.
So Kim, Per, what do you think of supporting those 2 methods in native code? Would that present any problem?
With webrev.11.part2 I get a 40% improvement in throughput vs. webrev.10.part2 executing DirectBufferAllocTest in 16 allocating threads on a 4-core i7 CPU.
Regards, Peter
