Re: RFR: JDK-8149925 We don't need jdk.internal.ref.Cleaner any more

Peter Levart Thu, 31 Mar 2016 14:14:04 -0700

Hi Roger,

On 03/31/2016 09:12 PM, Roger Riggs wrote:

Hi Peter,
It would be simpler to understand the changes if we solve the problemsone at a time,
at least for review purposes.

Right. We can focus on one aspect at a time, but I'm still trying tokeep the whole thing in a working condition at all times...

To your question in the 2nd part about the Cleaner. (webrev.11.part2)
I don't think the communication between the memory reserving threadand the unreserving threadshould be mixed into the Cleaner design or implementation. The logicfor the communicationbetween reserveMemory and unreserveMemory methods should be in thosetwo methodsand isolated to Bits.java. I understand the intent for the reservingthread to poll for available memoryand it might as well do something useful while it is waiting and get ahint about unreserved memory.
But it mixes together the implementations. (too much)

The problem with reserving thread getting information from unreservingthread solely by communication implemented in methods reserveMemory andunreserveMemory is that this information is not enough. Reserving threadmust also get information about when all the pending unreservations havebeen performed so that it can either:

- trigger System.gc() to discover fresh pending unreservations, or
- finally give up with OOME

In the absence of this information, all reserving thread can do isspeculate about this information by observing the timings ofunreservations happening or not happening - to time out on waiting forunreservations to happen. This works (as shown in webrev.10.part2), butis not very robust or agile - it introduces unnecessary delays. Thisinformation is hidden in ReferenceHandler thread (have all pendingReferences been enqueued?) and in the Cleaner (has the queue drained out?).

I moved the retrial and helping logic to ExtendedCleaner because I thinkit is reusable for other situations. But if you think it doesn't belongto ExtendedCleaner, I can move it back to Bits. We don't strictly needto help the Cleaner thread with cleanups. I did it that way because thisseemed an easy way to communicate the information about the drainedqueue back to allocator thread and to retry reservations at appropriateintervals. But let me think about a way to just get this informationwithout helping - similarly to what I've done it in ReferenceHandler...

Having an arbitrary thread (the one trying to allocate a DirectBuffer)help with the Cleaningputs an unknown thread perhaps with limited stack orAccessControlContext in place to call thecleaning functions is unappealing at best. The cleaning functions areless predictable thanthe Reference enqueuing functions already discussed but are not muchmore complex.In most cases they are about the complexity of the Deallocator inDirect-X-Buffer, etc.

Allocating thread could be "conditioned" before calling the cleanupaction by:

- saving and clearing thread-locals
- saving and setting AccessControlContext to unprivileged.
...and restoring these back after the action

The problem with unsufficient stack is more difficult solve though.Isn't there a new annotation designed to help with that (mainly intendedfor critical sections of java.util.concurrent classes). The problem withusing it here would be in that we don't know what cleanup action(s)might be executed since Cleaner is a general-purpose API and thisannotation is only designed for parts of code that are known in advance...


Can the pieces be disentangled and still pass the DirectBufferAllocTest?

If we want to get that additional piece of information fromReferenceHandler and Cleaner, they must be entangled with Bits, but Imight be able to loosen this entanglement a bit. Will try these ideastomorrow. Stay tuned.


Regards, Peter

Roger




On 3/28/2016 1:18 PM, Peter Levart wrote:
Hi Mandy, Kim, Per and Roger
I'd like to continue the discussion about the 2nd part of removingjdk.internal.ref.Cleaner in this discussion thread.
There was some discussion about whether to synchronize withReferenceHandler thread and wait for it to enqueue the Reference(s)or simply detect that there are no more pending Reference(s) bytiming out on waiting for cleanup actions in discussion thread: "Re:Analysis on JDK-8022321 java/lang/ref/OOMEInReferenceHandler.javafails intermittently". Based on that discussion, I have prepared awebrev that uses an approach where the detection is performed usingtimeout:
http://cr.openjdk.java.net/~plevart/jdk9-dev/removeInternalCleaner/webrev.10.part2/
While this webrev passes the DirectBufferAllocTest, I don't have agood feeling about this approach since it is not very robust. I canimagine situations where it would not behave optimally - it wouldeither trigger reference discovery (System.gc()) more frequently thatnecessary or it would cause delays in execution. So I still preferthe approach where allocating thread(s) explicitly synchronize withReferenceHandler thread and wait for it to enqueue pendingReference(s). Luckily this can be performed in an easy way (as I willshow you shortly). Waiting on discovery of pending references byReferenceHandler thread and handing them to it could be moved tonative code so that no notification would have to be performed innative code from the ReferenceHandler thread to the allocating thread(s).
But first, let me reply to Mandy's comments...


On 03/25/2016 11:20 PM, Mandy Chung wrote:
On Mar 19, 2016, at 7:00 AM, Peter Levart<peter.lev...@gmail.com>  wrote:

Here's the webrev:

     
http://cr.openjdk.java.net/~plevart/jdk9-dev/removeInternalCleaner/webrev.08.part2/
On 03/07/2016 07:35 PM, Mandy Chung wrote:
I studied webrev.06priv and the history of JDK-6857566.

I’m not comfortable for any arbitrary thread to handle the enqueuing of the 
pending references (this change is more about the fix for JDK-6857566).
Why? A Thread is a Thread is a Thread... When legacy Cleaner is removed, 
ReferenceHandler thread will be left with swapping pointers only - no custom 
code will be involved. The only things I can think of against using arbitrary 
thread are:
:
My uncomfort was the fix for JDK-6857566 - both enqueuing pending ref and 
invoking the cleaning code in an arbitrary thread.

Looking at it again - enqueuing the pending reference is not so much of a 
concern (simply updating the link) but the common cleaner could be used by 
other code that may only expect to be invoked in system thread that’s still my 
concern (thinking of thread locals).
As you'll see in the webrev below, enqueueing is performed solely beReferenceHandler thread. Allocating thread(s) just wait for it to doits job. There's a little synchronization action performed at the endof enqueueing a chunk of pending references that notifies waiters(allocating threads) so that they can continue. This actuallyimproves throughput (compared to helping enqueue Reference(s) one byone) because there's not much actual work to be done (just swappingpointers) so synchronization dominates. The goal here is to minimizesynchronization among threads and by executing enqueuing of the wholebunch of pending references in private by a single thread achieves areduction in synchronization when lots of Reference(s) are discoveredat once - precisely the situation when it matters.
OTOH helping the Cleaner thread is beneficial as cleanup actions taketime to execute and this is the easiest way to retry allocation whilethere's still chance it will succeed. As the common Cleaner is usingInnocuousThread, cleanup actions can't rely on any thread locals tobe preserved from invocation to invocation anyway - they are clearedafter each cleanup action so each action gets empty thread locals. Wecould simulate this in threads that help execute cleanup actions bysaving thread-locals to local variables, clearing thread-locals,executing cleanup action and then restoring thread-locals from localvariables. Mandy, if you think this is important I'll add suchsave/clear/restore code to appropriate place.
   On the other hand, invoking Deallocator::run (deallocating the native 
memory) in arbitrary threads has no problem.  Consider me being paranoid of the 
fix for JDK-6857566.  The current list of clients using CleanerFactory::cleaner 
may be safe being called from arbitrary threads but I can’t say what will be 
added in the future.
Right, save/clear/restore thread locals then (left for next webrev)...
The allocating thread may do a System.gc() that may discover phantom reachable 
references.  All it’s interested is only the direct byte buffer ones so that it 
can deallocate the native memory.  What is the downside of having a dedicated 
Cleaner for direct byte buffer that could special case for it?
A dedicated Cleaner for direct buffers might be a good idea if other uses of 
shared Cleaner in JDK become heavy. So that helping process Cleanable(s) does 
not involve other unrelated Cleanable(s). But it comes with a price of another 
dedicated background thread.
Perhaps provide one Cleaner specific for native memory deallocation or anything 
safe to be called in arbitrary thread.  It could provide the entry point for 
the allocating thread to assist the cleaning (i.e. Bits::reserveMemory could 
call it).  That will make it explicit that this cleaner provides explicit 
control for other threads to assist the cleaning action (and JavaLangRefAccess 
would only be used by this special cleaner and not in NIO).

All clients of Unsafe.freeMemory could use that special cleaner for native 
memory deallocation use such as IOVecWrapper, DirectByteBuffer, Marlin’s 
OffHeapArray.

The common cleaner would be kept for other things to use and it should be 
lazily created to avoid another thread.

Does this sound reasonable?

Mandy
Of course. Having specialized Cleaner(s) with additional capabilityrequires extension to the Cleaner API for some cleaners.Unfortunately java.lang.ref.Cleaner is a final class.
Here's what I propose: by transforming java.lang.ref.Cleaner into aninterface implemented by a class in a concealed package(jdk.internal.ref.CleanerImpl) the public API can be left unchangedwhile the implementation is actually simplified (there's no injectionof Cleaner.impl access function into CleanerImpl class needed anymore). The result of that transformation is also the ability tospecify an extension interface (ExtendedCleaner) located in aconcealed package so it can only be used by system code (java.baseand modules to which jdk.internal.ref is explicitly exported) and theability to extend the functionality of implementation by subclassingit (CleanerImpl.ExtendedImpl). The guts of previous CleanerImpl aresimply moved into a private nested class CleanerImpl.Task:
http://cr.openjdk.java.net/~plevart/jdk9-dev/removeInternalCleaner/webrev.11.part2/
I'm interested in what Roger has to say about this transformation. Itis source compatible, but not binary compatible (invokevirtual vs.invokeinterface). So it can be safely performed only before JDK 9 ships.
I packed the entire retry-while-helping mechanics into theimplementation of this ExtendedCleaner interface. java.nio.Bits isconsequently much simplified. The common cleaner is nowExtendedCleaner as other usages besides handling deallocation ofnative memory are minor and are not problematic from the standpointof arbitrary threads helping with cleanup, especially whensaving/clearing/restoring of thread-locals is implemented. It wouldnot be a problem to provide another instance, simplejava.lang.ref.Cleaner this time, for other usages if needed.
And now a few words about ReferenceHandler thread and synchronizationwith it (for Kim and Per mostly). I think it should not be a problemto move the following two java.lang.ref.Reference methods to nativecode if desired:
    static Reference<?> getPendingReferences(int[] discoveryPhaseHolder)
    static int getDiscoveryPhase()
The 1st one is only invoked by a ReferenceHandler thread while the2nd is invoked by arbitrary thread. The difference between this andwebrev.09.part2 is that there's no need any more for ReferenceHandlerthread to notify the thread executing the 2nd method and that there'sno need for the 2nd method to perform any waiting. It just needs toobtain the lock briefly so that it can read the consistent state oftwo fields. Those two fields are Java static fields currently:Reference.pending & Reference.discoveryPhase and those two methodsare Java methods, but they could be moved to native code if desiredto make the protocol between VM and Java code more robust.
So Kim, Per, what do you think of supporting those 2 methods innative code? Would that present any problem?
With webrev.11.part2 I get a 40% improvement in throughput vs.webrev.10.part2 executing DirectBufferAllocTest in 16 allocatingthreads on a 4-core i7 CPU.
Regards, Peter

Re: RFR: JDK-8149925 We don't need jdk.internal.ref.Cleaner any more

Reply via email to