OOMEInReferenceHandler.java failed with java.lang.Exception: Reference Handler thread died

Peter Levart Thu, 07 May 2015 12:42:12 -0700

On 05/07/2015 08:06 PM, Laurent Bourgès wrote:

Peter,

Thanks for long and detailled answer.
I know now better why OOME should not happen. However any applicationmay also use phantom references and the ReferenceHandler thread willcall Cleaner.run () which could catch OOME from the application codeimplementing thunk.run (). Am I right ?

Any application may use PhantomReference(s) but not Cleaner(s). Cleaneris a PhantoReference, but it is a JDK-internal API. I belive in JDK9 itwill not be visible at all (sun.misc.Cleaner). The Cleaner(s) arereserved for JDK-internal use in particular because they utilize asingle ReferenceHandler thread that also serves a vital role ofdispatching of cleared Reference(s) to their ReferenceQueue(s)...

>> If this block also throws a new oome2 due to the first oome1 (nomemory left), it will work but I would have prefered a more explicitsolution and check oome1 first ...
I looked back at your patch and it is fine. Howevdr I wonder if itwould be possible to avoid any allocation in the catch(Throwable) block:
- preallocate the PriviledgeAction
- avoid new Error(x) to get its stack trace ? Do you know any tricklike ones in SharedSecrets that could dump the stack without anyallocation in case of urgency ?

What about the printing path? Who can guarantee that it doesn't use anyallocation? The diagnostic print-out that precedes System.exit() shouldpreferably be equipped with a stack-trace of the original exception.Formatting a stack trace needs allocation.

But it's a good idea to try in that direction too. Perhaps 1st try toprint like now and if OOME #2 is thrown, resort to minimal printing thatdoesn't allocate...

> You have a point and I asked myself the same question. The questionis how to treat OOME thrown from thunk.run(). Current behavior is toexit() JVM for any exception (Throwable). I maintained that semantics.I only added a handler for OOME thrown in the handler of the 1stexception. I might have just exit()-ed the VM if OOME is thrown, butleaving no trace and just exiting VM would not help anyone diagnosewhat went wrong. So I opted for keeping the VM running for a while bydelaying the handling of 1st exception to "better times". If bettertimes never come, then the application is probably stuck anyway.
Seems very good after a 2nd look.
However, it could loop for a while if no more memory left ?
For example: oome1 => oome2 (catch) => throw x=> oome2 (catch) => ....

The retries are based on Cleaner processing code-path liveness. So it'snot really looping and doing nothing. Each time some Cleaner is found tobe processed, the pending exception is checked too. If no Cleaner(s) aredequeued by ReferenceHandler thread after the one that 'saved' theexception, the exception will not be handled and VM will not exit. I'maware of that, so your idea of trying to print something minimal withoutallocation immediately if it can't be printed nicely, seems even moreattractive.

> An alternative would be to catch OOME from thunk.run() and ignore it(printing it out would be ugly if VM is left to run), but that wouldsilently ignore OOMEs thrown from thunk.run() and noone would noticethat Cleaner(s) might not have clean-ed up the resources they should.
I am a bit lost but I like logging such exceptional case but if noallocation can happen, how to ensure logging such case anyway ?
...

I must check printing code-path. Perhaps it doesn't need to allocateanything.

> Anyway. If none of the Cleaner.thunk's run() methods can throw anyexception, then my handling of OOME is redundant and a code-path nevertaken. But I would still leave it there in case some new Cleaner usecomes along which is not verified yet...
Agreed. It is always better to handle such exceptional case if you canat least log them...
Best regards,
Laurent


Regards, Peter

Re: RFR: JDK-8066859 java/lang/ref/OOMEInReferenceHandler.java failed with java.lang.Exception: Reference Handler thread died

Reply via email to