Re: RFR: 8207851 JEP Draft: Support ByteBuffer mapped over non-volatile memory

Peter Levart Fri, 28 Sep 2018 00:23:01 -0700

Hi Stuart,

I mostly agree with your assessment about the suitability of the ByteBuffer API for nice multithreaded use. What would such API look like? I think pretty much like ByteBuffer but without things that mutate mark/position/limit/ByteOrder. A stripped-down ByteBuffer API therefore. That would be in my opinion the most low-level API possible. If you add things to such API that coordinate multithreaded access to the underlying memory, you are already creating a concurrent data structure for a particular set of use cases, which might not cover all possible use cases or be sub-optimal at some of them. So I think this is better layered on top of such API not built into it. Low-level multithreaded access to memory is, in my opinion, always going to be "unsafe" from the standpoint of coordination. It's not only the mark/position/limit/ByteOrder that is not multithreaded-friendly about ByteBuffer API, but the underlying memory too. It would be nice if mark/position/limit/ByteOrder weren't in the way though.
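To make the "stripped-down ByteBuffer" idea concrete, here is a rough sketch of what such a view could look like. FixedBytes and FixedBytesDemo are hypothetical names invented for this illustration; nothing like them exists in the JDK. The sketch only hides mark/position/limit/ByteOrder and does nothing to coordinate access to the underlying memory.

```java
import java.nio.ByteBuffer;

// Hypothetical interface (not a JDK type): absolute access only,
// with no mark, position, limit or byte order to mutate.
interface FixedBytes {
    int capacity();
    byte get(int index);
    void put(int index, byte value);
}

class FixedBytesDemo {
    // Wraps a private duplicate of the source buffer. The duplicate shares
    // the content but has its own position and limit, which this wrapper
    // never changes after construction.
    static FixedBytes wrap(ByteBuffer source) {
        ByteBuffer view = source.duplicate();
        view.clear(); // position = 0, limit = capacity
        return new FixedBytes() {
            public int capacity() { return view.capacity(); }
            public byte get(int index) { return view.get(index); }
            public void put(int index, byte value) { view.put(index, value); }
        };
    }
}
```

Because the wrapper keeps its own duplicate, later changes to the source buffer's limit do not affect it. Any coordination between threads would still have to be layered on top.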


Regards, Peter

On 09/28/2018 07:51 AM, Stuart Marks wrote:

Hi Andrew,
Let me first say that this issue of "ByteBuffer might not be the right answer" is something of a digression from the JEP discussion. I think the JEP should proceed forward using MBB with the API that you and Alan had discussed previously. At most, the discussion of the "right thing" issue might affect a side note in the JEP text about possible limitations and future directions of this effort. However, it's not a blocker to the JEP making progress as far as I'm concerned.
With that in mind, I'll discuss the issue of multithreaded access to ByteBuffers and how this bears on whether buffers are or aren't the "right answer." There are actually several issues that figure into the "right answer" analysis. In this message, though, I'll just focus on the issue of multithreaded access.
To recap (possibly for the benefit of other readers) the Buffer class doc has the following statement:
    Buffers are not safe for use by multiple concurrent threads. If a buffer is to be used by more than one thread then access to the buffer should be controlled by appropriate synchronization.
Buffers are primarily designed for sequential operations such as I/O or codeset conversion. Typical buffer operations set the mark, position, and limit before initiating the operation. If the operation completes partially -- not uncommon with I/O or codeset conversion -- the position is updated so that the operation can be resumed easily from where it left off.
The fact that buffers not only contain the data being operated upon but also mutable state information such as mark/position/limit makes it difficult to have multiple threads operate on different parts of the same buffer. Each thread would have to lock around setting the position and limit and performing the operation, preventing any parallelism. The typical way to deal with this is to create multiple buffer slices, one per thread. Each slice has its own mark/position/limit values but shares the same backing data.
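For readers who haven't seen the slice-per-thread workaround, a minimal sketch follows. The class and method names (SliceDemo, fillWithSlices) are made up for this example, and it assumes a heap buffer so that array() can be used to inspect the result.

```java
import java.nio.ByteBuffer;

// Hypothetical demo class; names are illustrative only.
class SliceDemo {
    // Fills one shared buffer using a separate slice per worker thread.
    static byte[] fillWithSlices(int workers, int chunk) {
        ByteBuffer shared = ByteBuffer.allocate(workers * chunk);
        Thread[] threads = new Thread[workers];
        for (int i = 0; i < workers; i++) {
            // Carve out the slice on the coordinating thread, before any
            // worker runs, so the shared buffer's own state is never raced on.
            ByteBuffer view = shared.duplicate();
            view.position(i * chunk);
            view.limit((i + 1) * chunk);
            ByteBuffer slice = view.slice();
            byte value = (byte) (i + 1);
            // Each worker uses relative puts on its own slice only.
            threads[i] = new Thread(() -> {
                while (slice.hasRemaining()) {
                    slice.put(value);
                }
            });
        }
        for (Thread t : threads) {
            t.start();
        }
        try {
            for (Thread t : threads) {
                t.join(); // join makes each worker's writes visible here
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return shared.array();
    }
}
```

Each slice carries its own mark/position/limit, so the workers never touch shared buffer state, only disjoint regions of the shared backing data.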
We can avoid the need for this by adding absolute bulk operations, right?
Let's suppose we were to add something like this (considering ByteBuffer only, setting the buffer views aside):
    get(int srcOff, byte[] dst, int dstOff, int length)
    put(int dstOff, byte[] src, int srcOff, int length)
Each thread can perform its operations on a different part of the buffer, in parallel, without interference from the others. Presumably these operations don't read or write the mark and position. Oh, wait. The existing absolute put and get overloads *do* respect the buffer's limit, so the absolute bulk operations ought to as well. This means they do depend on shared state. (I guess we could make the absolute bulk ops not respect the limit, but that seems inconsistent.)
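The dependence on the limit is easy to demonstrate with the existing single-byte absolute get and put. The sketch below uses only those two methods; the class and method names are invented for the example.

```java
import java.nio.ByteBuffer;

// Hypothetical demo class; names are illustrative only.
class AbsoluteLimitDemo {
    // Returns true if an absolute get at `index` is rejected once the
    // buffer's limit has been lowered to `newLimit`.
    static boolean absoluteGetRejected(int capacity, int newLimit, int index) {
        ByteBuffer buf = ByteBuffer.allocate(capacity);
        buf.put(index, (byte) 42); // absolute put: position is untouched
        buf.limit(newLimit);       // shared state changed by "someone else"
        try {
            buf.get(index);        // absolute get still checks the limit
            return false;
        } catch (IndexOutOfBoundsException e) {
            return true;
        }
    }
}
```

With a capacity of 8 and the limit lowered to 4, an absolute get at index 6 fails while one at index 2 succeeds, even though neither call reads or writes the position.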
OK, let's adopt an approach similar to what was described by Peter Levart a couple messages upthread, where a) there is an initialization step where various things including the limit are set properly; b) the buffer is published to the worker threads properly, e.g., using a lock or other suitable memory operation; and c) all worker threads agree only to use absolute operations and to avoid relative operations.
Now suppose the threads have completed their work and you want to, say, write the buffer's contents to a channel. You have to carefully make sure the threads are all finished and properly publish their results back to some central thread, have that central thread receive the results, set the position and limit, after which the central thread can initiate the I/O operation.
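As a rough sketch of that sequence, assuming Thread.start() and Thread.join() serve as the publication points and an in-memory stream stands in for the real channel (PhasedBufferDemo and run are names made up for this example):

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.Channels;
import java.nio.channels.WritableByteChannel;

// Hypothetical demo class; names are illustrative only.
class PhasedBufferDemo {
    static byte[] run(int workers, int chunk) {
        // a) initialization on the central thread: limit = capacity
        ByteBuffer buf = ByteBuffer.allocate(workers * chunk);
        buf.clear();

        // b) Thread.start() publishes the initialized buffer to each worker
        Thread[] threads = new Thread[workers];
        for (int i = 0; i < workers; i++) {
            final int base = i * chunk;
            final byte value = (byte) (i + 1);
            threads[i] = new Thread(() -> {
                // c) workers agree to use absolute operations only
                for (int j = 0; j < chunk; j++) {
                    buf.put(base + j, value);
                }
            });
            threads[i].start();
        }

        // Wait for every worker; join() publishes their writes back
        try {
            for (Thread t : threads) {
                t.join();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }

        // Only now may the central thread touch position and limit
        buf.position(0);
        buf.limit(workers * chunk);
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        try (WritableByteChannel ch = Channels.newChannel(sink)) {
            while (buf.hasRemaining()) {
                ch.write(buf); // relative operation: advances the position
            }
        } catch (IOException e) {
            throw new IllegalStateException(e);
        }
        return sink.toByteArray();
    }
}
```

Nothing in the types stops a worker from calling a relative method, or the central thread from changing the limit, while the workers are still running; the phases exist only by convention.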
This can certainly be made to work.

But note what we just did. We now have an API where:
- there are different "phases", where in one phase all the methods work, but in another phase only certain methods work (otherwise it breaks silently);
- you have to carefully control all the code to ensure that the wrong methods aren't called when the buffer is in the wrong phase (otherwise it breaks silently); and
- you can't hand off the buffer to a library (3rd party or JDK) without carefully orchestrating a transition into the right phase (otherwise it breaks silently).
Frankly, this is pretty crappy. It's certainly possible to work around it. People do, and it is painful, and they complain about it up and down all day long (and rightfully so).
Note that this discussion is based primarily on looking at the ByteBuffer API. I have not done extensive investigation of the impact of the various buffer views (IntBuffer, LongBuffer, etc.), nor have I looked thoroughly at the implementations. I have no doubt that we will run into additional issues when we do those investigations.
If we were designing an API to support multi-threaded access to memory regions, it would almost certainly look nothing like the buffer API. This is what Alan means by "buffers might not be the right answer." As things stand, it appears quite difficult to me to fix the multi-threaded access problem without turning buffers into something they aren't, or fragmenting the API in some complex and uncomfortable way.
Finally, note that this is not an argument against adding bulk absolute operations! I think we should probably go ahead and do that anyway. But let's not fool ourselves into thinking that bulk absolute operations solve the multi-threaded buffer access problem.
s'marks
