Re: RFR(trivial): 8222394: HashMap.compute() throws CME on an empty Map if clear() called concurrently

Stuart Marks Wed, 01 May 2019 17:40:01 -0700

...merely to serve as a discussion point about the policy for throwingConcurrentModificationException?

Yes, for the time being, I want to see and welcome more ideas on this. Itseems to me that the policy for throwing CME here is not a unified one,mostly based on experience and testing. Clear, compute, and computeIfAbsentare more special as I described.


OK. For reference, here are some of the words from the
ConcurrentModificationException specification: [1]

This exception may be thrown by methods that have detected concurrent
modification of an object when such modification is not permissible.

For example, it is not generally permissible for one thread to modify a
Collection while another thread is iterating over it. In general, the results
of the iteration are undefined under these circumstances. Some Iterator
implementations (including those of all the general purpose collection
implementations provided by the JRE) may choose to throw this exception if
this behavior is detected. Iterators that do this are known as fail-fast
iterators, as they fail quickly and cleanly, rather that risking arbitrary,
non-deterministic behavior at an undetermined time in the future.

Note that this exception does not always indicate that an object has been
concurrently modified by a different thread. If a single thread issues a
sequence of method invocations that violates the contract of an object, the
object may throw this exception. For example, if a thread modifies a
collection directly while it is iterating over the collection with a
fail-fast iterator, the iterator will throw this exception.

Note that fail-fast behavior cannot be guaranteed as it is, generally
speaking, impossible to make any hard guarantees in the presence of
unsynchronized concurrent modification. Fail-fast operations throw
ConcurrentModificationException on a best-effort basis. Therefore, it would
be wrong to write a program that depended on this exception for its
correctness: ConcurrentModificationException should be used only to detect
bugs.

[1]https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/util/ConcurrentModificationException.html



Similar words are repeated in several different locations around the
specification, such as in the ArrayList and HashMap class specifications.

I'm not entirely sure what your concerns are with
ConcurrentModificationException (and the "fail-fast" concurrent modification
policy), but let me discuss a few points.

1. throwing of CME is not guaranteed - "best effort"

Unlike most Java specifications, the specification around CME is fairlyindefinite. The wording is hedged -- "This exception may be thrown...." Thisimplies that CME might or might not be thrown, even in cases where one mightexpect it to be.

It also says that CME is thrown on a "best effort" basis. This doesn't mean thatthe library makes the maximum possible effort to throw CME in every possiblesituation. Maybe "best effort" is somewhat misleading. Perhaps "reasonable"effort is more descriptive.

For example, ArrayList keeps a modCount field and increments and checks itoccasionally. No synchronization is done. If the ArrayList is modified byanother thread, the update to modCount might not be visible to all threads,which might result in data corruption instead of a CME.

One way to "fix" this would be to make access to modCount synchronized (or tomake it volatile, or to make it an AtomicInteger or something) to improve thereliability of detecting concurrent modifications from other threads. This wouldadd complexity to the code and also slow down common operations. Making thisextra effort doesn't seem to be worthwhile.


2. throwing CME sometimes done even when not absolutely necessary

Another point is that the detection and throwing of a CME is an approximation ofwhen concurrent modification would have any impact. In some cases CME will bethrown even when one wouldn't think it strictly necessary.

For example, consider a loop in the middle of iterating an ArrayList.ArrayList's iterator simply keeps an index to represent its current position. Ifan element is added to or removed from the front of the list, this would resultin the iteration skipping or repeating elements. Thus, throwing CME seemswarranted in this case.

Now consider an iteration in the middle of an ArrayList, and an addition orremoval is made to the *end* of the list. This doesn't affect the currentiteration; yet CME is thrown anyway. Why?

To avoid throwing CME in this case (but to throw it in the previous case) theArrayList and its Iterators would have to keep track of more information aboutwhat changes were made and would have to do more checking at each iterationstep. This could increase code complexity considerably. Again, this seems likeit isn't worthwhile. Keeping a simple counter (modCount) and checking it at eachiteration step is quite cheap, although it arguably does throw CME unnecessarilyin this case.

Some of the cases you're talking about seem to fall into this category. A CME isthrown from compute() if an operation is merely attempted, even if the actualoperation performed would have no ill effect.


3. state-dependent behavior

I discussed this in a previous message. My personal design style is to try toavoid this, although the library isn't wholly consistent in this regard. Thediscussion regarding CME and the compute() and similar methods is also relatedto state-dependent behavior.


4. edge cases

There are a number of edge cases that aren't treated wholly consistently acrossthe libraries. Again with ArrayList, consider the following:


        List<Integer> list = new ArrayList<>(List.of(0, 1, 2))
    [0, 1, 2]
        var it = list.iterator()
        it.hasNext()
    true
        it.next()
    0
        it.hasNext()
    true
        it.next()
    1
        list.remove(0)
    0
        it.hasNext()
    false

Arguably, hasNext() should throw CME. If this were in a for-loop, the concurrentmodification would be missed and the loop would terminate normally. Manyiterators check for concurrent modification in their next() method but not inhasNext(). Perhaps this should be fixed, but it might break code that isapparently behaving well today, so we've left it unchanged.

There is also the case of JDK-8114832, where a failed attempt at modificationwill still cause a CME. This is more state-dependent behavior: should modCountbe incremented when a modification is *attempted* or when modification*actually* occurs? (Martin says, "attempted murder is still a crime!")

This bug is still open, though Martin and I agree that no change should be madehere. It's questionable to me whether we want to go through the old collectionsand update things to be more consistent. The effort is high, the benefit isfairly low, in my estimation, and there is a risk of breaking existing code. Sowe live with the inconsistencies.


*******

I'm kind of rambling here. Is this the kind of discussion you're interested in?Do you have any specific questions?


s'marks

Re: RFR(trivial): 8222394: HashMap.compute() throws CME on an empty Map if clear() called concurrently

Reply via email to