Re: Proposal: optimization of Map.copyOf and Collectors.toUnmodifiableMap

Roger Riggs Mon, 18 Jun 2018 07:27:01 -0700

Hi Stuart,

In regard to new SharedSecret interfaces, one option is move shared (butprivate) implementation classesto a jdk.internal.xx package (not exported). This only works well ifthey are not tightly coupled to other

package private classes.

SpinedBuffer might be a good candidate, I have some IO cases in mindthat could benefit from

the allocation/reallocation savings.  (ByteArrayOutputStream for 1).

Regards, Roger

On 6/15/2018 5:22 PM, Stuart Marks wrote:

Hi Peter,

Finally getting back to this.
I think there's clearly room for improvement in the creation of theunmodifiable collections. Right now the creation paths (mostly) useonly public APIs, which can result in unnecessary allocation andcopying. Certainly Map.copyOf does this, but there are also othercases as well.
For copying from an unknown map, there are a couple approaches:
* accumulate keys and values in a single ArrayList, which can bepresized as necessary, but which will grow if necessary; then copyelements from a subrange of ArrayList's internal array (similar toyour MapN(Object[], len) overload)
* accumulate keys and values into a SpinedBuffer, which doesn'trequire copying to grow, which is preferable if for some reason wecan't pre-size accurately; and then copy the elements out of it
The Collectors.toUnmodifiableMap implementations are known to createHashMap instances, so they could pass the HashMap directly to aprivate MapN constructor that in turn could talk directly to HashMapto get the keys and values. This avoids allocation of a full-sizedbuffer and one copy.
Note that these techniques involve creating new interfaces, sometimesthat cross package boundaries. It's a bit of an irritant to have toplumb new paths that go through SharedSecrets, but it seems likely tobe worthwhile if we can avoid bulk allocation and copying steps.
Given these, it doesn't seem to me that the BiBuffer approach helpsvery much. I think there are many other avenues that would beworthwhile to explore, and that possibly can provide bigger savings.
s'marks





On 6/11/18 3:57 AM, Peter Levart wrote:
Hi,
Those two methods were added in JDK 10 and they are not very optimal.Map.copyOf(Map map) 1st dumps the source map into an array ofMap.Entry(s) (map.entrySet().toArray(new Entry[0])), which typicallycreates new Map.Entry objects, then passes the array toMap.ofEntries(Map.Entry[] entries) factory method that iterates thearray and constructs a key/value interleaved array from it which ispassed to ImmutableCollections.MapN constructor, which thenconstructs a linear-probing hash table from it. So each key and valueis copied 3 times, while several temporary objects are created in theprocess.
One copying step could be eliminated and construction of temporaryMap.Entry objects too:
http://cr.openjdk.java.net/~plevart/jdk-dev/UnmodifiableMap_copyOf/webrev.01/
Collecting stream(s) using Collectors.toUnmodifiableMap() 1stcollects key/value pairs into a HashMap, then it performs equivalentoperations as Map.copyOf(hashMap) at the end. Using Map.copyOf()directly benefits this collection operation too.
The following benchmark:
http://cr.openjdk.java.net/~plevart/jdk-dev/UnmodifiableMap_copyOf/UnmodifiableMapBench.java
Shows up to 30% improvement of .copyOf operation with this patchapplied:
Original:
Benchmark (size) Mode Cnt Score Error UnitsUnmodifiableMapBench.copyOf 10 avgt 10 403.633 ±2.640 ns/opUnmodifiableMapBench.copyOf 100 avgt 10 3489.623 ±44.590 ns/opUnmodifiableMapBench.copyOf 1000 avgt 10 40030.572 ±277.075 ns/opUnmodifiableMapBench.toUnmodifiableMap 10 avgt 10 831.221 ±3.816 ns/opUnmodifiableMapBench.toUnmodifiableMap 100 avgt 10 9783.519 ±43.097 ns/opUnmodifiableMapBench.toUnmodifiableMap 1000 avgt 10 96524.536 ±670.818 ns/op
Patched:
Benchmark (size) Mode Cnt Score Error UnitsUnmodifiableMapBench.copyOf 10 avgt 10 264.172 ±1.882 ns/opUnmodifiableMapBench.copyOf 100 avgt 10 2318.974 ±15.877 ns/opUnmodifiableMapBench.copyOf 1000 avgt 10 29291.782 ±3139.737 ns/opUnmodifiableMapBench.toUnmodifiableMap 10 avgt 10 771.221 ±65.432 ns/opUnmodifiableMapBench.toUnmodifiableMap 100 avgt 10 9275.016 ±725.722 ns/opUnmodifiableMapBench.toUnmodifiableMap 1000 avgt 10 82204.342 ±851.741 ns/op
Production of garbage is also reduced, since no Map.Entry temporaryobjects are constructed:
Original:

Benchmark (size)  Mode  Cnt      Score       Error   Units
UnmodifiableMapBench.copyOf:·gc.alloc.rate.norm 10 avgt 10 416.001± 0.002 B/opUnmodifiableMapBench.copyOf:·gc.alloc.rate.norm 100 avgt 102936.005 ± 0.019 B/opUnmodifiableMapBench.copyOf:·gc.alloc.rate.norm 1000 avgt 1028136.059 ± 0.199 B/opUnmodifiableMapBench.toUnmodifiableMap:·gc.alloc.rate.norm 10 avgt10 1368.001 ± 0.004 B/opUnmodifiableMapBench.toUnmodifiableMap:·gc.alloc.rate.norm 100 avgt10 10208.139 ± 0.045 B/opUnmodifiableMapBench.toUnmodifiableMap:·gc.alloc.rate.norm 1000 avgt10 93025.923 ± 0.573 B/op
Patched:

Benchmark (size)  Mode  Cnt      Score       Error   Units
UnmodifiableMapBench.copyOf:·gc.alloc.rate.norm 10 avgt 10 304.000± 0.001 B/opUnmodifiableMapBench.copyOf:·gc.alloc.rate.norm 100 avgt 102464.004 ± 0.012 B/opUnmodifiableMapBench.copyOf:·gc.alloc.rate.norm 1000 avgt 1024064.040 ± 0.137 B/opUnmodifiableMapBench.toUnmodifiableMap:·gc.alloc.rate.norm 10 avgt10 1256.001 ± 0.003 B/opUnmodifiableMapBench.toUnmodifiableMap:·gc.alloc.rate.norm 100 avgt10 9720.153 ± 0.055 B/opUnmodifiableMapBench.toUnmodifiableMap:·gc.alloc.rate.norm 1000 avgt10 88905.688 ± 0.574 B/op
So what do you think? Is this an improvement?

Regards, Peter

Re: Proposal: optimization of Map.copyOf and Collectors.toUnmodifiableMap

Reply via email to