Re: HashMap collision speed (regression 7->8)

Peter Levart Sun, 11 Jan 2015 07:10:51 -0800

Hi,

I wasn't comfortable with Bernd's HMH benchmark results jitter, so Ichanged the mode of operation to be SingleShotTime (since a particularinvocation is from 0.6 to 3sec anyway). GC is triggered before eachinvocation (-gc true). I also added -XX:-TieredCompilation VM option andrun 6 forks of 10 iterations of each test. By Doug's suggestion I alsoadded a variant of unchanged HashMap where TREEIFY_THRESHOLD = 1 << 20,UNTREEIFY_THRESHOLD = TREEIFY_THRESHOLD - 2, MIN_TREEIFY_CAPACITY =TREEIFY_THRESHOLD * 4 as a reference to compare with. Here are the results:


Original JDK9 HashMap:

Benchmark (initialSize) ModeSamples Score Score error Unitsj.t.HashMapCollision.badDistNoComp 16 ss 603011.738 78.249 msj.t.HashMapCollision.badDistWithComp 16 ss 602984.280 48.315 msj.t.HashMapCollision.goodDistNoComp 16 ss 60682.060 52.341 msj.t.HashMapCollision.goodDistWithComp 16 ss 60685.705 55.183 ms


Original JDK9 HashMap with TREEIFY_THRESHOLD = 1 << 20:

Benchmark (initialSize) ModeSamples Score Score error Unitsj.t.HashMapCollision.badDistNoComp 16 ss 602780.771 236.647 msj.t.HashMapCollision.badDistWithComp 16 ss 602541.740 233.429 msj.t.HashMapCollision.goodDistNoComp 16 ss 60757.364 67.869 msj.t.HashMapCollision.goodDistWithComp 16 ss 60671.617 54.943 ms

Caching of comparableClassFor (in ClassRepository - good forheterogeneous keys too):

Benchmark (initialSize) ModeSamples Score Score error Unitsj.t.HashMapCollision.badDistNoComp 16 ss 603014.888 71.778 msj.t.HashMapCollision.badDistWithComp 16 ss 602279.757 54.159 msj.t.HashMapCollision.goodDistNoComp 16 ss 60760.743 70.674 msj.t.HashMapCollision.goodDistWithComp 16 ss 60725.188 67.853 ms


Caching of comparableClassFor (internally - good for homogeneous keys only):

Benchmark (initialSize) ModeSamples Score Score error Unitsj.t.HashMapCollision.badDistNoComp 16 ss 603026.707 84.571 msj.t.HashMapCollision.badDistWithComp 16 ss 602137.296 66.140 msj.t.HashMapCollision.goodDistNoComp 16 ss 60635.964 8.213 msj.t.HashMapCollision.goodDistWithComp 16 ss 60685.129 46.783 ms




Regards, Peter

On 01/11/2015 12:55 PM, Peter Levart wrote:

On 01/11/2015 02:27 AM, Martin Buchholz wrote:
Peter,
You are adding the ability to add "app-specific storage" to Classobjects ("Class-local variables"?), which is pretty unusual.
Well, that was my intention, since the logic about what should becached is very specific to the usecase and might change in the future.Anyway, this is only internal API. Users have a public alternative inClassValue. That's one reason. The other is space overhead introducedwhen caching with ClassValue and inability to initialize ClassValue sovery early in the boot-up sequence.
I was thinking instead of a very dumb 1-element cache, rememberingClass and comparableClassFor, which will work for typical homogeneousHashMaps.
This seems like a good idea. We would actually need only one field oftype Class<?> and a boolean flag.
Unfortunately, comparableClassFor is a static method used also fromvarious other contexts that don't have access to HashMap instance, forexample from TreeNode. We would have to extend the internal API withan additional HashMap argument to pass the HM instance around. Not tomention that this would be tricky because retaining the last usedcomparable Class object in the HM instance could prevent GC fromreleasing a ClassLoader in an app server environment for example. AWeakReference<Class<?>> would have to be used and new WeakReferenceobject created each time cached value changes. Unless we cache onlythe 1st comparableClassFor result and never change it, which has thesame cache-hit ratio for homogeneous keys.
Right, here's what this looks like:

http://cr.openjdk.java.net/~plevart/jdk9-dev/HM.comparableClassFor/HomogeneousKeysCache/webrev.01/
I modified Bernd's JMH benchmark a little to use ThreadLocalRandominsted of Random, so results express more what is going on withHashMap and less with Random synchronization:
http://cr.openjdk.java.net/~plevart/jdk9-dev/HM.comparableClassFor/HashMapCollision.java

Results:

Original JDK9 HashMap:
Benchmark (initialSize) ModeSamples Score Score error Unitsj.t.HashMapCollision.badDistNoComp 16 avgt6 3101.247 435.866 ms/opj.t.HashMapCollision.badDistWithComp 16 avgt6 2410.202 478.247 ms/opj.t.HashMapCollision.goodDistNoComp 16 avgt6 615.100 7.063 ms/opj.t.HashMapCollision.goodDistWithComp 16 avgt6 614.229 159.558 ms/op
Caching of comparableClassFor (in ClassRepository - good forheterogeneous keys too):
Benchmark (initialSize) ModeSamples Score Score error Unitsj.t.HashMapCollision.badDistNoComp 16 avgt6 3305.967 652.791 ms/opj.t.HashMapCollision.badDistWithComp 16 avgt6 2030.965 241.910 ms/opj.t.HashMapCollision.goodDistNoComp 16 avgt6 611.202 6.440 ms/opj.t.HashMapCollision.goodDistWithComp 16 avgt6 582.890 4.896 ms/op
Caching of comparableClassFor (internally - good for homogeneous keysonly):
Benchmark (initialSize) ModeSamples Score Score error Unitsj.t.HashMapCollision.badDistNoComp 16 avgt6 3265.673 660.030 ms/opj.t.HashMapCollision.badDistWithComp 16 avgt6 1875.204 224.682 ms/opj.t.HashMapCollision.goodDistNoComp 16 avgt6 598.949 25.484 ms/opj.t.HashMapCollision.goodDistWithComp 16 avgt6 585.278 8.103 ms/op
Regards, Peter
On Sat, Jan 10, 2015 at 5:01 AM, Peter Levart <[email protected]<mailto:[email protected]>> wrote:
    On 01/10/2015 01:20 AM, Doug Lea wrote:
    On 01/09/2015 06:29 PM, Martin Buchholz wrote:
    Given the prevalence of sub-optimal hashcodes, my own intuition
    is also that
    raising the treeification threshold from 8 will be a win.
    That's what I thought at first. But 8 is a better choice for String
    and other Comparable keys, which account for the majority of
    HashMaps
    out there. (For non-comparables, infinity is the best threshold.)
    How much slower should we make the most common cases to make the
    others
    faster? The only way to decide empirically is to take a large
    corpus of programs and vary thresholds. Short of that, speeding up
    comparableClassFor is still the best bet for reducing impact on
    non-comparables.
    Hi Doug,

    comparableClassFor() for non-comparables that don't implement
    Comparable is already as fast as it can be (the 1st check is
    instanceof Comparable). For other comparables (and
    non-comparables) that implement Comparable (except for String
    which is special-cased), we could improve the situation by
    caching the result.

    Here's another attempt at that. This time it uses plain old JDK1
    stuff, so it actually works even in HashMap (using
    IdentityHashMap so no danger of circular usage if it is to be
    applied to CHM also):

    
http://cr.openjdk.java.net/~plevart/jdk9-dev/Class.getGenericDerivative/webrev.01/
    
<http://cr.openjdk.java.net/%7Eplevart/jdk9-dev/Class.getGenericDerivative/webrev.01/>

    With this patch, the results of Bernd's JMH benchmark do give
    some boost to keys that implement Comparable (badDistWithComp case).

    These are the results with original JDK9 HashMap:

    Benchmark (initialSize)   Mode   Samples        Score Score
    error    Units
j.t.HashMapCollision.badDistNoComp 16 avgt 63104.047 278.057 ms/opj.t.HashMapCollision.badDistWithComp 16 avgt 62754.499 243.780 ms/opj.t.HashMapCollision.goodDistNoComp 16 avgt 61031.992 26.422 ms/opj.t.HashMapCollision.goodDistWithComp 16 avgt 61082.347 30.981 ms/op
    And this is with patch applied:

    Benchmark (initialSize)   Mode   Samples        Score Score
    error    Units
j.t.HashMapCollision.badDistNoComp 16 avgt 63081.419 386.125 ms/opj.t.HashMapCollision.badDistWithComp 16 avgt 62116.030 281.160 ms/opj.t.HashMapCollision.goodDistNoComp 16 avgt 61015.224 81.843 ms/opj.t.HashMapCollision.goodDistWithComp 16 avgt 61078.719 38.351 ms/op
    Caching is performed as part of Class generic types information
    caching (ClassRepository), so there's no overhead for those that
    don't need generic types information. All logic is kept inside (C)HM.

    Regards, Peter
    -Doug

Re: HashMap collision speed (regression 7->8)

Reply via email to