Re: RFR - 8132734: java.util.jar.* changes to support multi-release jar files

Xueming Shen Fri, 12 Feb 2016 17:21:02 -0800


Steve,

I would assume the difference of the webrev.04 "old" iterator and thewebrev.06 "new" iteratorthat might trigger a performance is how you create the JarFileEntry. Theone parameter constructorinvokes isMultiRelease(), which might be relatively expensive, when the"mr" is enabled. There

are couple "if" checks involved.

For the "new" iterator is slower than the current jdk9 one. It might bedesired to have two JarEntryIteratorclasses defined, one for "notVersioned", one for the "versioned". Ofcourse keep the two parameterconstructor for the "notVersioned". This might bring performance backfor the normal "notVersioned"

usage, which I would assume is the majority.

There is also a "bug" in the new iterator. The iterator/Enumerationreturned from ZipFile.entries()checks "ensureOpen() in both hasNext() and next(). So close() theZipFile after itr.hasNext() willfails the next next() invocation in the old implementation. The latestitr will returns the cached"ze". Not a big issue for sure, but a kind of "regression". Defining twoiterator classes as suggestedabove will also workaround this issue, as the "notVersioned" one willwork just as expected, no

regression.

-Sherman

On 2/9/16 5:04 PM, Steve Drach wrote:

Hi,
Yet another webrev,http://cr.openjdk.java.net/~sdrach/8132734/webrev.06/<http://cr.openjdk.java.net/%7Esdrach/8132734/webrev.06/>, with achange to JarEntryIterator to fix a problem discovered by performancetests — calling hasNext() twice as often as needed. I also removedthe @since 9 tags from the methods entries() and stream(), and addedan additional sentence to the spec for each of those methods thatclarifies what a base entry is (actually is not).
I think having stream and entries do this is right although I wouldlike to see some performance data if possible.
Seehttp://cr.openjdk.java.net/~sdrach/8132734/JarFile%20Performance.pdf<http://cr.openjdk.java.net/%7Esdrach/8132734/JarFile%20Performance.pdf>
I used JMH to run the benchmark. Seehttp://cr.openjdk.java.net/~sdrach/8132734/MyBenchmark.java<http://cr.openjdk.java.net/%7Esdrach/8132734/MyBenchmark.java>. Iused two jar files, the rt.jar file from JDK 8 that has 20653 entriesand the multi-release.jar found in the test directory with 14 entries.Obviously rt.jar is not a multi-release jar file.
The first two tables (1 and 2) are comparable and the second twotables are somewhat comparable (3 and 4).
Tables 1 and 2 have 4 sections that show the results of tests on thetwo jar files in 4 configurations of JarFile. The tests were donewith a JarFile object constructed without the Release object argument,essentially the legacy constructor. The section labeled "JDK 8JarFile” was done with JDK 8u66. The section labeled “JDK 9 JarFile”was done with the latest build of openjdk/jdk9/dev without any changesin my 8132734 changeset. I chose this section as the reference, sothe last column shows the values normalized to 1 micro/milli secondper operation (rt.jar times are in milliseconds and multi-release.jartimes are in microseconds). It should be obvious that JDK 9 is muchfaster than JDK 8, somewhere on the order of 5-6 times faster. Ithink that is because ZipFile doesn’t use JNI in JDK 9.
Of the two remaining sections in Tables 1 and 2, the section labeled“MultiRelease JarFile” differs from the section labeled “MultiReleaseJarFile, new iterator” only in the JarEntryIterator class. The firstone uses the original iterator in JarFile.java that can be seenstarting with line 551 of webrev.04<http://cr.openjdk.java.net/%7Esdrach/8132734/webrev.04/>, and the newiterator starts with line 553 of webrev.06<http://cr.openjdk.java.net/%7Esdrach/8132734/webrev.06/>. Theresults are strange. The new, more complex, iterator appears to befaster than the old, simpler, iterator. I double, and triple, checkedit, but it was always faster. I used jitwatch to look at the hotspotlogs generated during compilation and neither method was compiled. Isuppose I could dig into it further but decided not to. Consider itgood news. The results do show that the multi-release enhancementslows JarFile entries/stream down by 2-18% depending on the size ofthe jar file. But they are still much better than the JDK 8 values.
Also I would expect that if a JAR file is not mult-release but thelibrary opens it with Release.RUNTIME to perform the same as openingthe JAR file with the Release-less constructors.
The results in Table 3 attempts to answer this question since rt.jaris not a multi-release jar file. This tells me that if one opens theJarFile with Release.RUNTIME, that there is a performance penalty of2-6% on this very large jar file.
Finally, the results in Table 4 tell me that processing a truemulti-release jar file takes about 80% more time per entries() orstream() operation. I’ve looked at this in a profiler and there is noparticular area that stands out to me, it’s just more complicated toprocess a multi-release jar file, as would be expected.
I think the javadoc will need to also need to make it clear whetherentries with names starting with META-INF/versions/ are returned.
Fixed.
I see you've added @since 9 to the existing methods, I assume youdidn't mean to do this.
Fixed.
At some point then we need to discuss how RUNTIME_VERSION iscomputed. Iris (via Mandy) has pushed jdk.Version to jdk9/dev buthaving it exported by java.base conflicts with the design principlesin JEP 200. Moving it to another module means that code in java.basecannot use it and thus the JAR file can't use it.
Left as is.

Re: RFR - 8132734: java.util.jar.* changes to support multi-release jar files

Reply via email to