Re: [cp-patches] RFC: changes to java.lang.Integer, Long...

Ian Rogers Mon, 14 Apr 2008 13:26:33 -0700

David Daney wrote:

Ian Rogers wrote:
Hi,
please give your comments on the attached patch. It tries to reducethe size of char[] for strings used to hold numbers. It changesFloat/Double equals to use bit based comparisons rather thandivision. It increases the use of valueOf methods. It adds a cache ofvalues from -128 to 127 for Long. It adds a cache of the values ofzero and one to Float and Double.
The string size is an estimate. For decimal numbers it will dividethe value repeatedly by 8, causing the string length to be overestimated by a character for values like 999. This string size isstill better than the current estimate of 33 characters. It alsoavoids the use of division (shifts are used) and/or lookup tables.
I would like to know your motivation for doing this. Do you have anyevidence that this will reduce memory usage and speed up realapplications?
That said, in our gcj-3.4 based application, we had to create a cacheof Integers because we were creating large numbers of them all with asmall set of values.
So in principle this could be a good approach, but I don't know if wecan assume that there is universal benefit from a patch like this.Can you point to any benchmarks where this helps?
Thanks,
David Daney

Hi David,

I'm having a crack down on wasted memory in the Jikes RVM.

For DaCapo fop (single iteration) there are 270 and 977 occurrences ofDouble 0 and 1 and 20 occurrences of other Doubles. On the other handDaCapo bloat has very few 0 and 1 values. My motivation to cache these,other than fop, is that they exist as bytecodes (fconst0/fconst1 anddconst0/dconst1, although I'm ignoring fconst2 and dconst2). We alreadycache Integers in the intCache. I do extend this concept to Long, as isdone in OpenJDK, and to Float and Double.

Currently we always allocated 33 char arrays to hold the string value,this is 4.625 the size of the minimum object in the RVM. In the case ofa single character string, 18.86% of Integer strings in DaCapo bloat,this code doesn't allocate any char arrays. For other integers the chararray is reduced to either the exact or (20% of the time for decimalvalues) 1 character longer char arrays. This is at the cost of up to 32compares, branches and shifts. For DaCapo bloat a little under 50% ofinteger strings created are for values between -128 and 127.

So the trade offs in the code are, slower Float/Double valueOf code, butfewer Float and Double objects (hopefully improving GC). A small time tocalculate string sizes vs smaller strings and less GC pressure.

For the Jikes RVM we measure performance 4 times a day [1], I introducedthis patch in r14113 and there are no peaks or troughs that appear atthis point. Given the patch is performance neutral but saves memory(although not improving GC performance for the RVM markedly) I thinkit's worth including. GC is less than 6% of execution time, so timesaved may be difficult to measure in the bigger picture (unless itpushes you under or over a particular threshold).


Regards,
Ian

[1]http://jikesrvm.anu.edu.au/cattrack/results/rvmx86lnx32.anu.edu.au/perf/3437/performance_report

Re: [cp-patches] RFC: changes to java.lang.Integer, Long...

Reply via email to