I uploaded a new webrev that includes most of the changes you suggested.
Conversion of long[] from Java now works without losing precision, using
int, double, or Object arrays. I also added a test for this.
http://cr.openjdk.java.net/~hannesw/8144020/webrev.01/
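The narrowing logic is essentially this (just a sketch with a
hypothetical helper name; the actual change reuses the existing
ArrayData conversion code):

    // Copy a long[] into the narrowest array type that preserves
    // every element exactly: int[], else double[], else Object[].
    static Object copyLongArray(final long[] arr) {
        boolean fitsInt = true;
        boolean fitsDouble = true;
        for (final long l : arr) {
            fitsInt &= (int) l == l;
            // Conservative 53-bit check: every long in this range
            // is exactly representable as a double.
            fitsDouble &= -(1L << 53) <= l && l <= (1L << 53);
        }
        if (fitsInt) {
            final int[] ints = new int[arr.length];
            for (int i = 0; i < arr.length; i++) {
                ints[i] = (int) arr[i];
            }
            return ints;
        }
        if (fitsDouble) {
            final double[] doubles = new double[arr.length];
            for (int i = 0; i < arr.length; i++) {
                doubles[i] = arr[i];
            }
            return doubles;
        }
        final Object[] boxed = new Object[arr.length];
        for (int i = 0; i < arr.length; i++) {
            boxed[i] = arr[i];   // autoboxes as java.lang.Long
        }
        return boxed;
    }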
I didn't implement the int/double overloading of array iterator actions.
Unless I missed something, I would have to implement two forEach methods
in each subclass, which seems ugly and error-prone.
Additionally, I removed the ArrayData.set method that takes a long
value, something I had overlooked in my previous patch.
Hannes
On 2015-12-06 at 11:12, Hannes Wallnoefer wrote:
Thanks for the quick review, Attila. Answers inline.
On 2015-12-04 at 18:39, Attila Szegedi wrote:
* In CodeGenerator SHR implementations (both self-assign and
ordinary) you have method.shr() in loadStack instead of consumeStack.
I was actually staring at this for a while as it seemed wrong to
perform an operation in loadStack, but in the end I decided it’s okay
like this. After all, it’s the toUint32 that’s the optimistic part
here, so this should be fine indeed. I think we had a JIRA issue
saying “make SHR optimistic” but I can’t find it now. If it pops up,
we can mark it as a duplicate of this one.
I've looked for that Jira issue but didn't find it either.
* I see "assert storeType != Type.LONG;” do we even need Type.LONG
and LongType class anymore?
That assert is a leftover from the conversion process; it shouldn't be
needed anymore. We do still use Type.LONG for creating and handling
the primitive fields and spill slots with dual fields. That's why I
had to keep it.
* Symbol.java: you could reclaim the HAS_LONG_VALUE bit by shifting
the rest down by one
Will do.
* optimization idea: have versions of callback invokers in
NativeArray.java for both int and double indices. Since we know the
length of the array when we enter forEach etc. we could select the
double version when length > maxint and the int version otherwise.
Actually, we could even have IteratorAction.forEach be overloaded for
int and double, and write the body of IteratorAction.apply() to start
out with the int version, and when the index crosses maxint start
calling the double version (so even for large arrays we’ll iterate
calling the int specialization of the function in cases where iteration
is short-circuited before crossing maxint).
Nice idea, and should be easy to implement. I'll try it out.
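Something along these lines, I suppose (a sketch only; the class and
method names here are hypothetical stand-ins for the IteratorAction
code):

    // Iterate with int indices while possible, switching to the double
    // overload once the index exceeds Integer.MAX_VALUE. Every concrete
    // action would have to implement both overloads.
    abstract class IteratorActionSketch {
        protected abstract boolean forEach(Object element, int index);
        protected abstract boolean forEach(Object element, double index);

        public void apply(final java.util.function.LongFunction<Object> getElement,
                          final long length) {
            long index = 0;
            // Fast path: indices fit in a signed int
            for (; index < length && index <= Integer.MAX_VALUE; index++) {
                if (!forEach(getElement.apply(index), (int) index)) {
                    return;   // short-circuited before crossing maxint
                }
            }
            // Slow path: large indices use the double specialization
            for (; index < length; index++) {
                if (!forEach(getElement.apply(index), (double) index)) {
                    return;
                }
            }
        }
    }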
* array length: could we still have Nashorn APIs that return long?
Optimistic filters will deal with these appropriately, won’t they? I
guess they should since they also need to be able to handle return
values from POJO methods that return long (e.g.
System.currentTimeMillis()). Hence, you could have NativeArray.length
return “long” and let the optimistic machinery decide whether to cast
it as int or double. That would allow you to not have to box the
return value of NativeArray.length.
Yes, we could have things returning long, but they will deoptimize to
Object. OptimisticReturnFilters (which do the runtime checks) are not
used for ScriptObject properties.
* NativeNumber: unused import?
Fixed.
* Uint32ArrayData: getBoxedElementType went from INTEGER to DOUBLE.
I’m not sure I understand that. I mean, was INTEGER incorrect before?
That was obviously incorrect before. Actually, that method is
only used in NativeArray#concat and will never be invoked on typed
arrays. Looking at NativeArray#concat, it looks a bit fishy to me in
that it assumes all NativeArrays use ContinuousArrayData. I have to
investigate this further.
Back to the issue at hand, int.class/Integer.class is definitely wrong
as the element type for Uint32. When returning int.class in
getElementType, optimistic code that uses the optimistic int getter
gets incredibly slow when it actually deoptimizes to double, because
we keep trying to handle elements as ints. (I had this in my code at
one point and found pdfjs slowed to a crawl when changing the
optimistic int getter to always deoptimize to double.)
Probably getBoxedElementType should just be a final method instead of
abstract in ContinuousArrayData, converting getElementType to its
boxed counterpart on the fly.
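Something like this (a sketch; only the ContinuousArrayData and
getElementType names are real):

    abstract class ContinuousArrayDataSketch {
        public abstract Class<?> getElementType();

        // Derive the boxed type from getElementType() on the fly instead
        // of overriding it per subclass; with longs gone, only int,
        // double and Object element types remain.
        public final Class<?> getBoxedElementType() {
            final Class<?> elementType = getElementType();
            if (elementType == int.class) {
                return Integer.class;
            } else if (elementType == double.class) {
                return Double.class;
            }
            return Object.class;
        }
    }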
* You didn’t remove LongArrayData.java?
I think I did:
http://cr.openjdk.java.net/~hannesw/8144020/webrev.00/src/jdk.scripting.nashorn/share/classes/jdk/nashorn/internal/runtime/arrays/LongArrayData.java.patch
* It looks like long[] arrays can now lose precision if passed
through Java.from(). E.g. if you have Java methods “long[]
getLongArray()” and “void setLongArray(long[] arr)”, then
pojo.setLongArray(Java.from(pojo.getLongArray())) will lose precision
because NativeJava.copyArray(long[]) produces double[]. Of course, we
could argue that this is expected behavior if you explicitly use
Java.from. Just passing around and manipulating a raw long[] won’t
have that effect (although it’d be good to confirm that in a test);
it requires an explicit Java.from(). Still, I wonder if it’d make
sense to have copyArray(long[]) either return Object[] or choose
dynamically between double[] and Object[] based on the maximum
magnitude of an element (you can start with double[] and reallocate
as Object[] when you bump into a large long).
Good catch. I'll see if I can use existing code in ArrayData to
convert to the narrowest array type.
Thanks!
Hannes
Other than that: great work! Nice to see large swaths of code removed.
Attila.
On Dec 4, 2015, at 4:27 PM, Hannes Wallnoefer
<hannes.wallnoe...@oracle.com> wrote:
After receiving another long/precision-related bug last week, I
decided to go ahead with the removal of longs in Nashorn. It's been
quite an effort, but I think it looks good now. Below are the links
to the webrev and Jira issue.
http://cr.openjdk.java.net/~hannesw/8144020/
https://bugs.openjdk.java.net/browse/JDK-8144020
This is a rather big patch, but it mostly removes code. There are
over 2000 deletions vs. 400 insertions. I removed all uses of long
in our code where it represented JS numbers, including ScriptObject
property accessors with longs as key or value, and the LongArrayData
class. With this, all JS numbers are represented as int or double in
Nashorn. Longs will no longer occur "naturally" and will only be
present as java.lang.Long instances.
As expected, the areas that demanded special care were those where
ES prescribes use of uint32. These are mostly unsigned right shift,
Uint32Array elements, and the length property of arrays. For right
shift and Uint32Array elements, I added optimistic implementations
that return int if possible and deoptimize to double. Pdfjs and
mandreel are benchmarks that make use of these features, and I
didn't notice any observable impact on performance. Even when I
simulated fallback to double, performance was still ok (previously
reported performance regressions for this case were in fact caused
by an error of mine).
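The shape of the optimistic >>> is roughly this (illustrative only;
the real code is generated in CodeGenerator, and ArithmeticException
stands in for the deoptimizing machinery):

    final class OptimisticShr {
        // Optimistic version: assume the uint32 result fits in a signed int.
        static int shrOptimistic(final int x, final int shift) {
            final long result = (x & 0xFFFFFFFFL) >>> (shift & 31);
            if ((int) result == result) {
                return (int) result;   // the common case
            }
            // Only reachable when (shift & 31) == 0 and the top bit of x is set
            throw new ArithmeticException("deoptimize to double");
        }

        // Fallback used after deoptimization: always exact as a double.
        static double shrDouble(final int x, final int shift) {
            return (x & 0xFFFFFFFFL) >>> (shift & 31);
        }
    }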
For the Array length property, I changed the getter in NativeArray
to return int or double depending on the actual value. Previously, the
length getter always returned long. Since the property is actually
evaluated by OptimisticTypesCalculator, for-loops with an array
length as limit now use ints instead of longs to iterate through
array indices, which is probably a good thing.
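In other words (a sketch; the real getter is in NativeArray):

    // A JS array length is a uint32, i.e. in [0, 2^32 - 1].
    static Object arrayLength(final long length) {
        if (length <= Integer.MAX_VALUE) {
            return (int) length;    // boxes as Integer
        }
        return (double) length;     // boxes as Double, still exact
    }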
As for longs returned by Java code, their value is always preserved.
Only when they are used for JS math will they be converted to double
as prescribed by ECMA. When running with optimistic types we check
if a long fits into an int or double to avoid deoptimization to
object. Previously we did this only when converting long to
optimistic int, but optimistic double did not use any return filter
for longs, so a long returned for an optimistic double could easily
lose precision.
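The filter logic boils down to this (a sketch; method names are
illustrative, and IllegalStateException stands in for the exception
that triggers deoptimization):

    final class LongReturnFilterSketch {
        // Optimistic int: narrow only when the long fits exactly.
        static int ensureInt(final long value) {
            if ((int) value == value) {
                return (int) value;
            }
            throw new IllegalStateException("deoptimize: does not fit int");
        }

        // Optimistic double: the 53-bit signed range is always exact.
        static double ensureDouble(final long value) {
            if (-(1L << 53) <= value && value <= (1L << 53)) {
                return value;
            }
            throw new IllegalStateException("deoptimize: would lose precision");
        }
    }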
You can find the related changes in OptimisticReturnFilters.java. I
also added tests to make sure long values are preserved in various
optimistic and non-optimistic contexts, some of which would
previously have failed.
A previous version of this patch also included functionality to
treat only ints and doubles or their wrapper objects as native JS
numbers (e.g. you could invoke Number.prototype methods only on ints
and doubles). However, this is quite a hairy issue and I reckoned
the patch is large enough without it, so I pulled it out and will
fix it separately as JDK-8143896.
I've been testing and refining this patch for the last few days and
think it looks pretty good. I thought it was a good idea to discuss
it in this existing thread before posting a review request. Please let
me know what you think.
Thanks,
Hannes
On 2015-11-13 at 15:06, Attila Szegedi wrote:
Well, we could just box them in that case. If we only used int and
double as primitive number types internally, then a natural
representation for a long becomes Long. Java methods returning long
could have an optimistic filter that first checks if the value fits
in an int (32-bit signed), then in a double (53-bit signed) without
loss of precision, and finally deoptimizes to Object and uses Long.
int->long->double->Object becomes int->double->Object, with longs
of too large magnitude becoming boxed.
Attila.
On Nov 13, 2015, at 2:46 PM, Jim Laskey
(Oracle)<james.las...@oracle.com> wrote:
The main thing to watch for here is that longs/Longs need to
survive unobstructed between Java calls. Remember we ran into
trouble in this area (loss of precision when using longs for
encoding).
On Nov 13, 2015, at 8:26 AM, Attila
Szegedi<attila.szeg...@oracle.com> wrote:
On Nov 13, 2015, at 10:31 AM, Hannes
Wallnoefer<hannes.wallnoe...@oracle.com> wrote:
Hi all,
I'm currently questioning our use of longs to represent numbers
in combination with optimistic typing.
I often feel that the presence of longs is more hassle than
it's worth. I'd be all for eliminating them.
After all, there's a pretty large range where long arithmetic
will be more precise than the doubles required by ECMA.
There are currently several different places in Nashorn where we
try to confine the precision of longs to 53 bits (so they aren’t
more precise than doubles). It’s a bit of a whack-a-mole where
you can’t always be sure whether you found all instances.
For context see this bug, especially the last two comments (the
original bug was about number-to-string conversion, which has
been solved in the meantime):
https://bugs.openjdk.java.net/browse/JDK-8065910
So the question is: can we use longs at all and be confident
that results won't be too precise (and thus, strictly speaking,
incorrect)?
Internally, longs are also used to represent UInt32 even in a
non-optimistic setting, which is only really significant for
the >>> operator and array indices and lengths; maybe we should
limit longs to that usage only, or even just use doubles
internally for UInt32 values that can’t be represented as Int32.
FWIW, even for the >>> operator we only need it when shifting by
zero, as in every other case the result will have the topmost bit
set to 0 and thus fit in Int32 too.
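A concrete example of that:

    public class ShrExample {
        public static void main(final String[] args) {
            final long u = 0xFFFFFFFFL;   // ToUint32(-1) == 4294967295
            System.out.println(u >>> 1);  // 2147483647: fits in Int32
            System.out.println(u >>> 0);  // 4294967295: top bit set, needs double
        }
    }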
I guess once Valhalla rolls around, we could easily have an
unsigned 32-bit int type.
Hannes
Attila.